You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »


Since October 25th 2017,  the time period you can select on the ECMWF Web User Interface for ERA-Interim is limited, namely for daily data to one month, and for monthly data to one year. The purpose of the ECMWF web user  interface is only to provide small sample datasets for evaluation. Data over longer time periods should not be downloaded through the web interface, but using the ECMWF WebAPI with Python scripts as described below.

Please also note that the web user interface does not list all available data. For listings of all available data please see the ECMWF archive catalogue and the respective ERA-Interim dataset documentation.


1. Learn about ERA-Interim data

  1. Read What is ERA-Interim.
  2. Browse the ERA-Interim catalogue and make sure the data you are interested in is available in ERA-Interim. A demonstration (for ERA5 but the same principle applies for ERA-Interim) is available as part of the video tutorial below:

    Full Transcript (pdf)

  3. Read the ERA-Interim data documentation and take note of the parameters (fields) you want to use, and of the corresponding values for data type, stream, level type (surface, pressure levels, model levels, etc) base time and forecast steps.

2. User account and License agreement

  1. The ERA-Interim data is hosted at ECMWF. To access the ERA-Interim data you need a user account with ECMWF, so if you do not have a account, register a ECMWF account now.
  2. Read the Conditions of Use and accept the agreement at the bottom the page. This is necessary so we have a record of you accepting the license agreement.

3. Technical prerequisites

You need:

  • A computer with a *nix operating system is highly recommended. Microsoft Windows is known to work too, but is unsupported, see here for details.
  • The Python programming language. If you do not have it installed yet, install Python now.

4. Set up your computer

This is to configure your computer to programmatically retrieve data from ECMWF. You only need to do this once, and only if you have never retrieved ECMWF data using Python before. For more information refer to Access ECMWF Public Datasets of ECMWF WebAPI.

1. Install the ECMWF WebAPI client library by running:

 

sudo pip install https://software.ecmwf.int/wiki/download/attachments/56664858/ecmwf-api-client-python.tgz

2. Get and install an ECMWF API key:

  1. Go to https://apps.ecmwf.int/auth/login/ to verify you are logged in. If you are not, log in with your ECMWF account details (as registered in step 2).

  2. Go to https://api.ecmwf.int/v1/key/ . You will see something like below:

 

{
    "url"   : "https://api.ecmwf.int/v1",
    "key"   : "XXXXXXXXXXXXXXXXXXXXXX",
    "email" : "john.smith@example.com"
}

3. Copy the bit with the curly brackets as shown above into a text file and save the file as $HOME/.ecmwfapirc (on Unix/Linux) or %USERPROFILE%\.ecmwfapirc (on Windows)

5. Run a test

This is to verify that your computer is set up correctly.

1. Copy the following Python script to a text file and save it, for example as 'my_ERA-Interim_test_script.py'

#!/usr/bin/env python
from ecmwfapi import ECMWFDataServer

server = ECMWFDataServer()
server.retrieve({
    "class": "ei",
    "dataset": "interim",
    "expver": "1",
    "stream": "oper",
    "type": "fc",
    "levtype": "sfc",
    "param": "167.128",
    "date": "2017-08-01/to/2017-08-31",
    "time": "00:00:00",
    "step": "3",
    "grid": "0.75/0.75",
    "target": "my_ERA5_test_file.grb",
 })

2. Run the script. On most computers you would do this by opening a command prompt and typing

python my_ERA-Interim_test_script.py

The test request will download data from ECMWF and save as file 'my_ERA-Interim_test_file.grb' in the directory you issued the command from.

Note that the data retrieval is not designed to be instant. The test request above usually takes a few minutes to complete, but depending on demand can take longer.

3. With your favourite GRIB file reader verify that you can read the file. If you are not familiar with GRIB please see What are GRIB files and How to read or decode a GRIB file

If you get an error message or the output is not as specified in to the Python script, most likely your computer setup is wrong, or you did not accept the ERA5 license, so please go back to the previous steps.

6. Create your data retrieval script

  1. Browse the ERA-Interim data catalogue for the data you are interested in, and in the last step make a selection in all boxes and click 'View the MARS request'. This shows a template Python script with your selected options.
  2. Copy the Python script to a text file and save it, for example as 'my_ERA-Interim_script_v1.py'
  3. If necessary, adapt the Python script to your requirements,e.g. change date, time and grid.
    1. For an output in netcdf format, specify "format" as "netcdf"
    2. If you specify "format" as "netcdf", then you will need to set "grid" e.g. "grid": "0.3/0.3". Otherwise, GRIB to netCDF conversion will fail. 
    3. To subset to a geographical area, specify "area" as N/W/S/E in geographic lat/long degrees. Southern latitudes and western longitudes must be given as negative numbers e.g. "area": "75/-20/10/60". Requires "grid" to be set to a regular grid e.g. "grid": "0.3/0.3".
    4. Change the "target": "CHANGEME", to the desired output path and file name, e.g. "data1.nc" or "./data/data1.grib". The default path is the current working directory.

Limitations:

  • You can have a maximum of three active requests at a time.
  • The maximum data volume is 20GB per request.
  • The maximum number of fields to be retrieved is 600,000 fields per request.

Note that the data retrieval is not designed to be instant. A larger request can take hours and even days to complete.

To retrieve data efficiently (and get your data quicker!) you should retrieve all the data you need from one tape, then from the next tape, and so on. In most cases this means retrieving all the data you need for one month, then for the next month, and so on. See Retrieval efficiency pages.


7. Run your data retrieval script

On most computers you would do this by opening a command prompt and typing

python my_ERA-Interim_script_v1.py

This will retrieve the data as specified and download it as a single file to your computer into the current directory (or whichever "target" directory you specified).

For long running processes you can check the progress of your request in your job list.

8. Check results

Check that the data you downloaded meets your requirements.

To report an issue or bug please contact Copernicus Support at ECMWF

 

  • No labels