...
Methodology
The process used to generate this dataset involves:
- Fetch ERA5 data in GRIB format.
- Apply homogenization conventions, as applicable.
- Rename the spatial coordinates to latitude and longitude.
- Ensure latitude and longitude values fall within [-90, +90] and [-180, +180], respectively.
- For accumulation variables (e.g. total precipitation), time is expressed as a single dimension named time; in ERA5 netCDF files this is usually called valid_time.
- For accumulation variables, a date is skipped if its hourly samples are incomplete, i.e. fewer than 24 (e.g. 1940-01-01).
- Drop "number", "step", "surface" and "valid_time".
- Write the result to an ARCO Zarr archive (which is similar in structure to netCDF).
- The CDS requests the data from the Zarr archive using xarray and writes the data to netCDF or CSV (as requested).
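
Two of the steps above, the longitude range convention and the skipping of incomplete days, can be illustrated with a minimal, dependency-free Python sketch. This is not the code used to produce the dataset: the function names, the sample inputs, and the choice to map +180 to -180 (a half-open range) are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def wrap_longitude(lon):
    """Map a longitude in degrees to the half-open range [-180, 180).

    Assumption: +180 is folded onto -180; the real pipeline may keep +180.
    """
    return (lon + 180.0) % 360.0 - 180.0


def complete_days(timestamps, samples_per_day=24):
    """Return the sorted dates that have at least `samples_per_day` timestamps."""
    counts = defaultdict(int)
    for ts in timestamps:
        counts[ts.date()] += 1
    return sorted(day for day, n in counts.items() if n >= samples_per_day)


# Hypothetical input: longitudes on a 0..360 grid.
lons = [0.0, 90.0, 180.0, 270.0, 359.75]
wrapped = [wrap_longitude(lon) for lon in lons]

# Hypothetical input: 18 hourly samples on the first day, 24 on the second.
start = datetime(1940, 1, 1, 6)
hours = [start + timedelta(hours=h) for h in range(18 + 24)]
kept = complete_days(hours)

print(wrapped)
print([day.isoformat() for day in kept])
```

In this sketch the first day has only 18 hourly samples, so it is dropped, which mirrors the way an incomplete date such as 1940-01-01 is skipped for accumulation variables.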
...