Methodology

The process to generate this dataset involves:

  • Fetching ERA5 data in GRIB format.
  • Applying homogenization conventions, as applicable:
    • Rename spatial coordinates to latitude, longitude.
    • Ensure latitude and longitude ranges are within [-90, +90] and [-180, +180], respectively.
    • For accumulation variables (e.g. total precipitation), express time as a single dimension called time; in ERA5 netCDFs this is usually called valid_time.
    • For accumulation variables, skip any date whose number of hourly samples is incomplete (fewer than 24), e.g. 1940-01-01.
    • Drop "number", "step", "surface" and "valid_time".
  • Writing the result to an ARCO Zarr archive (which is similar in structure to netCDF).
  • Serving it through the CDS, which requests the data from the Zarr archive using xarray and writes it to netCDF or CSV (as requested).
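
Two of the homogenization rules above can be illustrated with a minimal sketch in plain Python. This is not the production pipeline code: the function names wrap_longitude and drop_incomplete_dates and the sample values are hypothetical, and the sketch assumes that source longitudes may arrive on a 0 to 360 grid. Note that the modulo approach maps +180 to -180, i.e. it yields the half-open range [-180, +180).

```python
from collections import Counter
from datetime import datetime, timedelta


def wrap_longitude(lon):
    """Map a longitude in degrees onto [-180, 180) (assumed convention)."""
    return (lon + 180.0) % 360.0 - 180.0


def drop_incomplete_dates(samples, expected=24):
    """Keep only (timestamp, value) pairs whose date has `expected` samples."""
    # Count how many hourly samples fall on each calendar date.
    counts = Counter(stamp.date() for stamp, _ in samples)
    return [
        (stamp, value)
        for stamp, value in samples
        if counts[stamp.date()] == expected
    ]


# Made-up hourly series: 17 samples on 1940-01-01, then 24 on 1940-01-02.
start = datetime(1940, 1, 1, 7)
samples = [(start + timedelta(hours=h), 0.1) for h in range(17 + 24)]

# Only the complete date (1940-01-02) survives the filter.
kept = drop_incomplete_dates(samples)
```

In this example the incomplete first date is dropped as a whole, so the series starts at the first hour of the next complete date.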
