Issued by: CSIC
Date: 15/12/2023 (updated on )
Service contract: 2022/C3S2_381_Lot1_Contractor/SC1
1. Introduction
This document describes the "Copernicus Interactive Climate Atlas gridded monthly dataset" (C3S Atlas Dataset in short) which constitutes an evolution of the IPCC AR6 Atlas gridded monthly dataset, also published in the CDS1. This dataset has been entirely produced using climate information fully available in C3S CDS and a processing workflow available from the GitHub C3S Atlas repository2 (including the underlying software and illustrative notebooks). The dataset underpins the different products visualised in the Copernicus Interactive Climate Atlas Application (C3S Atlas Application in short), available at https://atlas.climate.copernicus.eu
The dataset available in the CDS at https://cds.climate.copernicus.eu/datasets/multi-origin-c3s-atlas is always the latest version, which is version 2 (v2) at the moment. Information on changes with respect to previous versions is available at Sections 3 and 6 of this document.
In addition to describing the dataset, we summarise the fit-for-purpose quality control and curation applied, based on insights from basic products (e.g., maps and climate stripes), and outline the archiving metadata and standards employed.
2. CDS catalogues, variables and harmonization
2.1. CDS catalogues and datasets used
The Copernicus Interactive Climate Atlas gridded monthly dataset integrates information from a range of climatic observations, reanalysis products and projection datasets. Table 1 lists the datasets and the corresponding CDS catalogues used (datasets included in the latest version – v2 – are highlighted in red). Version v2 includes three new observational datasets: NOAA Climate Prediction Center dataset (CPC) for precipitation, Berkeley Earth (BERKEARTH in short) for air temperatures, and the European Space Agency (ESA) SST Climate Change Initiative (SST-CCI) project for sea surface temperature. In addition, version v2 includes the regional reanalyses CERRA (adding extra details on ERA5 with the use of a high resolution limited area model for downscaling including data assimilation), an additional high-resolution source of information to assess recent changes over Europe.
Note that the CORDEX catalogue comprises two distinct datasets with different horizontal spatial resolution3: (1) CORDEX-CORE, a homogeneous subset of CORE4 simulations at 0.22° and (2) CORDEX-EUROPE, covering Europe at a finer 0.11° resolution. To harmonise the spatial grids across datasets within the Atlas, a set of common nested regular sub-grids (2°, 1°, 0.5°, 0.25°, 0.125°, and 0.0625°) were used to (conservatively) interpolate some of the original datasets, particularly multi-model ensembles. However, certain products, such as ERA5-Land and ERA55, are retained on their original regular grids provided by the CDS to preserve their native resolution. The reference grids used for the various datasets, along with additional auxiliary information, are available in the GitHub C3S Atlas repository6.
Some of the datasets available in the CDS are distributed across multiple catalogues, depending on temporal resolutions (hourly, daily, or monthly) or subsets of variables (e.g. single-level or multi-level). To produce the Atlas dataset, we selected the most appropriate catalogue for each variable or index (see Table 2), guided by the principle that primary variables and their derived indices (e.g. maximum temperature and number of days with maximum temperature over 35°C) should originate from the same source catalogue. For example, maximum temperature (TX) is needed at daily resolution to compute derived indices; therefore, for ERA5, the hourly catalogue (Reanalysis-era5-single-levels) was used for this and other related variables. In contrast, variables such as sea surface temperature, which do not require high temporal resolution, were sourced from the monthly catalogue (Reanalysis-era5-land-monthly-means). Table 1 uses colour coding to indicate the temporal resolution of the original datasets in the CDS catalogues.
Table 1: Description of the projection, reanalyses and observational datasets and CDS catalogue entries used in the C3S Atlas Dataset. Colour codes indicate the different temporal resolutions (hourly, daily, and monthly, one or several for each catalogue). The column “horizontal resolution” indicates the different grids used (products regridded to the common regular sub-grids: 2°, 1°, 0.5°, 0.25°, 0.125°, 0.0625° are indicated with an asterisk). Note that the periods of the Atlas dataset correspond to full years. BERKEARTH stands for Berkeley Earth Foundation dataset, CPC for NOAA Climate Prediction Center datasets, and SST-CCI for European Space Agency (ESA) SST Climate Change Initiative (CCI) project. Red colour indicates new elements in version 2 (latest version) of the dataset (v2).
Type | Datasets | Horizontal resolution | Time period | CDS-catalogues |
Proj. | CMIP6 | 1° (*) | Historical: 1850 - 2014 Scenarios: 2015 - 2100 | Projections-cmip6 [daily/mon] |
CMIP5 | 2° (*) | Historical: 1850 - 2005 Scenarios: 2006 - 2100 | Projections-cmip5-daily-single-levels [daily] Projections-cmip5-monthly-single-levels [mon] | |
CORDEX-CORE | 0.25° (*) | Historical: 1970 - 2005 Scenarios: 2006 - 2100 | Projections-cordex-domains-single-levels [daily/mon] | |
CORDEX-EUR-11 | 0.125° (*) | Historical: 1970 - 2005 Scenarios: 2006 - 2100 | ||
Rea. | ERA5 | 0.25° | 1940 - 2024 | Reanalysis-era5-single-levels [hourly] Reanalysis-era5-single-levels-monthly-means [mon] |
ERA5-Land | 0.1° | 1950 - 2024 | Reanalysis-era5-land [hourly] Reanalysis-era5-land-monthly-means [mon] | |
CERRA | 0.0625° (*) | 1985-2021 | Reanalysis-cerra-single-levels [hourly] | |
ORAS5 | 0.25° (*) | 1958 - 2014 | Reanalysis-oras5 (consolidated, single levels) [daily] | |
Obs. | E-OBS | 0.125° (*) | 1950 - 2021 | Insitu-gridded-observations-europe (0.1.deg) [daily] |
BERKEARTH | 1° | 1960-2017 | insitu-gridded-observations-global-and-regional [daily] | |
CPC | 0.5° | 1979-2020 | ||
SST-CCI | 0.05° | 1982-2022 | satellite-sea-surface-temperature [daily] |
Data harmonisation across the various datasets and catalogues is critical aspect of this data preparation work. For example, ERA5 and ERA5-Land do not provide daily data directly, requiring daily values to be derived from hourly data. However, both datasets use different conventions for encoding hourly data: ERA5 provides hourly accumulated values, while ERA5-Land provides accumulations from the start of the forecast (00 UTC) over a 24-hour period. As a result, different post-processing steps are needed to compute consistent daily values7. These differences must be carefully addressed during harmonisation to ensure that all variables share a consistent definition, temporal aggregation method, and unit standardisation (see Section 2.3).
2.2. Variables and indices
Table 2 shows the set of 35 variables included in the dataset, comprising both primary variables (highlighted in bold) and derived indices (see Table 4 for details). This expands the 21 climate variables and indices featured in the original IPCC AR6 Atlas gridded monthly dataset. These variables are organised in different Climatic Impact Driver (CID) categories: heat and cold, wet and dry, drought, wind and radiation, snow and ice, ocean, and circulation.
Table 2: Description of the climate variables and indices included in the second version of the C3S Atlas (v2), with new additions and changes relative to version 1 (v1) highlighted in red. The first column indicates the Climatic Impact Drivers (CID) categories: heat and cold, wet and dry, drought, wind and radiation, snow and ice, ocean, and circulation. The “bias adjustment (BA)” column indicates the bias adjustment methods applied to each index, either simple linear scaling (“S”), the ISIMIP method (“I”), or both. A bias correction selection widget has been added to the CDS form, and in the files this is indicated with 'bals' and 'baisimip' being appended to the variable code. Primary variables are shown in bold, while derived indices appear in regular font (see Table 4 for details). The column "CDS Names" indicates the names used in the CDS catalogue of the C3S Atlas Dataset and further details on thresholds and temporal aggregation are provided in the "Description" column.
CID | # | Code | CDS Name | Description | Units | BA |
heat and cold | 1 | t | Monthly temperature | Monthly mean of daily mean temperature | °C | |
2 | tn | Monthly daily minimum temperature | Monthly mean of daily minimum temperature | °C | ||
3 | tx | Monthly daily maximum temperature | Monthly mean of daily maximum temperature | °C | ||
4 | dtr | Monthly daily temperature range | Monthly mean of daily temperature range | °C | ||
5 | txx | Monthly maximum of daily maximum temperature | Monthly maximum of daily maximum temperature | °C | ||
6 | tx35 | Monthly extreme hot days | Monthly count of extreme hot days (maximum temperature above 35 degC) | days | S,I | |
7 | tx40 | Monthly very extreme hot days | Monthly count of very extreme hot days (maximum temperature above 40 degC) | days | S,I | |
8 | tr | Monthly tropical nights | Monthly count of tropical nights (minimum temperature above 20 degC) | days | S,I | |
9 | cd | Annual cooling degree-days | Annual cooling degree-days, as defined in IPCC AR6 | °C day | S,I | |
10 | tnn | Monthly minimum of daily minimum temperature | Monthly minimum of daily minimum temperature | °C | ||
11 | fd | Monthly frost days | Monthly count of frost days (minimum temperature below 0 degC) | days | S,I | |
12 | hd | Annual heating degree-days | Annual heating degree-days, as defined in IPCC AR6 | °C day | S,I | |
wet and dry | 13 | r | Monthly precipitation | Monthly mean of daily accumulated precipitation | mm | |
14 | r01 | Monthly wet days | Monthly count of wet days (precipitation above 1 mm) | days | I | |
15 | sdii | Monthly precipitation intensity | Monthly mean of daily precipitation on wet days | mm | I | |
16 | rx1day | Monthly maximum 1-day precipitation | Monthly maximum of 1-day accumulated precipitation | mm | ||
17 | rx5day | Monthly maximum 5-day precipitation | Monthly maximum of 5-day accumulated precipitation | mm | ||
18 | r10 | Monthly heavy precipitation days | Monthly count of heavy precipitation days (above 10 mm) | days | I | |
19 | r20 | Monthly very heavy precipitation days | Monthly count of very heavy precipitation days (above 20 mm) | days | I | |
20 | huss | Monthly near surface specific humidity | Monthly mean near surface specific humidity | g kg-1 | ||
21 | evspsbl | Monthly evaporation | Monthly mean of daily accumulated evaporation | mm | ||
22 | mrro | Monthly runoff | Monthly mean of daily accumulated runoff | mm | ||
drought | 23 | mrsos | Monthly soil moisture in upper soil portion | Monthly mean soil shallow moisture content | kg m-2 | |
24 | cdd | Annual maximum consecutive dry days | Annual maximum consecutive dry days (below 1 mm) | days | I | |
25a | spi6 | Monthly Standardised Precipitation Index for 6 months cumulation period | Monthly Standardised Precipitation Index (SPI) for 6 months cumulation period | Dimensionless | ||
25b | spi6 | Monthly Standardised Precipitation Index for 6 months cumulation period normalised by the full period | Monthly Standardised Precipitation Index (SPI) for 6 months cumulation period (normalised by the full period) | Dimensionless | ||
26a | spei6 | Monthly Standardised Precipitation-Evapotranspiration Index for 6 months cumulation period | Monthly Standardised Precipitation-Evapotranspiration Index (SPEI) for 6 months cumulation period (Hargreaves method, 1985) | Dimensionless | ||
26b | spei6 | Monthly Standardised Precipitation-Evapotranspiration Index for 6 months cumulation period normalised by the full period | Monthly Standardised Precipitation-Evapotranspiration Index (SPEI) for 6 months cumulation period (Hargreaves method, 1985; normalised by the full period) | Dimensionless | ||
27 | pet | Monthly accumulated potential evapotranspiration | Monthly mean of daily accumulated potential evapotranspiration (Hargreaves method, 1985) | mm | ||
wind and radiation | 28 | sfcwind | Monthly wind speed | Monthly mean of daily mean wind speed | m s-1 | |
29 | clt | Monthly cloud cover percentage | Monthly mean cloud cover area percentage | % | ||
30 | rsds | Monthly surface solar radiation downwards | Monthly mean surface solar radiation downwards | W m-2 | ||
31 | rlds | Monthly surface thermal radiation downwards | Monthly mean surface thermal radiation downwards | W m-2 | ||
snow and ice | 32 | prsn | Monthly snowfall precipitation | Monthly mean of daily accumulated snowfall precipitation | mm | |
33 | siconc | Monthly sea-ice area percentage | Monthly mean sea-ice area percentage | % | ||
ocean | 34 | sst | Monthly sea surface temperature | Monthly mean sea surface temperature | °C | |
circulation | 35 | psl | Monthly sea level pressure | Monthly average air pressure at mean sea level | hPa |
Table 3 lists the variables from Table 2 that are computed across the different datasets referenced in Table 1. Note that bias-adjusted indices are calculated only for climate projection datasets (currently CMIP5/6, CORDEX-CORE and CORDEX-EUR-11) and not for reanalysis or observational datasets.
Table 3. Description of variables/indices (in rows) calculated for the different datasets (in columns). Data not available is indicated by ‘-’ and data not considered due to insufficient data (only available for a reduced subset) is indicated by (*). The table indicates the availability for the two versions of the SPI and SPEI indices (see Section 2.3 for details).
# | VAR | CMIP6 | CMIP5 | CORDEX- EUR-11 | CORDEX- CORE | ERA5 | ERA5- Land | CERRA | ORAS5 | E-OBS | BERK | CPC | SST |
1 | t | X | X | X | X | X | X | X | - | X | X | - | - |
2 | tn | X | X | X | X | X | X | X | - | X | X | - | - |
3 | tx | X | X | X | X | X | X | X | - | X | X | - | - |
4 | dtr | X | X | X | X | X | X | X | - | X | X | - | - |
5 | txx | X | X | X | X | X | X | X | - | X | X | - | - |
6a | tx35 | X | X | X | X | X | X | X | - | X | X | - | - |
6b | tx35bals | X | X | X | X | - | - | - | - | - | - | - | - |
6c | tx35baisimip | X | X | X | X | - | - | - | - | - | - | - | - |
7a | tx40 | X | X | X | X | X | X | X | - | X | X | - | - |
7b | tx40bals | X | X | X | X | - | - | - | - | - | - | - | - |
7c | tx40baisimip | X | X | X | X | - | - | - | - | - | - | - | - |
8a | tr | X | X | X | X | X | X | X | - | X | X | - | - |
8b | trbals | X | X | X | X | - | - | - | - | - | - | - | - |
8c | trbaisimip | X | X | X | X | - | - | - | - | - | - | - | - |
9a | cd | X | X | X | X | X | X | X | - | X | X | - | - |
9b | cdbals | X | X | X | X | - | - | - | - | - | - | - | - |
9c | cdbaisimip | X | X | X | X | - | - | - | - | - | - | - | - |
10 | tnn | X | X | X | X | X | X | X | - | X | X | - | - |
11a | fd | X | X | X | X | X | X | X | - | X | X | - | - |
11b | fdbals | X | X | X | X | - | - | - | - | - | - | - | - |
11c | fdbaisimip | X | X | X | X | - | - | - | - | - | - | - | - |
12a | hd | X | X | X | X | X | X | X | - | X | X | - | - |
12b | hdbals | X | X | X | X | - | - | - | - | - | - | - | - |
12c | hdbaisimip | X | X | X | X | - | - | - | - | - | - | - | - |
13 | r | X | X | X | X | X | X | X | - | X | - | X | - |
14a | r01 | X | X | X | X | X | X | X | - | X | - | X | - |
14b | r01baisimip | X | X | X | X | - | - | - | - | - | - | - | - |
15a | sdii | X | X | X | X | X | X | X | - | X | - | X | - |
15b | sdiibaisimip | X | X | X | X | - | - | - | - | - | - | - | - |
16 | rx1day | X | X | X | X | X | X | X | - | X | - | X | - |
17 | rx5day | X | X | X | X | X | X | X | - | X | - | X | - |
18a | r10 | X | X | X | X | X | X | X | - | X | - | X | - |
18b | r10baisimip | X | X | X | X | - | - | - | - | - | - | - | - |
19a | r20 | X | X | X | X | X | X | X | - | X | - | X | - |
19b | r20baisimip | X | X | X | X | - | - | - | - | - | - | - | - |
20 | huss | X | X | X | X | - | - | - | - | - | - | - | - |
21 | evspsbl | X | X | X | X | X | X | - | - | - | - | - | - |
22 | mrro | X | X | X | - | X | X | - | - | - | - | - | - |
23 | mrsos | X | X | - | - | X | X | - | - | - | - | - | - |
24a | cdd | X | X | X | X | X | X | X | - | X | - | X | - |
24b | cddbaisimip | X | X | X | X | - | - | - | - | - | - | - | - |
25a | spi6 | X | X | X | X | X | X | X | - | X | - | X | - |
25b | spi6fullperiod | X | X | X | X | - | - | - | - | - | - | - | - |
26a | spei6 | X | X | X | X | X | X | X | - | X | - | - | - |
26b | spei6fullperiod | X | X | X | X | - | - | - | - | - | - | - | - |
27 | pet | X | X | X | X | X | X | X | - | X | - | - | - |
28 | sfcwind | X | X | X | X | X | X | X | - | X | - | - | - |
29 | clt | X | X | X | (*) | X | - | X | - | - | - | - | - |
30 | rsds | X | X | X | X | X | X | X | - | X | - | - | - |
31 | rlds | X | X | X | X | X | X | X | - | - | - | - | - |
32 | prsn | X | X | - | - | X | X | - | - | - | - | - | - |
33 | siconc | X | X | - | - | X | - | - | X | - | - | - | - |
34 | sst | X | X | - | - | X | - | - | X | - | - | - | X |
35 | psl | X | X | X | (*) | X | - | X | - | X | - | - | - |
2.3. Index calculation
Most variables and indices included in the C3S Atlas Dataset, defined in Table 2, are derived from straightforward temporal aggregations or threshold-based computations. However, several more complex cases are described below.
Cooling degree-days (CD) and Heating degree-days (HD) have been implemented following the definition used in the IPCC AR6 Atlas (IPCC, 2021: Annex VI: Climatic Impact-driver and Extreme Indices).
The Standardised Precipitation Index (SPI-6) is a monthly index that compares 6-month accumulated precipitation against the long-term distribution for the same location and accumulation period, as the number of standard deviations from the median. Similarly, the Standardised Precipitation Evapotranspiration Index (SPEI-6) compares the 6-month accumulation of precipitation minus potential evapotranspiration (PET) to its long-term distribution. In version 2 of the dataset, the definition of PET (and consequently the SPEI index) has been updated to use the Hargreaves formulation (Hargreaves, 1994), replacing the Thornthwaite method used in v1. These indices were computed using using the Python-based xclim software. The updated PET component is provided alongside the SPEI index in the new version of the dataset.
Both SPI and SPEI involve standardisation using a specific reference period to compute long-term distributions. The standard reference period used is 1971-2010 for observations, reanalyses, and projections. In addition, the full 1971-2100 period is used for projections as an alternative reference period providing both versions in the dataset. In the C3S Atlas viewer the combination of the two normalisations are used (see more details in the C3S Atlas User Guide). Both approaches, using a historical reference periods and pooling historical and future scenario data to define a reference are well established practices in the literature to analyse future projections (e.g. Thota et al. 2025 and Spinoni et al. 2020, respectively). Therefore, the Atlas dataset provides both so they can be used for different purposes.
Full details of the processing workflow (including index calculation) are available in the GitHub C3S Atlas repository which includes the software and notebooks documenting the entire workflow, ensuring reproducibility and facilitating reuse.
2.4. Harmonisation of variables across catalogues
Table 4 details the specific CDS variables from the catalogues displayed in Table 1 that were used to construct the variables and indices presented in Table 2. If focuses only on the primary climate variables (excluding derived indices) which are highlighted in bold in Table 2. These raw variables originate from different catalogues, each with varying temporal resolutions, units and definitions8. The table also outlines the harmonisation and conversion processes applied to ensure consistent definitions and units across the dataset. The first column provides the variable code along with its standard name and units. The second column lists the CDS variable names and describes the harmonisation procedures applied across datasets to obtain standardised variables and units. Colour coding reflects the temporal aggregation (hourly, daily and monthly) of the catalogue from which each variable was sourced. N/C indicates variables that were "Not Considered" for some particular reason.
In addition to the name and unit standardisation described in Table 2, the harmonisation pre-processing also includes calendar conversion to a standard calendar9 and the use of a consistent coordinate system (longitude range from -180° to 180°). Furthermore, all variables and indices derived from the source datasets are interpolated onto common grids to produce the final harmonised dataset.
Table 4: Harmonisation work across the different catalogues carried out for the variables of the dataset. Colour codes indicate the temporal resolutions used for the different datasets available in the CDS catalogues, which correspond to the colours indicated in Table 1 (green for hourly, blue for daily and purple for monthly). Unit harmonisation (when needed) is indicated with "==>".
Code | Standard name | (units) | From raw input data to harmonized results (standard units) |
t near_surface_air_temperature | CMIP6: near_surface_air_temperature (K) ==> -273.15 (°C) |
tn daily_minimum_near_surface_air_temperature | CMIP6: daily_minimum_near_surface_air_temperature (K) ==> -273.15 (°C) |
tx daily_maximum_near_surface_air_temperature
| CMIP6: daily_maximum_near_surface_air_temperature (K) ==> -273.15 (°C) |
pr precipitation | CMIP6: precipitation (kg m-2 s-1) ==> *86400 (mm) |
huss near_surface_specific_humidity | CMIP6: near_surface_specific_humidity (1) |
evspsbl evaporation_inc_sublimation_and_transpiration | CMIP6: evaporation_including_sublimation_and_transpiration (kg m-2 s-1) ==>*86400 (mm) |
mrsos mass_content_of_water_in_soil | CMIP6: moisture_in_upper_portion_of_soil_column (kg m-2) |
mrro runoff_amount | CMIP6: total_runoff (kg m-2 s-1) ==> *86400 (mm) |
prsn snowfall_flux | CMIP6: snowfall_flux (kg m-2 s-1) ==> *86400 (mm) |
siconc sea_ice_area_percentage_on_ocean_grid | CMIP6: sea_ice_area_percentage_on_ocean_grid (%) |
sfcwind near_surface_wind_speed | CMIP6: near_surface_wind_speed (m s-1) |
clt total_cloud_cover_percentage | CMIP6: total_cloud_cover_percentage (%) |
rsds surface_downwelling_shortwave_radiation | CMIP6: surface_downwelling_shortwace_radiation (W m-2) |
rlds surface_downwelling_longwave_radiation | CMIP6: surface_downwelling_longwave_radiation (W m-2) |
sst sea_surface_temperature | CMIP6: sea_surface_temperature (K) ==> -273.15 (°C) |
psl sea_level_pressure | CMIP6: sea_level_pressure (Pa) => /100 (hPa) |
2.5. Available ensembles for climate projection products
The global and regional climate projection datasets listed in Table 1 are based on multi-model ensembles that include simulations from both historical and future scenarios (RCPs or SSPs). To compute climate change signals, simulations from both time periods are required. As a result, the Atlas dataset includes only those models that provide data for the historical period and at least one future scenario. Figures 1, 2 and 3 show the final ensembles used for each dataset (CMIP6, CMIP5, and CORDEX-EUR-11) organised by variable (shown in columns). Asterisks indicate variables used to compute derived indices (see Table 4), which share the same model availability. Note that the CORDEX CDS catalogue was previously used in the IPCC Atlas gridded monthly dataset to create regional datasets over the continental domains covered by CORDEX, at the common 0.44° resolution (not included in the current version). This ensemble is described in the documentation accompanying the corresponding IPCC AR6 Atlas entry in the CDS catalogue. The C3S Atlas Dataset includes the datasets with highest resolution available from CORDEX, in particular the 0.11º version of the European regional dataset (CORDEX-EUR-11) described here and a worldwide 0.22º mosaic produced with CORDEX-CORE simulations described in Section 2.5.
Figure 1: Final CMIP6 ensemble used in the C3S Atlas dataset, showing the available models (rows) across the different variables (columns). Colours represent the historical and future SSP scenarios available in the CDS.
Figure 3: Final CMIP5 ensemble used in the C3S Atlas dataset, showing the available models (rows) across the different variables (columns). Colours represent the historical and future RCP scenarios available in the CDS.
Figure 2: Final CORDEX-EUR-11 ensemble used in the C3S Atlas dataset, showing the available models (rows) across the different variables (columns). Colours represent the historical and future RCP scenarios available in the CDS.
Note that the CDS catalogue for CMIP6 includes only a single realisation per model. In contrast, for CMIP5, multiple realisations are included in some cases. Figure 3 displays the CMIP5 realizations used in the C3S Atlas Application, typically one per model to ensure consistency across all projection datasets. However, for models that provided multiple realisations used in CORDEX simulations, all such realisations have been included. The full C3S Atlas Dataset, as available in the CDS, includes all realisations for each model.
2.6. Mosaic approach for CORDEX-CORE
One of the main novelties of the dataset is related to the CORDEX-CORE project10. CORDEX-CORE is a highly ambitious initiative aimed at providing regional climate projections with global coverage (see Figure 4) and high resolution (0.25°x0.25°) by combining of 2 RCMs nested to 5 GCMs selected to span the widest range of uncertainty (see Table 6). The dataset includes, for the first time, the CORDEX-CORE simulations spatially blended using the mosaic approach described in Diez-Sierra et al. (2022). This dataset addresses two main issues related to the CORDEX-CORE simulations: (1) avoiding domain selection in overlapping areas by choosing the domain that best fits each one of the IPCC AR6 reference regions and (2) avoiding the multiple native projections of the different domains by providing the data in a regular global mesh. Due to its global coverage and higher resolution, this dataset constitutes the main source of information to analyze regional climate change globally, so it is a strategic dataset for the C3S Atlas.
The two RCMs used in CORDEX-CORE are REMO2015 and RegCM4; the three GCMs used are MOHC_HadGEM2-ES, MPI-M_MPI-ESM-MR (with the LR version used in some domains), and NCC_NorESM1-M (sorted in high to low climate sensitivity); note that MIROC-MIROC5 and NOAA-GFDL-GFDL-ESM2M are used as backups for high and low sensitivity, respectively. The list of simulations is given in Table 5, which describes the matrix of precise model versions used in the different domains. Figure 5 shows the final ensembles computed for the different variables (in columns) for the CORDEX-CORE dataset.
Figure 4: Spatial coverage of the CORDEX-CORE simulations (displaying mean temperature change for a 3º global warming relative to 1981-2010). Image from the C3S Atlas Application (https://atlas.climate.copernicus.eu)
Table 5: CORDEX simulations for the different domains used to build the CORDEX-CORE dataset (note that EUR-11 simulations are upscaled to the global 0.22 grid). Empty cells indicate that the default versions for the two RCMs (RegCM4_7 and REMO2015) are used. Different versions and/or backup GCMs are indicated in the table.
AFR-22 | AUS-22 | CAM-22 | EAS-22 | EUR-11 | NAM-22 | SAM -22 | SEA-22 | WAS-22 | |
MOHC_HadGEM2-ES_REMO | |||||||||
MOHC_HadGEM2-ES_RegCM | v4-4_v0 | v4-6_v1 | v4-4-rc8 | MIROC5 | |||||
MPI-M_MPI-ESM-LR_REMO | r3i1p1 | ||||||||
MPI-M_MPI-ESM-MR_RegCM | v4-4_v0 | MPI-LR | MPI-LRv4-4-rc8 | v4-7_v0 | |||||
NCC_NorESM1-M_REMO | |||||||||
NCC_NorESM1-M_RegCM | GFDL | v4-4_v0 | v4-6_v1 | GFDLv4-4-rc8 | v4-7_v0 |
Figure 5: Final CORDEX-CORE ensemble used in the C3S Atlas Dataset, showing the available models (rows) across the different variables (columns). Colours represent the historical and future RCP scenarios available in the CDS (see Table 5 for the specific configuration of the global mosaic).
2.7. Bias adjustment
Bias adjustment is a recommended processing step for threshold-related indices, as highlighted in the IPCC AR6 (Cross-Chapter Box 10.2: Relevance and Limitations of Bias Adjustment). Numerous methods have been proposed and compared in the literature, each with distinct advantages and limitations (e.g. Casanueva, 2022). For threshold-based temperature indices, simple linear scaling is often an effective choice; by adjusting the mean, it generally yields satisfactory results when compared to more complex approaches such as ISIMIP3. In the C3S Atlas Dataset, both raw and bias-adjusted versions of threshold-dependent indices are provided, following IPCC AR6 best practices. The ISIMIP3 method is applied for both temperature and precipitation indices, while the simple linear scaling method is also applied to temperature indices to highlight differences between basic and more advanced methods, and allowing users to explore this source of uncertainty.
All calculations are done using the ibicus package (see Table 4 for details) which implements a variety of bias adjustment methods (including linear scaling and ISIMIP3 among others).
Ibicus method | Function | Parameters | Description |
LinearScaling (LS) |
| "running_window_mode" : True, "running_window_length" : 30, "running_window_step_length" : 30 | Linear scaling method based on Maraun (2016). The reference period used for bias adjustment: 1980-2005 (CMIP5/6 using WFDE5) and 1970-2005 (CORDEX-EUR-11 using E-OBS and CORDEX-CORE using ERA5-Land). This method is applied to temperature-related indices (see Iturbide et al. 2022). Bias adjustment is applied to the primary variables (mean, minimum and maximum temperatures) before calculating the index. |
ISIMIP
| ISIMIP (temperature) | “running_window_mode”: False, | ISIMIP trend preserving method based on Lange (2022). The reference period used for bias adjustment: 1980-2005 (CMIP5/6 using WFDE5) and 1970-2005 (CORDEX-EUR-11 using E-OBS and CORDEX-CORE using ERA5-Land). This method is applied to both temperature- and precipitation-related indices. Bias adjustment is applied to the primary variables (mean, minimum and maximum temperatures and precipitation) before calculating the index. An intercomparison of this method with other standard techniques is provided in Casanueva et. al (2022). |
ISIMIP (precip) | "lower_bound": 0, |
Table 4. Functions and parameters of the Python Ibicus package (https://ibicus.readthedocs.io) used in the C3S Atlas for bias adjusted variables. Note that the ISIMIP method requires different parameters for bias adjusting temperature and precipitation.
Full details of the processing workflow (including bias adjustment) are available in the GitHub C3S Atlas repository which includes the software and notebooks documenting the entire workflow, ensuring reproducibility and facilitating reuse.
3. Technical information and archiving
3.1. File format and archiving
Files have been generated using the NetCDF-C library version 4.4.1.1 and HDF5 library version 1.10.1, using the NETCDF4 data model. The resulting files in NetCDF format follow the CF 1.9 metadata convention, which supports string-type NetCDF variables (used to define some attributes in the files, such as members). The attribute convention for data discovery is compliant with ACDD-1.3, including the Coordinate Reference System (CRS) and other descriptive elements.
Data is stored in separate files for each experiment, scenario, and variable/index, with all members included in the same file, identified by the member attribute, and covering the full time periods listed in Table 1. A compression level of 1 (DeflateLevel = 1) is used when creating NetCDF files to provide a good balance between compression and read/write performance.
This, as well as the new indices, resulted in 2.1 TB with a total of 1057 files for the present version (v2), as compared with the 1.3 TB and 412 files of the first version (v1).
Several technical issues were identified in the first version of the C3S Atlas Dataset and were corrected in the latest (present) version:
- Problems with _FillValue and missing_value attributes: These two attributes are used to identify gridboxes with no information and are mapped to NaN/missing-values by standard software frameworks. The latter is used to specify a code for NaN values (missing data), where the dataset does not provide a numeric value (e.g. regions not covered by the dataset, removed outliers, etc.). The former is used to identify the gridboxes that should be filled during the generation of the archive (either with a value or with a NaN) and is used to flag potential problems in the writing process. In some cases, datasets are generated using the same value for both attributes. In the C3S Atlas Dataset we intentionally use different values for the "_FillValue" and "missing_value" properties to ensure that all data has been written to the file, including values representing "missing_value" where data is missing and needs to be interpreted by the user's application (e.g. SST on land-points or data outside the CORDEX domain). This caused issues for the WPS subsetting system (which is based on Python and uses xarray), as xarray cannot handle multiple types of missing values when saving a file, although reading such files poses no problem. To resolve this, the WPS adopted a solution of merging the two missing value types and retaining a single 'missing_value' when saving files.
- The use of character coordinate variables in the C3S Atlas Dataset, specifically the character-based ensemble attribute used to describe the models in the ensemble, led to incompatibilities with certain versions of netCDF4, particularly regarding compression options. However, this issue has been addressed in recent versions of the libnetcdf library.
- Originally a high compression level (level 9) was used to achieve maximum data reduction. However, this significantly slowed down data access. After some discussion, an analysis was carried out to find a better balance between file size and performance, and it was found that a compression level of 1 offers a more optimal trade-off between compression and read/write speed. We adopted this change for the present version of the C3S Atlas Dataset (v2).
- CDO cannot directly process the C3S Atlas dataset due to issues with the ensemble attribute. Specifically, CDO has a legacy limitation requiring 'time' to be the first dimension, which requires pre-processing before applying CDO operators. Today, it is widely accepted that users must pre- and post-process data when working with CDO, rather than feeding it raw files. It is also important to note that CDO is designed to handle ensemble members distributed across separate files; placing “time” as the first dimension prevents the use of CDO functions tailored for ensembles.
3.2. Spatial subsetting
The CDS form (and API) allows users to spatially subset the data for download by selecting any latitude–longitude region. If the selected area exceeds the dataset’s spatial coverage, the full dataset will be returned. If the area overlaps partially, only the data points within the covered region will be provided. However, requests that do not intersect the dataset’s region at all will fail.
The BERKEARTH, CPC, SST-SCI, ERA5, ERA5-Land, CMIP5 and CMIP6 datasets cover the entire Globe, while E-OBS, CERRA and CORDEX-EUR-11 are for Europe (see Figure 7) and CORDEX-CORE is a mosaic covering mostly the continental parts of the World (see Figure 4).
Figure 7: Illustrations of the extent of the E-OBS (left), CORDEX-EUR-11 (center), and CERRA (right) European domains.
In the longer term, we intend to introduce temporal subsetting too easing the selection of the time period and allowing to choose shorter time periods.
4. Fit-for-purpose quality control
Some basic quality control procedures were implemented to check the consistency of the data produced for the different indices and variables across datasets. This section describes the tests implemented to analyze and fix issues in the dataset.
4.1. Tests implemented
A comprehensive quality control process was conducted on all variables and indices to identify potential issues in both the original data (e.g., units, coordinates) and throughout the index generation workflow. This process enabled the detection of metadata errors and data inconsistencies during data harmonisation, and fixing or excluding the involved simulations. The quality control involved two qualitative tests:
- Spatial maps: These were used to identify spatial misalignments relative to reference features, such as coastlines (see Figure 8). This analysis was particularly useful in spotting interpolation errors and coordinate issues in the raw data. Only a few problems were found in the original datasets. Notably, the IPSL-CM5A2-INCA_r1i1p1f1 simulation showed irregular native longitude and latitude coordinates for sst and siconc, and was therefore excluded from the dataset. In addition, mosaics of different ensemble members were used to assess ensemble dispersion. This analysis helped identify anomalies in the spread or spatial patterns of climate projections. Notably, the simulation EUR11_MPI-M_MPI-ESM-LR_r1i1p1_IPSL_WRF381P_v1 was found to be a clear outlier, showing physically inconsistent values (when compared with the rest of members) across most variables. As a result, it was excluded from the dataset.
Figure 8: Spatial map for near surface air temperature (t) for Euro-CORDEX.
- Climate Stripe Plots: Stripe plots were used to detect various issues, including simulations that deviate significantly from the ensemble range for particular years or contain extended periods of missing data. These plots are generated by aggregating variables annually over the entire domain—globally for CMIP and regionally for CORDEX. Figure 9 presents a stripe plot for near-surface air temperature (tas) from CMIP6 under the SSP5-8.5 scenario. Each row represents a different model in the ensemble (see Section 2.4), and each column corresponds to a year, showing spatially aggregated annual means. This analysis identified several models with prolonged data gaps that were excluded from the dataset. Specifically, CNRM-CM6-1_r1i1p1f2 (for variable sst under scenario ssp370) and IITM-ESM_r1i1p1f1 (for sst under all scenarios, and all variables under ssp370). Note that in many cases, the ensemble shows large uncertainty, with a large spread with low- and high-end models. The qualitative inspection of the stripe plots was used to detect outliers (models with values which are significantly lower/higher than the ensemble spread, indicating wrong units, or similar problems). Although the spread was high in some cases, the outliers were physically consistent (with the exception of EUR11_MPI-M_MPI-ESM-LR_r1i1p1_IPSL_WRF381P_v1, already mentioned) and, therefore, they were not removed from the dataset.
Figure 9: Example of climate stripe plot for CMIP6 showing annual mean values (in columns) for the different GCMs forming the ensemble (in rows). Note that different members exhibit different temperature magnitudes (i.e. systematic biases if they were compared to an observational dataset).
5. Known issues
No issues reported.
6. Versions of the C3S Atlas Dataset
The following table details the different versions of the C3S Atlas Dataset (and dates, in chronological order), with main changes and new elements. Note that in the CDS only the latest version (v2 at the moment) is available.
Date | Changes description |
---|---|
C3S Atlas Dataset version v2
| Second version of the C3S Atlas Dataset (v2) published. The major new elements of this version are:
See Table 5 for details. |
C3S Atlas Dataset version v1 | First version of the C3S Atlas Dataset (v1) published. |
The latest version (v2) introduced several modifications to some variables/datasets already existing in v1. They are summarised below in Table 5.
Table 5. Description of changes introduced in the latest version of the C3S Atlas (v2) for variables and datasets already existing in the first version (v1).
Var | Dataset | Exp. | Model | Change |
sst | CMIP6 | All | All | land mask included in the dataset |
siconc | CMIP6 | All | All | land mask included in the dataset |
sfcwind | CMIP6 | All | All | year 2015 refactorised. It was missing |
evpsbl | CORDEX-EUR-11 | rcp85 | ICHEC-EC-EARTH_r12i1p1_UHOH-WRF361H | Recalculated due to processing errors |
evpsbl | CORDEX-EUR-11 | rcp45 | CNRM-CERFACS-CNRM-CM5_r1i1p1_CLMcom-CCLM4-8-17 | Recalculated due to processing errors |
evpsbl | CORDEX-EUR-11 | rcp45 | ICHEC-EC-EARTH_r12i1p1_CLMcom-CCLM4-8-17 | Recalculated due to processing errors |
pr | CORDEX-CORE | rcp26 | NAM-22_NCC-NorESM1-M_r1i1p1_GERICS-REMO2015 | Recalculated to avoid missing patches |
pr | CORDEX-CORE | historical | MPI-M-MPI-ESM-LR_r1i1p1_NCAR-RegCM4 | Recalculated to avoid missing patches |
pr | CORDEX-CORE | rcp26 | EAS-22_NCC-NorESM1-M_r1i1p1_GERICS-REMO2015 | Recalculated to avoid missing patches |
pr | CORDEX-CORE | rcp85 | WAS22_MPI-M-MPI-ESM-LR_r1i1p1_GERICS-REMO2015 | Recalculated to avoid missing patches |
pr | CORDEX-CORE | rcp85 | MOHC-HadGEM2-ES_r1i1p1_ISU-RegCM4 | Recalculated to avoid missing patches |
tx | CORDEX-CORE | rcp26 | CAM22_NOAA-GFDL-GFDL-ESM2M_r1i1p1_ICTP-RegCM4-7 | Recalculated to include a missing year |
cd | ERA5-Land | - | None | correct the threshold. It was lower than 22ºC in v1 resulting in larger values. |
All | ERA5,ERA5-Land | - | None | extended until year 2024 |
tx35bals | CMIP6, CORDEX-EUR, CORDEX-CORE | All | All | Recalculated bals with parameters of v2 (reference period for the bias adjustment) |
psl | CORDEX-EUR-11, CMIP5, CMIP6,ERA5,E-OBS | All | All | unit change to hPA |
huss | CORDEX-EUR-11, CMIP5, CMIP6,CORDEX-CORE | All | All | unit change to g/kg |
spi6 | CORDEX-EUR-11, CMIP6,ERA5,E-OBS,CORDEX-CORE,ERA5-Land | All | All | new reference period, output period and parameters |
spei6 | CMIP6,ERA5,E-OBS | All | All | new reference period, output period and parameters |
7. Versions of the document
The table below outlines the changes made to this document, listed in chronological order. Document versions follow the Vn.m format, where n refers to the dataset version and m denotes the document version within that version cycle.
Date | Changes description |
---|---|
Version V2.0 | This document has been updated to indicate the new elements of the second version of the C3S Atlas Dataset (v2), indicated in red colour in the different tables of the document. |
Version V1.2 |
|
Version V1.1 |
|
Version V1.0 | First version of the document published together with the C3S Atlas launch (first version, v1), before the dataset is available in the CDS. This document serves as documentation for both the dataset and the C3S Atlas. |
This document has been produced in the context of the Copernicus Climate Change Service (C3S).
The activities leading to these results have been contracted by the European Centre for Medium-Range Weather Forecasts, operator of C3S on behalf of the European Union (Delegation Agreement signed on 11/11/2014 and Contribution Agreement signed on 22/07/2021). All information in this document is provided "as is" and no guarantee or warranty is given that the information is fit for any particular purpose.
The users thereof use the information at their sole risk and liability. For the avoidance of all doubt , the European Commission and the European Centre for Medium - Range Weather Forecasts have no liability in respect of this document, which is merely representing the author's view