You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

Table of Contents

Introduction

This dataset provides every day, hourly timeseries of air quality forecasts at European observation sites that have been optimised using a Model Ouput Statistic (MOS) method.

The CAMS MOS uses a machine learning algorithm to improve the Chemistry-Transport CAMS European air quality Ensemble forecasts for 4 pollutants (O3, NO2, PM10, PM2.5) at the observation sites. It is optimised on an automatic basis from predictive variables (predictors) over a learning period and delivers a 4-day forecast (0h to 96h). 

By performing an adjustment of the raw forecast of the regular chemistry-transport forecasting system, it belongs to the category of Model Ouput Statistic (MOS) products.

Description of the MOS method

Approaches

In the framework of a previous CAMS Service (CAMS_63), several machine learning postprocessing approaches (as MOS methods) have been experimented in order to take stock of the recent development of machine learning applications. 

Among the different configurations, several learning periods were tested as well as different set of predictors for each species and various performance indicators.

The selected MOS approach applies to the whole Europe meaning that a unique statistical model is built with CAMS Ensemble forecast, IFS meteorological and Observation data covering the whole modelling domain.  The advantage of this global modelling approach is that a very short time period is needed to gather enough data to train a robust model (good performances have been obtained with a short time period for training). To optimise the performances, a new model is built on a daily basis with the most recent available data. Any change in the modelling system (upgrade of a member of the Ensemble model, addition of new observation sites…) is thereby automatically and rapidly passed on a new MOS model producing appropriate correction.

In order to validate the definitive configuration in term of robustness, performance and computing time, assessments have been carried out and published in Bertrand et al., 2023: https://acp.copernicus.org/articles/23/5317/2023/

At this time, the learning period is defined at 3 days.

Predictors

The MOS is trained, over the 3 days learning period, with hourly air quality observations and modelling data (for both air quality and meteorological parameters) and predicts hourly concentrations.

The training is based on the relation between predictors and observations as the target element to define a statistical model. This statistical model is able to convert the same predictors into a concentration forecast, which is here our predictand.

Several sets of predictors have been investigated and results have shown that a limited set of predictors including the concentrations from CAMS Ensemble forecast, some predicted meteorological variables and recent observations provides good performances.

Criteria for used observations :

  • background observation sites, specific selection based on an objective classification (Categories 1 to 7 of the Joly and Peuch, 2012 classification, corresponding roughly to urban, suburban and rural background observation sites)
  • hourly observations
  • 75% availability rate of observations over the learning period

The MOS production takes place once a day from 6:30 UTC and produce forecasts for all observation sites available if the above criteria are met. Thus, the amount of observation sites varies following the species and the date.

Data used for MOS : 

European air quality Ensemble forecast variablesIFS Meteorological forecast parametersEEA Air quality observations
  • O3
  • NO2
  • PM10
  • PM2.5
  • Temperature at 2m
  • Relative humidity
  • Wind speed (zonal and meridional)
  • Boundary layer's height
  • measurements of the previous day

The MOS takes as input some model data but also observations of the previous day acknowledging the importance of the persistence in the forecast skill.

European air quality Ensemble forecast at the observation site location is taken as input for O3, NO2, PM10 and PM2.5 concentrations. 

IFS is used for all weather predictors due to the spatial coverage over Europe and high quality of its forecast.

The model data is gridded, thus a bilinear interpolation is performed to get the value at all observation sites.

Data access

Data is available for download from the CAMS Atmosphere Data Store (ADS). CAMS ADS registered users can access the available data interactively through the CAMS European air quality forecast optimised at observation sites ADS download web interface and/or programmatically using the API as per instructions detailed here.

Data availability (HH:MM)

The processing takes place at 6:30 UTC and the delivery is guaranteed by 8:00 UTC on the ADS.

Spatial resolution

Timeseries are provided at individual observation sites.

Temporal frequency

The MOS model runs once a day from 6:30 UTC.

Data are available with a time resolution of 1 hour and forecasts period from step 0h to step 96h.

Data format

Data are available in csv format with semi-comma separator. The files are split by date, countries and species. 

In the file, the observation sites are declared by their EIONET identifier.

An associated metadata file is available from the download form and gives information on the observation sites (coordinates, altitude, type of observation site as provided by the European Environment Agency, date_start, date_end).

Please also note that the location of some observation sites may change in time. As soon as an observation site displacement occurs, a new line appears in the metadata file with the new coordinates. To date these coordinate changes, date_start and date_end columns indicate the start and end dates for which MOS was produced at these specific coordinates.

Product listings

Please note that not all species are available at all observation sites for all the timesteps.

Variable NameNetCDF UnitsVariable name in ADSNote
Nitrogen dioxideµg m-3
nitrogen_dioxide
Data are available from 17-01-2024
Ozoneµg m-3
ozone
Data are available from 17-01-2024
Particulate matter < 10 µmµg m-3
particulate_matter_10um
Data are available from 17-01-2024
Particulate matter < 2.5 µmµg m-3
particulate_matter_2.5um
Data are available from 17-01-2024

Validation reports

MOS production evaluation will be made available at station level and aggregated by country through an interactive visualization platform 

Guidelines

  • Users can select either 'Raw' or 'MOS-optimiseddaily air quality forecasts at European observation sites. The raw forecasts are European air quality ensemble forecast interpolated to the observation site location. The MOS-optimised forecasts are produced from the raw forecasts using a statistical post-processing method called machine learning postprocessing as an Model Output Statistic (MOS) method. Both types are provided in the same format.
  • Missing values may be present in the MOS product for some species and/or hours, due to the lack of observations available at the observation site for that species/hours. Indeed last observations are needed to produce MOS because they are used as predictor.

How to acknowledge, cite and refer to the data

All users of data uploaded on the Atmosphere Data Store (ADS) must provide clear and visible attribution to the Copernicus programme and are asked to cite and reference the dataset provider.

(1) Acknowledge according to the licence to use Copernicus Products.

(2) Cite each dataset used:

  • METEO FRANCE, Institut national de l'environnement industriel et des risques (Ineris), Aarhus University, Norwegian Meteorological Institute (MET Norway), Jülich Institut für Energie- und Klimaforschung (IEK), Institute of Environmental Protection – National Research Institute (IEP-NRI), Koninklijk Nederlands Meteorologisch Instituut (KNMI), Nederlandse Organisatie voor toegepast-natuurwetenschappelijk onderzoek (TNO), Swedish Meteorological and Hydrological Institute (SMHI) and Finnish Meteorological Institute (FMI) (2024): CAMS European Air Quality Forecast Optimised at observation sites. Copernicus Atmosphere Monitoring Service (CAMS) Atmosphere Data Store (ADS).  (Accessed on <DD-MMM-YYYY>), https://ads.atmosphere.copernicus.eu/cdsapp#!/dataset/cams-europe-air-quality-forecasts-optimised-at-observation-sites?tab=overview

(3) Throughout the content of your publication, the dataset used is referred to as Author (YYYY) i.e.  METEO-FRANCE et. al (2024)

References


This document has been produced in the context of the Copernicus Atmosphere Monitoring Service (CAMS).

The activities leading to these results have been contracted by the European Centre for Medium-Range Weather Forecasts, operator of CAMS on behalf of the European Union (Delegation Agreement signed on 11/11/2014 and Contribution Agreement signed on 22/07/2021). All information in this document is provided "as is" and no guarantee or warranty is given that the information is fit for any particular purpose.

The users thereof use the information at their sole risk and liability. For the avoidance of all doubt , the European Commission and the European Centre for Medium - Range Weather Forecasts have no liability in respect of this document, which is merely representing the author's view.

  • No labels