Contributors: Paolo Sanò (CNR-ISAC), Giulia Panegrossi (CNR-ISAC), Leonardo Bagaglini (CNR-ISAC), Elsa Cattani (CNR-ISAC), Hannes Konrad (DWD), Thomas Sikorski  (DWD), Marc Schröder  (DWD)

Table of Contents

History of modifications

Version

Date

Description of modification

Chapters / Sections

1.0

31/03/2021

Initial version

1–6, References

1.1

07/09/2021

Clarification of PNPR-CLIM input data

2.1.2 – Table 10

List of datasets covered by this document

Deliverable ID

Product title

Product type (CDR, ICDR)

Version number

Delivery date

D3.3.3-v1.0

COBRA daily and monthly precipitation

CDR

1.0

2021/03/31

Related documents

Reference ID

Document

D1

Algorithm Theoretical Baseline Document HOAPS version 4.0, v2.3, 2017/12/31, CM SAF, https://www.cmsaf.eu/SharedDocs/Literatur/document/2017/saf_cm_dwd_atbd_hoaps4_2_3_pdf.pdf?__blob=publicationFile&v=3
DOI of corresponding dataset: 10.5676/EUM_SAF_CM/HOAPS/V002

D2

Product User Manual SSM/I and SSMIS data record products HOAPS v4.0, v1,1, 2017/01/31, CM SAF, https://www.cmsaf.eu/SharedDocs/Literatur/document/2017/saf_cm_dwd_pum_hoaps4_1_1_pdf.pdf?__blob=publicationFile&v=3
DOI of corresponding dataset: 10.5676/EUM_SAF_CM/HOAPS/V002

Acronyms

Acronym

Definition

1DH

One Degree Hourly – Spatio-temporal Grid Description

1DD

One Degree Daily – Spatio-temporal Grid Description

1DM

One Degree Monthly – Spatio-temporal Grid Description

2B-CMB

GPM's 2B-CMB L2 precipitation product

AMSR-E

Advanced Microwave Scanning Radiometer – Earth Observing System

AMSU

Advanced Microwave Sounding Unit

ANG

Scan angle

AO

Area of Overlap

ATMS

Advanced Technology Microwave Sounder

AVHRR

Advanced Very High Resolution Radiometer

BT

Brightness Temperature

CC

Correlation Coefficient

CDR

Climate Data Record

CM SAF

Satellite Application Facility on Climate Monitoring

COBRA

Copernicus Microwave-based Global Precipitation

CRM

Cloud Resolving Model

DMSP

Defense Meteorological Satellite Program

DPR

Dual-Frequency Precipitation Radar

ECMWF

European Centre for Medium-Range Weather Forecasts.

EFOV

Effective Field of View

EPSG

European Petroleum Survey Group

ERA5

ECMWF Reanalysis v5

ETOPO1

One Arc-Minute Global Relief Model

FCDR

Fundamental Climate Data Record

FIDUCEO

FIDelity and Uncertainty in Climate data records from Earth Observations

FOV

Field of View

FPG

Footprint Gridding

GMI

GPM Microwave Imager

GPM

Global Precipitation Measurement

GPM-CO

Global Precipitation Measurement's Core Observatory satellite

FL

Freezing Level

HIRS

High-resolution Infra-Red Sounder

HOAPS

Hamburg Ocean Atmosphere Parameters and Fluxes from Satellite Data

H SAF

Satellite Application Facility on Support to Operational Hydrology and Water Management

IFOV

Instantaneous Field of View

KaPR

Ka-band Precipitation Radar

KuPR

Ku-band Precipitation Radar

MetOp

Meteorological operational satellite

MetopA/B

Meteorological operational satellite – A/B

MHS

Microwave Humidity Sounder

MRMS

Multi-Radar-Multi-Sensor

MW

Microwave

NN

Neural Network

NOAA

National Oceanic and Atmospheric Administration

NWP SAF

Satellite Application Facility for Numerical Weather Prediction

OLC

Ocean-Land-Coast mask

PDF

Probability Density Function

PNPR

Passive microwave Neural network Precipitation Retrieval

PNPR-CLIM

PNPR adapted for climatological applications

PRE

Precipitation Rate Estimation

RMSE

Root Mean Square Error

PNC

Precipitation/No-precipitation Classification

RTE

Radiative Transfer Equation

SD

Snow depth

SIF

Sea-ice fraction

SSM/I

Special Sensor Microwave/Imager

SSMIS

Special Sensor Microwave Imager/Sounder

SSM/T

Special Sensor Microwave Temperature Sounder

SSM/T-2

Special Sensor Microwave Humidity Sounder

T2m

2-meter Temperature

TMI

TRMM Microwave Imager

TPWV

Total Precipitable Water Vapor

TRMM

Tropical Rainfall Measuring Mission

VD

Validation dataset

Scope of the document

This document is the Algorithm Theoretical Basis Document (ATBD) for the Copernicus micrOwave-based gloBal pRecipitAtion (COBRA) product. Provided are gridded (level 3) daily and monthly estimates of precipitation rates, derived by combining precipitation information from two sources:

  1. the dedicated newly developed Passive Microwave Neural network Precipitation Retrieval for Climate Applications (PNPR-CLIM) exploiting the cross-track scanning MW sounders;
  2. the Hamburg Ocean Atmosphere Parameters and Fluxes from Satellite (HOAPS), developed in the frame of the Satellite Application Facility on Climate Monitoring (CM SAF) and using the observations of conically scanning MW imagers.

The first part of the ATBD is related to the PNPR-CLIM, version 1.3, designed and developed by the National Research Council of Italy – Institute of Atmospheric Sciences and Climate (CNR-ISAC) under the Copernicus Climate Change Service. The document describes the scientific foundation, the applied approach, and the design and development procedure.

The HOAPS algorithm is described in its dedicated ATBD [D1] and only summarized here.

The document includes the description of gridding, post-processing and merging procedures for combining observations obtained through HOAPS and PNPR-CLIM into the final COBRA product.


Executive summary

Precipitation rate estimates obtained through HOAPS and PNPR-CLIM are combined to form the Copernicus micrOwave-based gloBal pRecipitAtion (COBRA) dataset.

PNPR-CLIM retrieves the instantaneous precipitation rate (mm/h, level 2 product) from the FIDUCEO Fundamental Climate Data Record (FCDR) of brightness temperatures measured by the Advanced Microwave Sounding Unit-B (AMSU-B) and Microwave Humidity Sounder (MHS) cross-track scanning radiometers, exploiting an Artificial Neural Network (NN) approach and a global observational training dataset. An indication of the reliability of the surface precipitation rate estimates is also provided through two pixel-based quality indexes.

The HOAPS algorithm, operating on the CM SAF FCDR of brightness temperatures observed by conically scanning Microwave Imager instruments, also exploits an NN approach. Retrieved precipitation rates are available only over the ice-free ocean.

Observations from the two instrument classes are brought onto an hourly 1° × 1° grid. Respective averaging is carried out by weighting single measurements with the overlap area between the corresponding observational footprint and the grid cell. Post-processing steps at this stage are i) a bias correction (quantile mapping) depending on latitude, month of the year, and surface type (ocean vs. land), ii) filtering of unstable observations by AMSU-B, and iii) filtering of unphysical observations by the Advanced Microwave Scanning Radiometer – Earth Observing System (AMSR-E) for HOAPS, probably induced by a mismatch in the HOAPS sea-ice mask in the case of the higher resolved AMSR E observations.

Daily gridded estimates of precipitation are obtained by first averaging the instantaneous precipitation rates from all available platforms for each hour and spatial grid cell. Gaps are filled by nearest-neighbour interpolation. The hourly data are then accumulated to daily precipitation and converted to daily precipitation rate (mm/d). Among others, the mean hourly standard deviations, mean PNPR-CLIM quality flag, and number of underlying observations are provided as a measure of spatiotemporal variability and indicators of quality, respectively.

Monthly gridded estimates of precipitation are obtained by first averaging over all instantaneous precipitation rate estimates stored in the hourly gridded files for each available platform. Again, these are weighted by the overlap area between the corresponding observational footprint and the grid cell. Finally, the monthly gridded precipitation estimates are averaged over all available platforms by weighting the monthly contribution from each platform by the availability of the respective platform in each month. As in the daily data set, the same indicators of spatiotemporal variability and quality are provided.

1 Instruments

1.1 Instruments used within the PNPR-CLIM algorithm

The PNPR-CLIM algorithm is based on the AMSU-B (on board NOAA-15, NOAA-16, NOAA-17 satellites) and MHS (on board NOAA-18, NOAA-19, and MetOp satellites) microwave sounders. The local equator crossing time of the descending/ascending node for all NOAA satellites lies between 5:00–9:15 a.m./p.m., while for the MetOp satellites it lies between 8:45–9:30 a.m/p.m.. These cross-track scanning radiometers provide measurements at 90 steps of constant 1.1° angular sampling across track, which implies that the IFOV elongates as the beam moves from nadir toward the edge of the scan.

The IFOV resolutions/shapes depend on the radiometer, viewing angle, and height of the satellite, where shape is expressed in terms of cross-track (CT) and down-track (DT) elliptic dimensions. For the AMSU-B and MHS radiometers the satellite height is about 800 km, the nadir and scan edge IFOV resolutions/shapes are 15.88-CT × 15.88-DT km2 (mild-ovate) and 52.83-CT × 27.10-DT km2 (extreme-ovate).

The sampling distance also varies with the scan angle and corresponds to the sampling geometry of AMSU-B/MHS (1.1 degrees), which corresponds to 16 km at nadir. Table 1 presents some AMSU-B and MHS radiometer characteristics.

Table 1: AMSU-B and MHS radiometers' characteristics. QV = Quasi-vertical; polarization vector is parallel to the scan plane at nadir; QH = Quasi-horizontal; polarization vector is perpendicular to the scan plane at nadir.

AMSU-B

Advanced Microwave Sounding Unit-B

MHS

Microwave Humidity Sounder Unit

Satellites

AMSU-B: NOAA-15, NOAA-16, NOAA-17MHS: NOAA-18, NOAA-19, MetOp-A, MetOp-B

Mission

Humidity sounding in almost all-weather conditions and precipitation rate

Instrument type

Absorption-band MW radiometer/spectrometer - 5-channels

Scanning technique

Cross-track: 90 steps of 16 km at sub satellite point, swath 2180 km (2250 km) for MHS (AMSU-B) Along-track: one 16-km line every 8/3 s

Coverage/cycle

Near-global coverage twice/day

Resources

Mass: 63 kg – Power: 93 W – Data rate: 3.9 kbps

MHS/AMSU-B
Central frequency (GHz)

MHS/AMSU-B channel bandwidth (MHz)

MHS/AMSU-B channel polarisation

MHS/AMSU-B channel radiometric accuracy (NEΔT)

89.0

2800/1000

QV/QV

0.22/0.37 K

157.0/150.0

2800/1000

QV/QV

0.38/0.84 K

183.31 ± 3.0

2000/2000

QH/QV

0.42/0.60 K

183.31 ± 1.0

1000/500

QH/QV

0.57/0.70 K

190.311/183.31 ± 7

2000/1000

QV/QV

0.45/1.06 K


Other datasets used to develop and test the NN algorithm are:

  • ERA5 atmospheric variables, used as additional ancillary input data;
  • GPM 2B-CMB precipitation fields, used as reference precipitation data for training and testing.

The detailed description of them and their role in the algorithm development can be found in section 3.2.

1.2 Instruments used in the HOAPS-v4 algorithm

1.2.1 The SSM/I Instrument

The SSM/I is a microwave radiometer, which measures micro wave emission and scattering at 7 channels with four centre frequencies, 19.35, 22.235, 37.0 and 85.5 GHz. A summary of frequency and channel characteristics is shown in table 2.

The SSM/I instrument has been used within the DMSP aboard F-8, F-10, F-11, F-13, F-14 and F-15 spacecrafts. Apart from F-8, local equator crossing time of the descending/ascending node lies for all satellites between 5–10 a.m./p.m. In case of F-8, descending and ascending node were reversed. Thus, local equator crossing time was about 6 a.m. for the ascending node of F-8.

The satellites fly on a near-polar, sun-synchronous, circular orbit, whose period is about 102 minutes. This results in about 14.1 orbits per day and uncovered regions poleward of 87.5°.

Table 3 gives a short summary of further instrument characteristics. A full description of the SSM/I instrument is given by Hollinger (1987, 1990) and Wentz (1991).

Table 2: Summary of channel characteristics of SSM/I instruments as described by Wentz (1991)

Centre Frequency [GHz]

Polarization

Pixel Integration Time [ms]

Samples per scan

Inter-Scan Period [s]

Along-track Sample Spacing [km]

3 dB Footprint Size
(Along-track
× Cross-track)
[km]

19.35

v/h

7.95

64

3.8

25

69 × 43

22.235

v

7.95

64

3.8

25

50 × 40

37.0

v

7.95

64

3.8

25

37 × 28

37.0

h

7.95

64

3.8

25

37 × 29

85.5

v/h

3.89

128

1.9

12.5

15 × 13

Table 3: Summary of spacecraft's and SSM/I instrument's properties

Altitude

860 km

Inclination

98.8 °

Orbit Period

102 min

Swath Width

1394 km

Effective Scan Angle

102.4 °

Local Zenith Angle

53.1 °

Calibration Method

On board; each scan; fixed cold space reflector and reference black body hot load

1.2.2 The SSMIS Instrument

The SSMIS is a 24-channel microwave radiometer using multiple frequencies and therefore is able to replace different former instruments namely SSM/I, Special Sensor Microwave Temperature (SSM/T), and Special Sensor Microwave Humidity Sounder (SSM/T-2). For precipitation retrieval the channels 12–18, with centre frequencies at 19.35, 22.235, 37.0 and 91.655 GHz, are of main interest here. Thus, SSMIS continues the measurements of the SSM/I instrument with a shift of SSM/I highest frequency channel from 85.5 to 91.655 GHz. Further channel characteristics are given in table 4.

Its first usage was aboard the F-16 spacecraft of the DMSP in October 2003. Further missions followed by F-17 (2006), F-18 (2009) and F-19 (2014). Like earlier satellites of the DMSP (see section 1.2.1) these ones are on a near-polar, sun-synchronous, circular orbits with a period of 101.8 minutes. Their local equator crossing occurred during 2–11 a.m./p.m.

Table 5 summarizes further instrument characteristics. F-19 was out of control since 2016, thus its data might be of lower quality [D1].

A full description of the SSMIS instrument is given by Northrop Grumman Corporation (2002) and Kunkee (2008).

Table 4: Summary of SSMIS channel characteristics exploited in the precipitation retrieval

Channel

Centre Frequency [GHz]

Polarization

Pixel Integration Time [ms]

Samples per scan

Inter-Scan Period [s]

Along-track Sample Spacing [km]

3 dB Footprint Size
(Along-track
× Cross-track)
[km]

12

19.35

h

8.44

90

3.8

25

73.6 × 46.5

13

19.35

v

8.44

90

3.8

25

73.6 × 46.5

14

22.235

v

8.44

90

3.8

25

73.6 × 46.5

15

37.0

v

8.44

90

3.8

25

45.0 × 31.2

16

37.0

h

8.44

90

3.8

25

45.0 × 31.2

17

91.655

v

4.22

180

1.9

12.5

15.5 × 13.2

18

91.655

h

4.22

180

1.9

12.5

15.5 × 13.2

Table 5: Summary of spacecraft's and instrument's properties for SSMIS

Altitude

833 km

Inclination

98.9 °

Orbit Period

101.8 min

Swath Width

1707 km

Effective Scan Angle

143.2 °

Local Zenith Angle

53.1 °

Calibration Method

On board; each scan; fixed cold space reflector and reference black body warm target


1.2.3 The AMSR-E Instrument

The AMSR-E is a passive microwave radiometer with twelve channels at six frequencies. Its centre frequencies are located at 6.925, 10.65, 18.7, 23.8, 36.5, and 89.0 GHz. Each frequency band operates in dual-polarization, i.e., channels of horizontal and vertical polarization are measured separately. AMSR-E is a conical scanning instrument. Sensor characteristics are summarised in table 6.

The instrument is mounted on the Aqua spacecraft, which was launched in May 2002. Aqua's orbits are near-polar and sun-synchronous with an eccentricity of 0.0015. Equator overpasses of Aqua appear near 1.30 a.m./p.m. local time for the ascending/descending node. Table 7 presents additional information about the Aqua spacecraft and the AMSR-E instrument.

Table 6: Summary of channel characteristics for the AMSR-E instrument

Centre Frequency [GHz]

Bandwidth
[MHz]

Polarization

Pixel Integration Time [ms]

Scan cycle
[s]

Sampling Interval
[km]

IFOV
[km]

6.925

350

h/v

2.6

1.5

10 × 10

74 × 43

10.65

100

h/v

2.6

1.5

10 × 10

51 × 30

18.7

200

h/v

2.6

1.5

10 × 10

27 × 16

23.8

400

h/v

2.6

1.5

10 × 10

31 × 18

36.5

1000

h/v

2.6

1.5

10 × 10

14 × 8

89.0

3000

h/v

1.3

1.5

5 × 5

6 × 4

Table 7: Summary of spacecraft's and instrument's properties for AMSR-E

Altitude

705 km

Inclination

98.2 °

Orbit Period

98.8 min

Swath Width

1445 km

Effective Scan Angle

122 °

Earth Incidence Angle

55 °

Calibration Method

On board; each scan; fixed cold space reflector and reference black body warm target

The IFOVs of AMSR-E and SSM/I / SSMIS were harmonized by averaging the brightness temperatures of each three neighbouring AMSR-E scan positions.

1.2.4 The TMI Instrument

TMI is a passive microwave radiometer based on the characteristics of the earlier instrument SSM/I (see section 1.2.1). Compared to SSM/I, the TMI frequencies are expanded with an additional channel at 10.65 GHz. Moreover, the SSM/I channel at 22.235 GHz has been replaced with a new channel centred at 21.3 GHz. Table 8 summarizes more information on TMI channels.

The instrument was onboard the TRMM satellite, which operated from 1997 until April 2015. TRMM had a circular, non-sun-synchronous orbit with an inclination of 35 degrees to the Equator. Additional details about the spacecraft and instrument are shown in table 9 (Kummerow et al., 1998).

Table 8: Summary of channel characteristics for the TMI instrument

Channel

Centre Frequency [GHz]

Polarization

Pixel Integration Time [ms]

EFOVs per scan

EFOV
(Along-track
× Cross-track)
[km]

Inter-Scan Period [s]

IFOV
(Along-track
× Cross-track)
[km]

1

10.65

v

6.6

104

63.2 × 9.1

6.6

59.0 × 35.7

2

10.65

h

6.6

104

63.2 × 9.1

6.6

60.1 × 36.4

3

19.35

v

6.6

104

30.4 × 9.1

6.6

30.5 × 18.4

4

19.35

h

6.6

104

30.4 × 9.1

6.6

30.1 × 18.2

5

21.3

v

6.6

104

22.6 × 9.1

6.6

27.2 × 16.5

6

37.0

v

6.6

104

16.0 × 9.1

6.6

16.0 × 9.7

7

37.0

h

6.6

104

16.0 × 9.1

6.6

16.0 × 9.7

8

85.5

v

3.3

108

7.2 × 4.6

3.3

6.7 × 4.1

9

85.5

h

3.3

108

7.2 × 4.6

3.3

6.9 × 4.2

Table 9: Summary of spacecraft's and instrument's properties for TMI

Altitude

402 km

Inclination

35 °

Swath Width

758.5 km

Effective Scan Angle

130 °

Earth Incidence Angle

 52.8 °

Calibration Method

On board; each scan; fixed cold space reflector and reference black body warm target

2 Input and Auxiliary Data

2.1 PNPR-CLIM

2.1.1 Microwave Brightness Temperatures FCDR

Carefully calibrated and homogenised radiance datasets are a fundamental prerequisite for climate studies, climate monitoring and reanalysis. Climate research requires long-period, consistent and uncertainty-quantified data records that the available operational datasets do not provide for several reasons. There are biases between instruments and biases in measurements from the same instrument for different time periods/regions, due to temporary instrument failures. Moreover, the measurement noise may vary during instrument lifetime and calibration procedures also contribute to the overall uncertainty (Hans et al., 2017, 2018, 2019; Brogniez et al., 2016; Merchant et al., 2017; Burgdorf et al., 2018).

The FIDelity and Uncertainty in Climate data records from Earth Observations (FIDUCEO1) project, created to address these issues, delivered various climate datasets from Earth Observation Satellites, which have received rigorous harmonization treatments between various datasets with a specific analysis of the relative uncertainties (Merchant et al., 2019). The datasets include FCDRs containing harmonised radiances and Climate Data Records (CDRs).

FCDRs consist of continuous, harmonised records of calibrated, geolocated, uncertainty-quantified sensor observations in geophysical units (such as radiance), together with all ancillary and underlying data used to calibrate observations and estimate uncertainty.

FCDRs have been produced for the AVHRR, HIRS and microwave (MW) sensor series which have flown aboard the NOAA and MetOp satellites (the released data record contains all mission years of SSM/T-2 on F11, F12, F14, F15, AMSU-B on NOAA15, NOAA16 and NOAA17 and MHS missions (NOAA18, NOAA19, MetOpA/B), i.e. a data record long enough to generate CDRs for climate research.

Each of these FCDR files contains calibrated brightness temperatures, its uncertainties categorised by their origin from independent, structured and common effects and concise quality flags conveying helpful information on the usability of the data. The archived FCDRs, which can support numerous CDRs, have a length relevant for climate studies, a period of more than twenty years (1994–2017), and a global scale.

For the development of PNPR-CLIM and for the release of the COBRA precipitation CDR the FIDUCEO FCDR of Microwave Brightness Temperatures (BTs) with uncertainties v4.1 for AMSU-B and MHS (from 2000 – 2017) has been used.


2.1.2 ECMWF ERA5 auxiliary input variables

Table 10 presents some model-derived variables used by the algorithm in addition to the input BTs. The selected variables were obtained from the ECMWF ERA5 monthly and daily mean product at 0.25° × 0.25° resolution.

Table 10: Additional algorithm input variables. Sea ice information and snow cover information are Boolean masks derived from daily sea ice cover and snow depth variables, respectively.

Variable

Data source

Sea ice information (daily)

ECMWF ERA5

Snow cover information (daily)

ECMWF ERA5

2 m temperature (monthly)

ECMWF ERA5

Freezing level (monthly)

ECMWF ERA5

Sea ice cover (monthly)

ECMWF ERA5

Snow depth (monthly)

ECMWF ERA5

Total column integrated water vapor (monthly)

ECMWF ERA5

2.2 HOAPS-v4

The HOAPS v4 Level-2 data have been generated on the basis of the CM SAF SSM/I / SSMIS FCDR (Fennig et al., 2017; Fennig et al., 2020; CM SAF’s HOAPS v4.0 ATBD [D1], section 2.2). The initial dataset was extended to cover the full period until the end of 2017 for the present merged CDR. It is assumed that the inter-calibration coefficients remain applicable over the period 2015–2017.

The precipitation rates were originally extended to the microwave imager instruments AMSR-E and TMI in the scope of the German Federal Ministry for Education and Research’s project on decadal climate prediction. These precipitation rates have been included here, too.

The HOAPS v4.0 precipitation retrieval was trained with precipitation rates retrieved from assimilated brightness temperatures in a 1D-Var scheme from ECMWF. The training data set is based on radiative transfer calculations as described in Bauer et al. (2006a, b). The data set contains one month (August 2004) of assimilated SSM/I brightness temperatures and the corresponding ECMWF 1D-Var retrieved precipitation values of the ECMWF model. The NOAA 0.25° daily Optimum Interpolation Sea Surface Temperature (OISST) (Reynolds et al., 2007) has been used as input for the processing of HOAPS v4.0.

HOAPS products are only available over ice-free ocean. While the sea ice is identified in the observed brightness temperatures internally in the algorithm, the global land masses are filtered out using an auxiliary land/ocean mask.

2.3 Gridding, post-processing and merging

The Level 2 files, which are the output of the PNPR-CLIM and HOAPS algorithms (see Appendix), form the input to the gridding, post-processing, and merging procedures.

3 Algorithms

3.1 The processing chain

Figure 1 provides an overview of the processing chain with all relevant algorithms and respective in- and output data for the generation of COBRA. First, instantaneous precipitation rate estimates are derived from brightness temperatures and respective auxiliary data through the PNPR-CLIM (section 3.2) and HOAPS (section 3.3) algorithms (depending on the data source). These are then gridded to a 1° x 1° hourly global grid (FPG algorithm, section 3.4.1) and post-processed (bias correction and filtering for low-quality periods for certain platforms, section 3.4.2). In a last step, the hourly gridded intermediate results from various platforms are merged and agglomerated to daily (section 3.4.3) and monthly (section 3.4.4) fields.

Figure 1: Schematic of the processing chain for the generation of COBRA data products. The central panels with dark grey background indicate the input/output data at various levels. The panels to the sides with light grey background indicate the actual processing steps. The colors of processing arrows and respective algorithm box match. The sections in this document, where the respective algorithms or data are explained in detail, are named where applicable.

3.2 PNPR-CLIM

3.2.1 Theoretical Basis

Artificial neural networks (NNs) represent a highly flexible ensemble of non-linear and non-parametric regression and classification statistical models, increasingly applied in environmental sciences for their capability to approximate complex non-linear and imperfectly known functions to an arbitrary degree of accuracy (e.g., Liou et al., 1999; Aires et al., 2001; Blackwell and Chen, 2005). The opportunities offered by their ability to learn and generalize, as well as their robustness to noise, have encouraged their use in precipitation estimation retrieval from satellite and ground-based measurements. NN techniques have proven to be effective in this research area and have been successfully used in many rainfall estimation and monitoring applications (e.g., Hong et al., 2004; Surussavadee and Staelin, 2008; Mahesh et al., 2011; Tapiador et al., 2017).

A detailed description of the neural network design process analogous to what used for PNPR-CLIM can be found in Sanò et al. (2015, 2016). Here only a short summary is given for the sake of clarity.

A NN requires a large sample of observational data, wide enough to be representative of the true population, comprehensive of both predictors (e.g., BTs) and predictands (e.g., precipitation rates). During the training phase the network learns the intrinsic correlations among the observed and the hidden variables, by adjusting its inner parameters to increase the prediction accuracy. It consists of a sequence of layers connected through compositions of (parametric) affine transformations with certain (fixed) non-linear transfer functions, mapping a multi-dimensional vector space into another, whose components are called neurons (also perceptrons). An illustrative scheme of a NN is shown in figure 2, with the input layer receiving the input signals, the hidden layer(s) and the output layer, providing the network response.


Figure 2: Schematic diagram of a multilayer neural network with two hidden layers (from Sanò et al., 2015).

The number of hidden layers, and the number of neurons in each layer, are determined during the design of the network. Each node has its own transfer function and receives, as input, a weighted sum of the outputs of the previous layer. The output of the transfer function corresponds to the output of each node. For example, the output yk of the k-th node of the first hidden layer takes the form:

\[ y_{k}(\omega,x) = \psi_{1} \left( \sum_{j=1}^{n_{1}}\omega_{kj}^1 \psi_{0} \left( \sum_{i=1}^{n_{0}} \omega_{ji}^0 x_{i} + b_{j}^0 \right) + b_{k}^1 \right) \qquad (3.1) \]

where, for each i, j, \( \omega^0_{ij} \) is the weight connecting the i-th feature xi of the input (of n0 dimensions) to the j-th node of the input layer, \( \omega^1_{kj} \) is the weight connecting the j-th node of the input layer to the k-th node of the first hidden layer (of n1 dimensions), \( \psi_0 \) and \( \psi_1 \) are the transfer functions of the input layer and the first hidden layer respectively, and, finally, \( b^0_j \) and \( b^1_k \) are the biases of the corresponding nodes.

The estimation of the weights of each neuron-neuron connection is performed in the NN training phase, during which a training database is used, providing the network with the inputs (e.g., BTs) and the expected output (e.g., rainfall rate), and the value of each weight is modified to reduce the error between the network and the expected outputs. The training continues in order to minimize the error.

In the back propagation model of the network the input signal propagates forward from the input layer to the output layer. The output layer produces an output o, which is compared to the target t defined in the training set. An error value is calculated as

\[ E = \frac{1}{M} \sum_{m=1}^M (o_{m} - t_{m})^2 \qquad\qquad\qquad\qquad\qquad\qquad\qquad \text{for continuous targets,} \qquad (3.2) \]

\[ E = \frac{1}{M} \sum_{m=1}^M (t_{m} \cdot ln(o_{m} + (1-t_{m}) \cdot ln(1-o_{m})) \qquad \text{for binary targets,} \qquad (3.3) \]

where M is the number of elements of the training set. The network corrects its weights to lessen the errors through an iterative process aimed at the minimization of the error. At the end of the training, the final values of the weights connecting the neurons of the different layers, store the knowledge of the NN (McCann, 1992). The design of the network architecture is normally quite complex. The model selection in NN aims at finding as few hidden units and neuron-neuron connections as necessary for a good approximation of the true function.

3.2.2 NN Training

The approach based on NNs requires a "training phase", that uses a large sample of data representative of the input and output variables of the retrieval process (in this case the BTs with ancillary parameters and the surface precipitation rate, respectively). The performance of the NN is largely dependent on the completeness and representativeness of the database and on its consistency with the actual observations.

In this activity, an observational database was used to train the NN. It is worth noting that a database derived from observations has various advantages compared to those provided by simulations, for example coupling a cloud resolving model (CRM) with a radiative transfer equation (RTE) model (e.g., Casella et al., 2017; Sanò et al., 2013, 2015, 2016). This CRM-RTE coupling approach has been the only possible option to build a large and global Cloud-Radiation Database before the advent of spaceborne radars (e.g., Panegrossi et al., 1998, Kummerow et al., 2011). There are, however, some limitations associated with the use of CRMs, such as uncertainties in surface property characterization (e.g., surface emissivity), single scattering properties of ice or mixed phase hydrometeors, cloud microphysics parameterizations (particle size distributions, bulk densities, conversion processes), vertical and horizontal distribution of solid and liquid hydrometeors (Kuo et al., 2016; Liu et al., 2013; Skofronik-Jackson et al., 2011; Kummerow et al., 2011; Panegrossi et al., 1998; Grecu et al., 2006).

Since the launch of the Global Precipitation Measurement mission (GPM) on February 28, 2014, quasi-global high quality spaceborne radar precipitation measurements have become available. The Dual-frequency Precipitation Radar (DPR) onboard the GPM Core Observatory (GPM-CO) covers the area between 67 °N and 67 °S of the globe. The high quality of the precipitation measurements is supported by several validation (Schwaller et al., 2011, Kim et al., 2014, Speirs et al., 2017) and field campaigns (Lee et al., 2019; Houze et al., 2017; Tao et al., 2016). For the development of the PNPR-CLIM algorithm an observational dataset, built from coincident, in space and time, DPR precipitation measurements with the MHS radiometer measurements (BTs), has been used in the NN design phase.

3.2.2.1 The Dual-frequency Precipitation Radar

The GPM-CO DPR is the second space-borne precipitation radar, following the Precipitation Radar launched on the TRMM satellite in November, 1997. The DPR consists of a Ku-band (13.6 GHz) and a Ka-band (35.5 GHz) radars. These Earth-pointing KuPR and KaPR instruments provide 3D precipitation measurements over all surfaces between 67 °N and 67 °S since March 2014. The KuPR and KaPR design specifications are shown in table 11.

Table 11: Summary of the characteristics of the GMP Dual Precipitation Radar. The GPM KuPR minimum threshold is closer to 12–13 dBZ than the official 18 dBZ in the table (from Tang et al., 2017).

Instrument

GPM DPR

KaPR

KuPR

Launch time

27 Feb 2014

27 Feb 2014

Altitude (km)

407

407

Inclination angle (°)

65

65

Frequency (GHz)

35.547 and 35.553

13.597 and 13.603

Horizon resolution at nadir (km)

5

5

Swath width (km)

120

245

Vertical resolution (m)

250/500

250

Minimum detectable Ze (dBZ)

12 (KaHS)
18 (KaMS)

18

Measurement accuracy (dBZ)

< ±  1

< ± 1

3.2.2.2 The NN training dataset

Table 12 presents the main characteristics of the MHS-DPR coincidence database used for the NN design. This dataset was built as follows. Coincidences between NOAA-18, NOAA-19, MetOp-A, MetOp-B MHS measurements and DPR Ku-band measurements within a time interval of 15 minutes were considered for the creation of the database. The database covers the period from 1 January 2015 through 31 December 2016 (24 months). The GPM level-2 precipitation product obtained by combining the GMI and DPR measurements (Grecu et al., 2016) (2B-CMB, version 06A) is used as reference. In particular, the precipitation estimates used in the observational database are provided on the Ku-band radar swath (245 km wide) and obtained from the DPR Ku-band reflectivity and GMI brightness temperatures. The observational database is made of co-located vectors of MHS BTs (from the FIDUCEO dataset, see section 2.1.1) and 2B-CMB surface precipitation rate spatially averaged to match the MHS IFOV (variable along the scan line). Some model-derived variables (from ECMWF ERA5, see section 2.1.2) have been added to the database (see table 13) to be used, together with the input BTs, in the algorithm.

The year 2015 has been used for the training/optimisation phase, while the 2016 dataset has been used for validation and testing.

Table 12: Characteristics of the NN training dataset built from MHS-DPR coincidences

Period

1/01/2015 – 31/12/2016

Geographical area

67 °S – 67 °N, 180 °W – 180 °E

Number of MHS orbits

36,856

Number of pixels with precipitation

3,000,000

Horizontal resolution (Km)

16 km × 16 km (nadir) 26 km × 52 km (scan edge)

Reference precipitation product

2B-CMB level-2 GMI/DPR combined V06A on Ku-band radar swath (NS)

MHS BTs

FIDUCEO FCDR v4.1


Table 13: List of variables in the NN training dataset built from MHS-DPR coincidences.

Variable in the database

Data source

Latitude (MHS pixel)

FIDUCEO FCDR v4.1

Longitude (MHS pixel)

FIDUCEO FCDR v4.1

Mean Time (of DPR pixels within the ATMS pixel)

2B-CMB level-2 GMI/DPR combined V06A

Surface precipitation rate

2B-CMB level-2 GMI/DPR combined V06A

Precipitation liquid fraction information

2B-CMB level-2 GMI/DPR combined V06A

Time of MHS pixel

FIDUCEO FCDR v4.1

MHS Scan position

FIDUCEO FCDR v4.1

Sea ice information

ECMWF ERA5

2 m temperature

ECMWF ERA5

Total column integrated water vapor

ECMWF ERA5

Freezing level

ECMWF ERA5

Snow depth

ECMWF ERA5

Land/Sea Mask

ESA

3.2.3 Algorithm Flowchart

The PNPR-CLIM algorithm high-level flowchart is shown in figure 3.


Figure 3: Flowchart of the PNPR-CLIM precipitation rate retrieval algorithm.

The PNPR-CLIM algorithm is composed of three main parts:

  1. Input data block, which includes:
    1. The MHS / AMSU-B BTs provided by FIDUCEO (v4.1);
    2. The surface classification maps (surface type, presence of snow/ice);
    3. The model-derived ancillary variables (monthly means from ECMWF ERA5).
  2. Precipitation classification module that provides the rain/no-rain classification of pixels and the identification of rain areas;
  3. Precipitation rate estimate module that provides the retrieval of precipitation for the pixel classified by the classification module as precipitation pixels. This module also includes a calibration module for optimizing the convective precipitation estimate.

In the present scheme the algorithm takes as input the FIDUCEO v4.1 BTs of the MHS and AMSU-B radiometers by checking the quality of the input data (quality control module), some ancillary information regarding the thermodynamic state of the atmosphere (from the ECMWF ERA5 model), and regarding the state of the background surface. All the inputs feed the precipitation classification module that is optimized for the detection of the precipitation and its classification. The BTs arrays and the corresponding ancillary data of pixels classified as precipitating, feed the precipitation rate estimate and calibration module. The quality index module evaluates a pixel-based quality flag using the quality of the input data and the accuracy of the retrieval in different meteorological and environmental conditions (e.g. presence of ice/snow, dry condition, and strong convection).

3.2.4 Precipitation classification module

3.2.4.1 Introduction

In general, the identification of precipitation areas, or Precipitation/No-precipitation Classification (PNC) of pixels, represents a preliminary step to the MW precipitation retrieval and is considered crucial to obtain good performances in passive microwave precipitation retrieval (Ferraro et al., 1998; Seto et al., 2008; Sudradjat et al., 2011; Kirstetter et al., 2013; Kacimi et al., 2013). Therefore, the success of any MW retrieval algorithm relies on proper identification of precipitating pixels and the screening of non-precipitating pixels that might produce a signature similar to that of precipitation (Ferraro et al., 1998). For example, over land, the PNC discrimination is difficult due to the high variability of ground emissivity (Grecu and Anagnostou, 2001). This filtering process is therefore critical for instantaneous retrievals but even more so when developing accumulated rain products (Kacimi et al., 2013). PNC, in general, assigns a deterministic flag for precipitation or no-precipitation to each pixel; then, only observations with a rain flag are processed in the precipitation retrieval module.

3.2.4.2 PNC Module structure

PNC module consists of a stand-alone NN classifier with 2 hidden layers of 45 and 15 units, and sigmoid transfer functions. Its output turns out to be a continuous function with values in the range [0, 1] which, under suitable hypotheses on the training dataset distribution (see Bishop, 1995, for more details), approximates the probability of precipitation given the input observation. With this interpretation in mind, the threshold value 0.5 is used to distinguish precipitating (> 0.5) and non-precipitating states (≤ 0.5).

The network ingests three types of input variables: instantaneous, average, and static variables (see table 14). The instantaneous variables are the AMSU-B / MHS FCDR BTs (at 89, 150/157, 1831, 1833 and 190/1837 GHz). The monthly variables from the ERA5 reanalysis include the 2 m Temperature (T2m, K), Freezing Level (FL, m), Total Precipitable Water Vapor (TPWV, kg m-2), Snow Depth (SD, cm) and Sea-Ice fraction (SIF, dimensionless). Finally, the static variables are the (secant of the) scan angle (ANG) and the surface type (Ocean, Land or Coast (OLC)).

Table 14: PNC module input variables.

Input Variable

Variable dimensions

Variable type

BT

5

Instantaneous

monthly T2m

1

Average

monthly FL

1

Average

monthly TPWV

1

Average

daily SD

1

Average

daily SIF

1

Average

ANG

1

Static

OLC

1

Static


The static and average variables determine a sort of a priori status of the atmosphere, allowing the NN to correctly interpret the radiative signal (after applying a zenith correction, if necessary): for example, a depression in the BT around the 157 GHz window channel is commonly due to the scattering of precipitating particles. However, if the atmosphere is sufficiently dry, the emission signature of a radiatively cold surface may lead to the same effect on the measured BT. Through the use of the ancillary data the NN aims to discriminate these different states.

The network architecture described above was not arbitrarily chosen: several different combinations of layers, units per layer and activation functions were tested using the 2015 DPR-MHS coincidence dataset (training dataset, see section 3.2.2.2).

3.2.4.3 PNC Module performance verification

The performance analysis of PNC module was carried out using the 2016 DPR-MHS coincidence dataset to verify consistency and stability of the NN performance.

Before proceeding with the module assessment, let us introduce some further notation. Usually, for two-classes classification problems, where the target variable t and the prediction variable y assume binary values (1 for rain and 0 for no rain), the validation dataset (VD) can be divided into four disjoint subsets forming the contingency table defined by eqs. 3.4:

\[ VD = TP \cup TN \cup FP \cup FN \qquad (3.4) \] \[ TP = \{t=1,y=1\}, \qquad \] \[ TN = \{t=0,y=0\}, \qquad \] \[ FP = \{t=0,y=1\}, \qquad \] \[ FN = \{t=1,y=0\}, \qquad \]

where TPs are True Positives, TNs True negatives, FPs False Positives, and finally FNs represent False Negatives, respectively. These classes merge up to four additional classes: T (TPFN), P (TPFP), F (TNFP) and N (TN ∪ FN).

Considering the statistical parameters defined in eqs. 3.4, several indices can be computed. The accuracy (acc), probability of random agreement (rnd) and the Cohen's kappa (𝜅, Cohen, 1960) are defined as follows (where the modulus denotes the size):

\[ acc = \frac{|TP| + |TN|}{|VD|} \qquad (3.5) \]

\[ rnd = \frac{(|TP| + |FP|)(|TN|+|TP)+(|FP|+|FN|)(|TN|+|FN|}{|VD|^2} \qquad (3.6) \]

\[ \kappa = \frac{acc - rnd}{1 - rnd}. \qquad (3.7) \]

The accuracy is the probability that, given a sample, the model prediction is correct, whereas the probability of random agreement is the probability of having a random agreement (that is, only by chance) between prediction and truth. Finally, the Cohen's kappa, which is always less than or equal to 1 by definition, is a combination of the previous indices and can be considered a fair measure of the model reliability; his values are interpreted as follows:

  • : \( \kappa = 1 \) perfect classifier;

  • : \( \kappa = 0 \) success by chance;

  • : \( \kappa < 0 \) worse than random success.

For the PNC classifier, we expected that, due to resolution and channel assortment limitation (specifically the lack of low frequency channels, extremely useful for robust precipitation retrievals over ocean), very light precipitation detected by the DPR could be easily missed by the MHS and thus by any algorithm based on its observations. To highlight this behaviour and identify a sensitivity threshold, the Cohen's kappa was evaluated at various minimum precipitation rates (identifying the targets t = 1). Please, note that varying the minimum precipitation rate does not change the proportion of the predicted positives/negatives. Therefore, in order to balance the effect of introducing fictitious false alarms by increasing the detection threshold (small rates correctly identified as non-zero), the various indices were computed for 2B-CMB rate either equal to 0 mm/h or greater than the chosen minimum threshold. The results are shown in figure 4.

Figure 4: Sensitivity study for the PNC module through the analysis of the Cohen's kappa (blue dots) at various rainfall threshold values. The peak \( (\kappa = 0.71) \) testifies that the PNC module optimizes the detection of rainy pixels with rates greater than or equal to 0.30 mm/h

The maximum score \( (\kappa = 0.71) \) was achieved at around 0.30 mm/h. For the same threshold value the accuracy was about 0.79.

Generally, Cohen's kappa and accuracy account for right and wrong predictions in both rain and no rain classes. In this sense they are related to the overall model reliability. To assess the model on the single classes, instead, two other indices are used, the probability of detection (pod) and false alarm rate (far):

\[ pod=\frac{|TP|}{T} \qquad (3.8) \]

\[ far=\frac{|FP|}{P} \qquad (3.9) \]

The probability of detection represents the probability of correctly identifying precipitating pixels, whereas the false alarm rate indicates the fraction of samples wrongly classified as precipitating. Consequently, a perfect classifier will have pod = 1 and far = 0.

On the validation dataset the PNC module had a far of about 0.20. The pod can be evaluated for different thresholds. In this case, the index represents the probability of detecting rainy samples within a prescribed range, i.e., precipitation rates greater than a minimum threshold value, as shown in figure 5. It can be seen that, at the sensitivity threshold of 0.30 mm/h, the pod turned out to be 0.79. For higher rates, greater than or equal to 0.50 mm/h, its value was 0.87.


Figure 5: PNC module Probability of Detection at various rainfall threshold values.

To summarize, the PNC module, after being trained on the 2015 DPR-MHS coincident observations, was tested on the 2016 analogous dataset. For a minimum detectable precipitation of 0.30 mm/h, the module showed \( \kappa = 0.71 \) and acc = 0.79. False alarm rate, on the other hand, was about 0.20. At the same time, pod values above 0.80 were found for precipitation rates greater than or equal to 0.34 mm/h, increasing to values above 0.87 for precipitation rates greater than 0.50 mm/h.

3.2.5 Precipitation rate estimation module

The design of the NN the for the precipitation rate estimation (PRE) module of PNPR-CLIM algorithm has exploited the experience gained at the CNR-ISAC in the development of NN-based precipitation retrieval algorithms for cross-track scanning MW radiometers (Sanò et al., 2015, 2016). These algorithms (the PNPR v1 for AMSU/MHS and PNPR v2 for ATMS) were developed in the frame of the EUMETSAT H SAF to deliver H SAF L2 operational products P-IN-MHS (H02B) and P-IN-ATMS (H18) over the MSG full disc area (Mugnai et al., 2013). It is important to stress that while these algorithms use a model-based training dataset optimized for European and African regions for near real time applications, the PNPR-CLIM approach is based on the use of a global GPM-based observational dataset and on climatological ancillary data, for climatological applications.

The criterion used in the design of the PRE module, already tested in Sanò et al. (2015, 2016), is to use a single neural network for detecting precipitation on different types of surface. The use of different NNs for different surface types (i.e. land or ocean (Surussavadee and Staelin, 2008)) is suggested by the remarkably different characteristics of the microwave signatures of different backgrounds, especially in the window channels (e.g., 89 GHz) and/or in dry conditions. However, the use of different networks can often lead to discontinuity of the estimates in correspondence with transitions between different conditions. The approach of a single NN-based algorithm prevents discontinuities or inconsistencies in the retrieved precipitation patterns, while making the various phases of the design more complex (database, learning, network architecture, and selection of inputs).

Regarding the input selection, different typologies of BT and BT-derived variables have been considered and several test networks (more than 100) were designed. In addition to the use of BTs of the different MHS channels, also the BT differences in the water vapour absorption band channels at 183.31 GHz are used, although originally designed to retrieve water vapour profiles due to their different sensitivity to specific layers of the atmosphere (Wang et al., 1997; Staelin and Chen, 2000; Blackwell and Chen, 2005). These channels have shown great potential for the characterisation of precipitating cloud and for precipitation retrieval. The different penetration ability of these channels in the atmosphere can be exploited to analyse the vertical distribution of hydrometeors (Wang et al., 1989, 1997; Burns et al., 1997; Staelin and Chen, 2000; Ferraro et al., 2005; Hong et al., 2005, 2008; Funatsu et al., 2007, 2009; Laviola and Levizzani, 2011), and to obtain some criteria for the characterization of precipitation as weak, moderate, strong convective or stratiform, using the BT differences (e.g., Ferraro, 2004; Qiu et al., 2005).

The PRE module optimal NN consists of a stand-alone NN model with 2 hidden layers of 28 and 8 units, and sigmoid transfer functions. The NN has been developed using the 15 input variables shown in table 15.

Table 15: Optimal NN module input variables.

Variable Type

Input Variable

Variable dimension

Variable type

BT

89.0 GHz
157.0 GHz
183.31 ± 7.0 GHz

3

Instantaneous

BT Differences

183.31 ± 3.0–183.31 ± 1.0 GHz
183.31 ± 3.0–183.31 ± 7.0 GHz
183.31 ± 1.0–183.31 ± 7.0 GHz
183.31 ± 3.0–157 GHz
183.31 ± 7.0–157 GHz

5

Instantaneous

Model Derived Ancillary Variables

monthly T2m (K)
monthly FL (m)
monthly TPWV (kg m-2)
daily SD
daily SIF

5

Average
(from ECMWF ERA5 reanalysis)

Ancillary Variables

ANG
OLC

2

Static Map


The performance analysis of the PRE module was carried out using the 2016 dataset, which is an independent part of the observational MHS-DPR coincidence database, not used in the training and design phase of the algorithm (about 1.5 million points) (see section 3.2.2.2).

Figure 6: Number of occurrences (pixels) vs bins (1 mm/h) of precipitation values retrieved by the NN (PRE module) (red) and those from 2B-CMB in the verification database (blue). The left panel refers to ocean and the right one to land.

Figure 6 shows, in a bar graph, the comparison between the number of occurrences of the precipitation values provided by the NN (red) and those in 2B-CMB (blue), as a function of the precipitation in bins of 1 mm/h for ocean and land. A good agreement is seen between the NN-derived values and the 2B-CMB verification database across all bins, especially over oceans. Some small differences (mainly for land) for high precipitation values (> 15 mm/h) are essentially due to the low number of occurrences (less than 103). For small values of precipitation (0.1 to 5.0 mm/h), where the number of occurrences is larger (104 – 106), the agreement is very good.

Figure 7: 2D histograms of pixel-based comparison of surface precipitation rate estimates from the PRE module (x-axis) and the corresponding 2B-CMB values in the verification dataset (y-axis), over ocean (left) and land (right). Only pixels where both the NN and the verification dataset provide precipitation (TP pixels) are shown. A logarithmic scale is used for the precipitation rates in mm/h.

Table 16: PNPR-CLIM precipitation retrieval module statistical indexes.

 

Ocean

Land

BIAS[mm/h]

0.02

0.01

CC

0.75

0.70

RMSE[mm/h]

1.02

1.08

Another result of the verification study is shown in figure 7. It shows the 2D histogram of the surface precipitation rate estimates from the NN and the corresponding values in the 2B-CMB dataset over ocean and land. Only pixels for which both the neural network and the 2B-CMB provided rainfall estimates ≥ 0.1 mm/h (TP pixels) were considered. In the scatterplot, the logarithmic axes represent the precipitation rate (NN vs. GPM 2B-CMB referred to as DPR), while the colour represents the number of points in the dataset for each 2D precipitation rate bin. Most of the points are close to the main diagonal for both ocean and land, with slight overestimation of very low precipitation (precipitation rate < 0.5 mm/h) over land by the NN.

The values of the statistical indices (hit bias, correlation coefficient (CC), and RMSE) calculated over the entire verification dataset are also provided in table 16, and they confirm the good agreement between the NN retrievals and the verification dataset with very similar performances for both ocean and land pixels.

Figure 8: As in figure 7, with normalized density scatterplots of the NN (PRE module) and 2B-CMB (DPR) mean precipitation rates (ocean on the left, land on the right).

Figure 8 shows the normalized density scatterplot of the NN retrieval of rainfall rates and the corresponding values in the 2B-CMB dataset, for ocean and land surfaces. Normalisation was performed on the number of the 2B-CMB dataset instances in each precipitation rate bin (i.e. by normalising the scatterplots in figure 7 by the sum of instances in each column). In this way, the scatterplot highlights the rain rate distribution regardless of the number of occurrences for the various values of precipitation. This figure also shows the good agreement between the precipitation values resulting from the NN and from the 2B-CMB, with a slight underestimation by the NN for values greater than 1 mm/h.

The results presented, in addition to showing the agreement between the PRE module and 2B-CMB estimates, also evidence the good outcome of the network training, allowing the use of one unique NN over different land-surface types. The NN, applied to the independent verification global dataset with precipitation rates extending over a wide range of values, shows a good ability to retrieve global precipitation without anomalous inhomogeneity in the estimates.

3.2.6 The deep convection calibration procedure

A procedure to calibrate for deep convection has been developed in order to adjust the PRE module estimates in certain weather conditions (i.e., the presence of deep convection). The NN of the PRE module has been optimized in order to reproduce, in the most reliable way, the rainfall within the training database. However, since different precipitation regimes are not equally well represented in the dataset, the NN tends to optimize the precipitation estimate corresponding to the most frequent conditions. From figure 6 it is evident that the data points corresponding to precipitation rate greater than 10 mm/h are only a small part of the training dataset. This feature can be a weakness during the NN learning phase as far as intense precipitation regimes are concerned (e.g., the presence of deep convection). This issue leads to an underestimation by PNPR-CLIM (with respect to global precipitation datasets) over land areas characterized by deep convection, as shown in Panegrossi et al. (2020, EGU).

An additional verification study has been carried out using, as ground truth, the precipitation rate estimates made available through the Multi-Radar-Multi-Sensor (MRMS) system (Zhang et al., 2011; Zhang et al,. 2016; Tang et al., 2020). MRMS is a US and Canadian effort to provide Cartesian gridded level 2 and 3 radar products at 0.01° horizontal resolution, 2 min temporal sampling, combining 180 ground radars of the US and some of the Canadian network. It provides high-quality and high-resolution severe weather and precipitation products for meteorology, hydrology, and aviation applications. MRMS Quantitative Precipitation Estimation (QPE) algorithms are largely based on components from the National mosaic and multi-sensor QPE systems (Zhang et al., 2016). MRMS coverage is 126.65 °W to 60.15 °W and 22.52 °N to 54.90 °N. One year (2017) of the hourly precipitation, gauge adjusted MRMS product (freely available at https://mtarchive.geol.iastate.edu/) have been averaged over a 0.25°×0.25° regular grid to be compared with the PNPR-CLIM precipitation estimates. To obtain coherent measurements, the instantaneous precipitation rates from PNPR-CLIM were converted to hourly estimates by averaging all the estimates within 1 hour interval for each 0.25°×0.25° grid box.

The discrepancies between the PNPR-CLIM and MRMS precipitation products were evidenced by pronounced differences in the relative probability density functions, where the PNPR-CLIM overestimation leads to higher occurrences of heavy precipitation regimes (> 10 mm/h), followed by a quick cut-off beyond 40 mm/h (values not represented in the training phase), as shown in Figure 10, left panel, where the number of occurrences of precipitation values retrieved by PNPR-CLIM (blue) and those estimated by MRMS (orange) are displayed. The precipitation distribution shows that PNPR-CLIM overestimates precipitation rates greater than 10 mm/h, but underestimates precipitation rates over 40 mm/h.

The calibration procedure has been developed in order to improve the PNPR-CLIM precipitation estimate for deep convection events. The methodology adopted to identify situations of deep convection, uses the following conditions (Hong et al., 2005, Funatsu et al., 2007) on the BT differences of the 183.31 GHz channels:

\[ \Delta T_{17} > 0 K, \qquad\qquad \] \[ \Delta T_{13} > 0 K, \qquad(3.10) \] \[ \Delta T_{37} > 0 K, \qquad\qquad \]

where \( \Delta T_{17} \)  is the difference between channels 183.31±1 GHz and 183.31±7 GHz,  \( \Delta T_{13} \)  the difference between 183.31±1 GHz and 183.31±3 GHz, \( \Delta T_{37} \)  the difference between 183.31±3 GHz and 183.31±7 GHz. The method is based on the concept that channels further away from the centre of the water vapour absorption band at 183.31 GHz can see deeper into the cloud, hence being subjected to greater scattering from the middle or low layers of the deep convective clouds (Burns et al., 1997; Wang et al., 1997).

Any calibration curve should aim at fairly distributing the overestimates confined in this range, avoiding at the same time abrupt changes in the probability density function itself. To this purpose, it is particularly convenient to generate a calibration curve while trying to match-up the two probability densities. For one-dimensional distributions this leads to a calibration function given by \( f = G^{-1} \circ F, \)  where F and G are the cumulative densities of the distribution of the re-gridded hourly products based on PNPR-CLIM and MRMS, respectively.

In order to tune the calibration procedure and to avoid discontinuities between the calibrated precipitation fields and those where the calibration is not applied, a parametric calibration has been chosen:

\[ f_{t}= (1 - t)f_{0} + tf \qquad (3.11) \]

where f0  is the identity map and 𝑡 = 𝑡(𝛥𝑇17, 𝛥𝑇13, 𝛥𝑇37, 𝜙) is a convex, smooth function of the BT differences and the latitude 𝜙 such that t = 1 on the region characterized by deep convection (equation (3.10)) and mid-low latitudes (|𝜙| < 60°), and t = 0 outside of those regions. The shape of ft for different values of its variables is shown in figure 9.

Figure 9: The right panel shows the calibration functions that are applied according to the values assumed by the differences between 183.31 GHz channels. The left panel shows a detail for rain rate less than 1 mm/h.

Figure 10 shows the effect of the calibration module in solving the aforementioned overestimation and underestimation problems.

Figure 10: Number of occurrences of precipitation rate values retrieved by the PNPR-CLIM (blue) and MRMS (orange). In the left panel PNPR-CLIM without calibration is shown while the right panel shows the calibrated PNPR-CLIM.

3.2.7 The quality index

The algorithm provides a quality index to be associated with the estimated value of surface precipitation rate. The quality flag summarizes the product quality and reliability and provides a simple and immediate criterion for the evaluation of the products towards a correct selection and application of the precipitation estimates with respect to the analysed scenario. This index has been constructed based on seven different criteria:

  1. Quality of input data: The information provided by FIDUCEO is used to identify the BTs with less reliability (FIDUCEO quality index: 0 = Reliable, 1 = Use with caution, 2 = Unreliable);
  2. Background surface index: The quality is reduced for snow-covered background or presence of sea ice. Daily maps of snow/sea ice cover from ECMWF ERA5 are used to identify these conditions;
  3. Orography index: The quality index is reduced if the standard deviation of the terrain elevation within the pixel exceeds a certain threshold (400 m). The ETOPO1 land topography was exploited (Amante et al., 2009);
  4. Radiometer Scan index: The quality index is reduced for observations taken at the 5 outermost pixels of the scanline (at the lowest spatial resolution);
  5. Precipitation Probability Index: The quality index is reduced if the probability of precipitation is between 30 % and 70 %. In this range, the rain/ no rain discrimination has greater uncertainty;
  6. Calibration Index: The quality index is reduced if the BTs belong to the deep convection region identified by equation (3.10);
  7. High Latitudes Index: The quality index is reduced at high latitudes(ϕ>60°) in extremely cold and dry conditions. In these conditions the background surface contaminates the precipitation signal, leading to an overestimation of the precipitation and/or false detection. This effect is identified by using a threshold on the 89 GHz channel (BT89GHz < 175 K).

These criteria make it possible to create two quality indices, The Quality Flag Index (QF) and the Bit Quality Flag Index (BQF). BQF is a bit flag: the positions of the bits with value 1 indicate which of the seven conditions listed above have occurred (see table 17). QF is an integer variable, built from BQF, with values ranging from 0 (high quality) to 3 (poor quality); a further value (4) indicates invalid data. It is defined as the number of non-zero bits of BQF (with maximum allowable value QF = 3). It is also worth pointing out that, if the third (snow-covered surface) or fourth (sea ice) bit of BQF is non-zero, then QF is set to 3 by default because the snow/ice background significantly decreases the capability of predicting precipitation rates. Figure 11 shows the distribution of the QF index for the year 2017.

Table 17: Contribution of the conditions occurred in the BQF index.

BQF

Occurred Condition

0

Precipitation Probability Index

1

Radiometer Scan index

2

Quality of input data

3

Background surface index (Snow)

4

Background surface index (Sea Ice)

5

Orography index

6

High Latitudes Index

7

Calibration Index

8

Invalid Data

Figure 11: Distribution of the QF index (percentage over all the available observations) for the year 2017.

3.3 HOAPS v4

For HOAPS-v4, a NN algorithm is used to quantify precipitation from the SSMI(S) FCDR BTs. The neural network was trained with precipitation rates retrieved from assimilated brightness temperatures in a 1D-Var scheme from ECMWF. The training data set is based on radiative transfer calculations as described in Bauer et al. (2006a, b). The data set contains one month (August 2004) of assimilated SSM/I brightness temperatures and the corresponding ECMWF 1D-Var retrieved precipitation values of the ECMWF model. For more details on the neural network architecture of the precipitation retrieval algorithm see [D1] and Andersson et al. (2010). Output is provided over ice-free ocean. For precipitation, a sensitivity threshold is implemented at 0.3 mm/h. Below this threshold, a level 2 observation is considered as non-precipitating and consequently set to zero. The algorithm itself was developed outside the C3S project.

3.4 Gridding and merging to produce the COBRA precipitation dataset

3.4.1 Generation of per-satellite hourly gridded data (FPG algorithm)



Figure 12: Schematic of the FPG algorithm.

Table 18: Semi axes in km for the two instrument classes. For AMSU-B / MHS, the semi axes depend on the scan position sp = 1, …, 90 with 45 and 46 being the central positions in the scan line, close to the sub-satellite point. It is nb = sp for sp ≤ 45 and nb = 91 - sp else


AMSU-B / MHS (PNPR-CLIM)

SSM/I / SSMIS (HOAPS v4)

AMSR-E2 (HOAPS v4 extended)

TMI (HOAPS v4 extended)

Along-scan

\[ 0.5 \cdot 79.08 + 2.84 \cdot nb - 14.78 \cdot nb^{0.666} \]

15.5

15.5

9

Cross-scan

\[ 0.5 \cdot 28.72 - 0.90 \cdot nb + 0.094 \cdot nb^{1.5} \]

22.5

22.5

16

2 AMSR-E brightness temperatures from three neighbouring scan positions are averaged to match the SSM/I and SSMIS resolutions, see section 1.2

When generating L3 (i.e. spatially and temporally gridded) daily data, the L2 instantaneous precipitation rates are first averaged in hourly intervals on the final 1° × 1° latitude-longitude grid (referred to as 1 degree hourly (1DH)). This intermediate step is carried out so that diurnal cycles are represented optimally wherever the L2 observational database allows (see sections 3.4.3 and 3.4.4). The monthly data are computed as averages of the instantaneous precipitation rates directly. However, the L2 instantaneous precipitation rates have not been processed twice (for hourly and monthly averaging, respectively). Instead, the monthly averaging uses intermediate results from the hourly binning (see below).

The conventional gridding technique averages in every grid cell only those data, for which the centre of the footprint falls inside the grid cell, implying that each L2 datum is allocated to exactly one grid cell, despite its footprint overlapping with neighbouring grid cells, too. This approach can be expected to work sufficiently well where grid cells are large compared to the footprints of the observations, i.e., the integrated area from which the instruments receives its signal during one single observation. In the final dataset, we include polar regions where the extent of single cells in the equidistant grid decrease towards zero in longitudinal direction. Consequently, many grid cells remain unobserved in the conventional gridding and need to be filled in an additional post-processing step. Therefore, we opted to develop a tailored gridding algorithm that computes the areas of overlap (AO) for each observational footprint and the grid cells in the vicinity, implemented in python 3.7. When averaging data in one grid cell, they are weighted by their AO with this grid cell. We will refer here to the algorithm as FPG ("Footprint Gridding").

The FPG algorithm processes one day of L2 data for one platform. Figure 12 shows a flow chart for FPG. It consists of the following modules:

  • L2 Input: FPG reads one-dimensional arrays containing scan line time and scan position and two-dimensional (scan line vs. scan position) arrays for precipitation rates in mm/h, latitudes and longitudes of footprint centres, and – in the case of PNPR-CLIM – quality flag (see section 3.2.7).
  • Grid set-up: The 1° × 1° grid is defined in terms of grid cell edges and centre points. For each grid cell, the European Petroleum Survey Group (EPSG) code of the respective optimal Universal Transversal Mercator (UTM) zone is determined, based on the geographical coordinates of the grid cell's centre point. The grid cell's outline is sampled regularly by npe points per edge in latitude-longitude coordinates, which are then transformed into the grid cell's outline polygon in the respective UTM coordinates in m. In the following, we opt for npe = 5.

  • Footprint set-up: The ellipses of the footprints are set-up in Cartesian coordinates. The semi axes are in along-scan and cross-scan directions. Table 18 contains the formula or values for semi axes lengths. In the case of MHS and AMSU-B (PNPR-CLIM), we use the parameterisation of Bennartz (2000). Note that for the cross-track scanning geometry, the along-track direction corresponds to the cross-scan direction, as does the cross-track direction and along-scan direction. In the case of SSM/I and SSMIS observations (HOAPS v4 L2 observations) are independent of the scan position, due to the conical scanning geometry. The values reported in Table 18 correspond to the footprint of the 37 GHz channel of the instruments, as higher-resolution observations are first averaged to these footprints in the HOAPS processing. The ellipses are sampled as polygons with points spaced by 10° increments in the polar angle of the 2-dimensional Cartesian plane.
  • Hourly gridding: The L2 data of the considered day (24 h) are sliced in hourly subsets. For each hour, probably covered grid cells are determined as follows:
    1. The polygon formed by the hull of the entire hourly L2 swath (i.e., of all locations for which an observation has been recorded in this hour) is retrieved in terms of latitude-longitude coordinates. Grid cells are tested whether they fall inside the respective polygon. It is possible that many hourly swaths cover at least one of the poles, or cross the longitudinal ±180° boundary, where such a polygon would not be well defined. In this case the swath and the grid cells are thus rotated towards the equator inside the ±180° longitude limits. Grid cells that fall inside the polygon or have a minimum distance to the edge of the swath of less than 250 km (evaluated using the Haversine formula for the grid cell centre points and the footprint centre points of the outmost scan positions) are considered as probably covered.
    2. It can occur, due to the imperfect identification of the orbit situation, that the rotation (mentioned under point a) fails to move the swath away from the poles or the ±180° longitudinal boundary. In these cases, the additional rotation is discarded and the grid cells are instead identified as possibly covered if they have a minimum distance to the edge of the swath of 1200 km in the same sense as above.
    3. Additionally, in the case of HOAPS v4 observations, grid cells not covering open ocean are filtered out.

Note that the two maximum distances (250 km and 1200 km) have been tested to leave a small band of grid cells around the swath which are processed but do not contain observations. The limits can consequently be considered appropriate.

For each possibly covered grid cell, the possibly overlapping scan lines are identified as having the minimum distance between the grid cell centre and the centres of the footprints in one scan line. The distance, computed via the Haversine formula, must be lower than \( 1.01 \cdot (sa_{max} + \sqrt{2} \cdot 2\pi R \cdot \frac{1°}{360°} ), \)  where samax is the maximum length of the semi axes (see table 18) and R = 6371 km is the Earth's radius.

For each observation i in the remaining scan lines, the footprint ellipses are obtained as polygons with coordinates in the optimal UTM zone of the grid cell as follows:

    1. The geographical coordinates of the footprint centre point and the two neighbouring ones are converted to the coordinates of the grid cell's optimal UTM zone.
    2. Along- and cross-track directions in the 2-dimensional UTM frame are computed from the coordinates of the two neighbouring centre points.
    3. The polygonal representations of the ellipses (from "footprint set-up") in Cartesian coordinates are shifted and rotated to match the UTM coordinates of the footprint centre point and the respective directions of the semi axes.

For extreme scan positions at the very edge of the scan line, the semi-axes directions are computed from the coordinates of the centre point itself and of the one neighbouring scan position.

It should be noted that this representation of footprint ellipses is only an approximation, due to, for example, the parameterisation of AMSU-B/MHS footprints, and the distortion of distances in UTM coordinates towards the UTM zones' boundaries. The correct footprint extent (preferably in geographical coordinates) could be calculated from orbital and viewing parameters, also taking into account the deviation of the Earth's shape from a sphere. However, orbital parameters are not present in L2 observations and cannot easily be reconstructed. As fine structures in the precipitation fields are not in the focus here, both due to the relatively large footprints of the involved instruments and to the coarse resolution of the output grid (1° × 1°), we expect the approximated ellipses to be sufficiently accurate. Their centre points correspond to the original points present in the L2 data. The respective error in geolocation remains, but is not enlarged by our processing. Any discrepancies between the approximated and the real footprint ellipses originating from the approximation itself would then manifest as deviations of the ellipses' outlines.

The AO between the footprint polygon i and the grid cell polygon, Ai is computed using the "intersection" method of the "Polygon" class in the python module shapely.

From the original L2 data Pi, we compute the following quantities as an intermediate result, which is also part of the output for possible later averaging over multiples of hourly intervals (for example the monthly means):

\[ pxa = \sum_{i} A_{i}P_{i} \qquad (3.12) \]

\[ p2xa = \sum_{i} A_{i}P_{i}^2 \qquad (3.13) \]

\[ norm = \sum_{i} A_{i} \qquad (3.14) \]

The 1DH average precipitation precip_mean and the respective standard deviation precip_stdv (both in mm/h) are then computed as

\[ precip\_mean= \frac{pxa}{norm} \qquad (3.15) \]

\[ precip\_stdv= \sqrt{\frac{p2xa}{norm}-precip\_mean^2} \qquad (3.16) \]

Additionally, in the case of PNPR-CLIM L2 data, the quality flag is processed. From all observations with a non-zero AO with the grid cell, the minimum and maximum quality flags are retained (qf_min, qf_max⁡). The average quality flag qf_mean is computed in the same way as precip_mean. Using the "union" method in the "Polygon" class, the fraction of the grid cell that is covered by the ellipses in the hour, acov, is also determined.

  • Output: The following 3-dimensional, gridded variables are written to a netCDF file covering the entire processed day for the respective satellite: precip_mean , precip_stdv, pxa, p2xa, norm, qf_min, qf_max, qf_mean, qxa (equivalent to pxa for quality flag instead of precipitation rate), acov, numo (number of L2 data with non-zero overlap). The additional 3-dimensional integer variable tested_cells contains the value 2 for grid cells covered by data, 1 for grid cells that have been tested and found not be overlapped by any footprint in the hourly interval, and 0 elsewhere. The respective spatiotemporal coordinates (time, latitude, longitude) and their respective bounds are written out, too.

Figure 13: Probability density functions (PDF) for precipitation over ocean in the two datasets, based on all available 1DH data (NOAA15 and NOAA16 are excluded in the case of PNPR-CLIM due to lack of overall stability). The various panels cover different latitudinal bands specified in the panels' titles. The colors represent months. Solid lines correspond to PNPR-CLIM; dashed lines correspond to HOAPS. Zero-precipitation events have been excluded. Note the logarithmic scale of the y-axes and the nonlogarithmic, nonlinear scale of the x-axes.

3.4.2 Post-processing

3.4.2.1 Bias correction of 1DH precipitation rates

The overall distributions of hourly gridded data produced by FPG (section 3.4.1) differ strongly for the PNPR and HOAPS datasets over the ocean, see figure 13. It was decided to harmonise the datasets over the ocean. A dataset can be manipulated such that its distribution matches that of another one by quantile mapping, i.e., based on cumulative probabilities. In our case, this means summing up the PDFs displayed in figure 13. For a given month and latitudinal band, we then have a cumulative PDF for HOAPS, fH, and one for PNPR, fP. For a given 1DH non-zero rate of precipitation in HOAPS, pH, the corresponding precipitation rate according to the PNPR PDF is \( p_{H2P} = f^{-1}_P(f_H(p_H)) \) . Here, \( f^{-1}_P \) , is the inverse of the respective cumulative PNPR PDF. Likewise, a 1DH non-zero PNPR precipitation rate, pP, can be adjusted to the HOAPS distribution as \( p_{P2H} = f^{-1}_H(f_P(p_P)) \) . Mapping function based on the PDFs in figure 13 are illustrated in figure 14.

Figure 14: Mapping of 1DH precipitation rates over ocean between HOAPS (x-axes) and PNPR (y-axes) for the same categories (latitudinal bands in panels, months as colors) as in figure 13, based on the PDFs shown in figure 13. Note the nonlinear scaling of the axes. For an improved interpretability, we are also including black dashed lines corresponding to scaling with a constant factor (see lower left panel for the annotations of these lines), as well as the identity (1:1) mapping as a solid black line. The mapping functions are constructed as described in the main text. At higher precipitation rates, the populations become sparser, which is why a spline fit towards an identity (1:1) mapping has been carried out from the first occurrence of respective discontinuities to avoid an unphysical, spurious mapping.

For example, rates around 1 mm/h in HOAPS are scaled to higher values when mapped to the PNPR distribution at low latitudes (upper right panel in figure 14), or low rates in PNPR are scaled to higher values when mapped to the HOAPS distribution at high latitudes (left panels in figure 14). A comparison of resulting merged daily and monthly values (see sections 3.4.3 and 3.4.4) yielded a general underestimation in high latitudes and a likely underestimation in low latitudes. Therefore, we opted for a bias correction of PNPR 1DH values towards HOAPS in high latitudes and vice versa in low latitudes. In both cases, this generally increases the overall amount of precipitation.

The exact procedure is as follows: For each month, the individual mappings (PNPR-to-HOAPS and HOAPS-to-PNPR) are linearly interpolated in latitude onto the 1° output grid, assuming that the mappings shown in figure 14 are valid at the central latitudes in the respective bands, i.e. at ±70°, ±40°, ±20°, 0°. At latitudes poleward of ±70°, the respective mapping at ±70° is retained. This latitudinal interpolation is carried out to avoid discontinuities in the climatology at the boundaries of the latitudinal bands. HOAPS non-zero 1DH precipitation rates are corrected according to the above described latitudinally interpolated HOAPS-to-PNPR mapping in latitudes between -20° and +20° and mapped to identity (i.e., retained) at latitudes poleward of ±30°. Between ±20° and ±30°, the above (HOAPS-to-PNPR) mapping and the identity mapping are mixed with linearly increasing or decreasing weights. The correction of non-zero PNPR 1DH precipitation rates is implemented inversely, i.e., apply the latitudinally interpolated PNPR-to-HOAPS mapping poleward of ±30°, map to identity (i.e., retain) between -20° and +20°, and mix linearly in-between. Zero precipitation is always retained as zero precipitation.

3.4.2.2 Bias correction of 1DH standard deviation and other variables

The standard deviation in a 1DH grid cell (which is part of the FPG output (section 3.4.1)) cannot be corrected as straightforwardly as the mean values, precip_mean (described in section 3.4.2.1). We mapped the precip_mean ± precip_stdv and derived the bias-corrected standard deviation from the resulting difference. However, the standard deviation can occasionally exceed the mean value, such that precip_mean - precip_stdv < 0, which does not make sense for precipitation and the above quantile mapping. In this case, we determine the scaling factor s = precip_stdv / precip_mean by which we have to scale precip_stdv such that precip_mean - precip_stdv / s = 0. We can then bias-correct the values precip_mean ± precip_stdv / s according to section 3.4.2.1, derive the bias-corrected standard deviation from the respective difference, and re-scale by factor s. In cases where precip_mean - precip_stdv ≥ 0, it is s = 1.
The variables pxa and p2xa (see section 3.4.1) need to be biased-corrected, too, so that monthly values are treated similarly. As for 1DH values, it is precip_mean = pxa / norm (see section 3.4.1), the bias-corrected pxa can be computed directly from the bias-corrected 1DH precip_mean (described in section 3.4.2.1).
With the relation between precip_stdv, precip_mean, norm, and p2xa given in section 3.4.1, the bias-corrected p2xa can be computed based on the bias-corrected precip_stdv and precip_mean.

3.4.2.3 Filtering

3.4.2.3.1 Filtering of NOAA15 data

Precipitation rates retrieved by PNPR-CLIM using NOAA15 observations prove to be spurious at all available times, with certain drifts occurring later in the time series. Hans et al. (2017, Appendix A5.3) recommend to not use channel 3 data after year 2000. With NOAA16 being available only from early 2001 and because we rely on AMSU-B/MHS observations over land, we retain NOAA15 observations from 2000/01/01 to 2001/03/31, but set the respective quality flag of NOAA15 observations to three (worst quality).

3.4.2.3.2 Filtering of NOAA16 data

NOAA16 observations are not used from 2010/01/01 onwards, due to a drift in respective precipitation rates, see also Hans et al. (2017).

3.4.2.3.3 Filtering of NOAA19 data

NOAA19 observations on 2017/10/09 appeared unreasonably high. All observations on that day were discarded.

3.4.2.3.4 Filtering of AMSR-E data

Due to the higher native resolution of AMSR-E observations, the applied sea-ice mask for the retrieval of precipitation rates in HOAPS creates erroneous values where the signal over sea ice leaks into the averaged brightness temperatures (section 1.2.3). It manifests in extremely high precipitation, mostly in polar latitudes. These outliers are always based on a low number of instantaneous L2 observations. We filter out these erroneous observations by discarding all data falling below the black dashed line in the two-dimensional histogram of high-precipitation occurrences in figure 15.

Figure 15: Two-dimensional histogram of AMSR-E 1DH gridded observations for precipitation rates above 7.5 mm/h. The histogram has been created with respect to the number of observations on which the 1DH values are built and the latitudinal 1° grid cell in which they occur. Data points below the black dashed line are subsequently discarded.

3.4.3 Merging and aggregation to daily precipitation

This section describes the computation of a globally gridded precipitation product at a daily time resolution. Variable names given in this section are the same as those given in table 20 in section 4.1 and section 3.4.

Starting with the hourly 1° × 1° gridded data (1DH) from multiple platforms that result from the gridding algorithm described in section 3.4.1, 1DH global/near-global coverage is achieved by merging 1DH precipitation rates from these platforms into one single hourly composite  \( \overline{precip_h} \) . For this purpose, the variables computed in section 3.4.1 are averaged over all available platforms in each grid cell, with equal weights for each platform by default. The num_obs variable is handled in a different way as it is the sum of all platforms computed for every grid cell. The variables pxa (precip), p2xa (precip_stdv) and qxa (quality_flag) refer to monthly computations (associated names used in monthly files are given in the brackets) and are irrelevant for the computation procedure of daily values.

In the next step, composites containing missing values are detected and these gaps are filled. The gap filling is realized by nearest neighbour interpolation in time dimension. If there is an odd number of missing values, the valid value of the lower boundary is used for the gap point in the middle. This interpolation procedure is not used if all hourly composites of the entire day contain only invalid data. In this case, the resulting daily value is marked as missing. One should consider that there is no interpolation in any spatial dimension, and the procedure is applied to \( \overline{precip_h} \) only.
Finally, all hourly precipitation rates in an individual day are accumulated to a daily precipitation rate:

\[ precip_{1DD} = \sum_{i} \overline{precip_{h}} \qquad (3.17) \]

As mentioned before, there will be missing values if all the hourly composites of that day contain only missing data.
Daily means derived from hourly values are calculated for precip_stdv, too. Invalid values (fill values) will be ignored in this process if the entire day contains only invalid data.
In order to give users the possibility for estimating the data quality, two further variables are provided:

  1. Number of observations used to derive the daily precipitation rates
  2. Daily amount of hours covered by observations of at least one platform

The first auxiliary variable is derived from hourly values by summation. The other variable is provided by the detection of missing values before the interpolation procedure and it has been combined with time and platform information needed in the composite creation.

3.4.4 Merging of monthly averages

The computation of a globally gridded precipitation product with a monthly time resolution is described in this section. As in the previous section, the naming of variables will be equal to the names given in table 21 in section 4.2 and section 3.4.
The computation of monthly means of precipitation rates is based on the hourly gridded data described in section 3.4.1, too. Here, the monthly sum of pxa is computed for each platform i and normalized by monthly sums of hourly norm values as shown in following equation:

\[ precip_{i,1DM} = \frac{\sum_{h}pxa_{i,h}}{\sum_{h} norm_{i,h}} \qquad (3.18) \]

with h denoting the hour within the respective month. This implies that the monthly values are not merely averages of the daily values as outlined in section 3.4.3.
In a similar way and on basis of variable p2xa, a monthly intra-platform standard deviation is calculated for each platform i:

\[ precip\_stdv_{i,1DM} = \sqrt{\frac{\sum_{h}p2xa_{i,h}}{\sum_{h} norm_{i,h}} - (precip_{i,1DM})^2} \qquad (3.19) \]

From these values, platform composites are finally computed using weighted means. The weights are derived from the number of 1DH observations that a platform has got within the respective month normalized by the total number of available 1DH observations of all platforms in this month.
In addition to the rain rate data, quality variables are provided in the same way as it has been done for daily data (see section 3.4.3). These variables contain:

  1. Number of observations used to derive the monthly precipitation rates
  2. Monthly amount of hours covered by observations of at least one platform

Since equation (3.18) applies for PNPR-CLIM quality flags qxa (quality_flag) too, the monthly quality_flag variable has been calculated in the same manner as the monthly precip values.

4 Output data

Final results of gridding and merging procedures are global level 3 composite of precipitation rates on a one degree spatial grid. These precipitation rates are accumulated on daily time scales and averaged on monthly ones respectively (details in section 3.4.3, 3.4.4). For an easier identification, these two datasets have been given the name Copernicus Microwave-based Global Precipitation (COBRA). The output is provided as netCDF files. Their standard is netCDF4, i.e., netCDF version 4.0. The files are in compliance with CF-1.8 (http://cfconventions.org/) and ACDD 1.3 (https://wiki.esipfed.org/) conventions. A detailed description of output files with daily data can be found below in subsection 4.1. Files of monthly data are described in subsection 4.2.

Table 19 summarizes the valid versions of produced composites of daily and monthly precipitation rates.

Table 19: Valid versions of COBRA products

Version

Description

1.0

Precipitation rates are derived from the original algorithm as described in this document. The version number is valid for daily and monthly files.

4.1 Daily merged COBRA data

Files containing daily data of accumulated rain rates are named after following syntax:

COBRA_<Year>-<Month>-<Day>_<SpatialResolution>D_v<Version>.nc

The short cut 1D for SpatialResolution denotes the latitude-longitude grid of 1° × 1°. Table 20 lists all variables contained in the specific files and gives a short description of them.

Table 20: List of variables used in any file of daily precipitation rates

Variable Name

Dimension(s)

Unit

Description

Coordinates


lat

1

°N (degrees North)

Latitude of grid cell centre

lat_bnds

2

°N (degrees North)

Boundaries of top (northern) and bottom (southern) grid cell edge

lon

1

°E (degrees East)

Longitude of grid cell centre

lon_bnds

2

°E (degrees East)

Boundaries of left (western) and right (eastern) grid cell edge

time

1

Seconds since 1970-01-01

Time stamp of the current day

time_bnds

2

Seconds since 1970-01-01

Boundaries of the time interval covered by time variable

platform_id

1

N/A

An integer used for internal platform assignment

instrument_id

1

N/A

An integer used for internal instrument assignment

Data Variables


precip

3 (time, lat, lon)

mm/d

Daily accumulated precipitation rates that are represented by a single multi-platform composite

precip_stdv

3 (time, lat, lon)

mm/d

Daily mean of intra-platform standard deviation derived from hourly values

Quality Variables


quality_flag

3 (time, lat, lon)

N/A

Mean of PNPR-CLIM quality flags, whose assigned data was used in composite creation

num_obs

4 (time, lat, lon, instrument_id)

N/A

Total number of observations separated by instrument type

num_covered_hours

3 (time, lat, lon)

N/A

Accumulated number of hours, for which data of at least one platform is available on the respective day

Ancillary Variables


platform_name

2 (time, platform_id)

N/A

Names of all platforms that are used for composite creation for the respective day. The names are allocated to the platform identifier. Platform names are saved as char array. Thus, there is an additional dimension in the netCDF file describing the length of the longest string.

instrument_name

2 (time, instrument_id)

N/A

Assigned names of the specific instrument identifier. Instrument names are saved as char array. Thus, there is an additional dimension in the netCDF file describing the length of the longest string.

4.2 Monthly merged COBRA data

Files containing monthly data of average rain rates are named using the following syntax:

COBRA_<Year>-<Month><SpatialResolution>M_v<Version{_}>.nc

The short cut 1D for SpatialResolution denotes the latitude-longitude grid of 1° × 1°. Table 21 lists all variables that are contained in these files with a short description of them.

Table 21: List of variables used in files with monthly precipitation rates

Variable Name

Dimension(s)

Unit

Description

Coordinates


lat

1

°N (degrees north)

Latitude of grid cell centre

lat_bnds

2

°N (degrees north)

Boundaries of top (northern) and bottom (southern) grid cell edge

lon

1

°E (degrees east)

Longitude of grid cell centre

lon_bnds

2

°E (degrees east)

Boundaries of left (western) and right (eastern) grid cell edge

time

1

Seconds since 1970-01-01

Time stamp of the current month

time_bnds

2

Seconds since 1970-01-01

Boundaries of the time interval covered by time variable

platform_id

1

N/A

An integer used for internal platform assignment

instrument_id

1

N/A

An integer used for internal instrument assignment

Data Variables


precip

3 (time, lat, lon)

mm/d

Monthly mean precipitation that is represented by a single multi-platform composite

precip_stdv

3 (time, lat, lon)

mm/d

Monthly mean of intra-platform standard deviation derived from hourly values

Quality Variables


quality_flag

3 (time, lat, lon)

N/A

Mean of PNPR-CLIM quality flags, whose assigned data was used in composite creation

num_obs

4 (time, lat, lon, instrument_id)

N/A

Total number of observations separated by instrument type

num_covered_hours

3 (time, lat, lon)

N/A

Accumulated number of hours, for which data of at least one platform is available in the respective month

Ancillary Variables


platform_name

2 (time, platform_id)

N/A

Names of all platforms that are used for composite creation for the respective month. The names are allocated to the platform identifier. Platform names are saved as char array. Thus, there is an additional dimension in the netCDF file describing the length of the longest string.

instrument_name

2 (time, instrument_id)

N/A

Assigned names of the specific instrument identifier. Instrument names are saved as char array. Thus, there is an additional dimension in the netCDF file describing the length of the longest string.

Annex

Intermediate PNPR-CLIM output

The PNPR-CLIM output is an instantaneous precipitation rate (level 2) product generated from AMSU B and MHS cross-track scanners on board operational satellites in sun-synchronous orbits. The PNPR-CLIM output is provided in netCDF (V4.0) format and is CF V1.8 convention compliant (http://cfconventions.org/).
The input (FIDUCEO FCDR BTs) and output (PNPR-CLIM outputs) filenames have the following structure:

  1. FIDUCEO Data:

FIDUCEO_FCDR_L1C_<RADIOMETER><SATELLITE><START TIME, YYYYMMDDHHMMSS><END TIME, YYYYMMDDHHMMSS{_}>_EASY_v4.1_fv2.0.1.nc

  1. PNPR-CLIM output:

PNPR-CLIM_<FIDUCEO input filename>.nc

Table 22: Variables within the PNPR-CLIM output file

Name

Long Name

Type

CHAN

Channel

1D

LAT

Latitude

2D

LON

Longitude

2D

POS

Scan position

1D

PP

Probability of Precipitation

Geo2D

PR

Precipitation Rate

Geo2D

QUALITY_PIXEL_BITMASK

FIDUCEO quality pixel bitmask

Geo2D

SCAN

Scan line

1D

UPR

Unmasked Precipitation Rate

Geo2D

UTIME

Scan time

1D

QF

Quality Flag

Geo2D

BQF

Quality Bit Flag

Geo2D


A list of global attributes of PNPR-CLIM netCDF output files is reported, as an example, in the following paragraph:

Dataset type: Hierarchical Data Format, version 5 
File: PNPR-CLIM_FIDUCEO_FCDR_L1C_MHS_METOPA_20170101101322_20170101115439_EASY_v4.1_fv2.0.1.nc {
  dimensions:
    scan = 2286;
    pos = 90;
    chan = 5;
  variables:
    double pr(scan=2286, pos=90);
      :units = "mm / h";
      :long_name = "Precipitation Rate";
      :coordinates = "utime lat lon";
      :_FillValue = NaN; // double

    double pp(scan=2286, pos=90);
      :long_name = "Probability of Precipitation";
      :coordinates = "utime lat lon";
      :_FillValue = NaN; // double

    double upr(scan=2286, pos=90);
      :_FillValue = NaN; // double
      :units = "mm / h";
      :long_name = "Unmasked Precipitation Rate";
      :description = "Precipitation rate regardless the probability of precipitation";
      :coordinates = "utime lat lon";

    

    short bqf(scan=2286, pos=90);
      :units = "bit";
      :values = "0 : weak probability, 1 : extreme scan position, 2 : inherited fiduceo bitmask, 3 :
      snow, 4 : ice, 5 : orography, 6 : extreme cold , 7 : calibrated, 8 : invalid data";
      :long_name = "Quality Bit Flag";
      :coordinates = "utime lat lon";
      :_FillValue = -1S; // short
      :_Unsigned = "true";

    float lat(scan=2286, pos=90);
      :_FillValue = NaNf; // float
      :units = "North degree";
      :long_name = "Latitude";

    float lon(scan=2286, pos=90);
      :_FillValue = NaNf; // float
      :units = "East degree";
      :long_name = "Longitude";

    double utime(scan=2286);
      :long_name = "Scan time";
      :units = "Seconds since 1970";
      :_FillValue = NaN; // double

    long scan(scan=2286);
      :long_name = "Scan line";

    long pos(pos=90);
      :long_name = "Scan position";

    long chan(chan=5);
      :long_name = "Channel";

  // global attributes:
  :_NCProperties = "version=2,netcdf=4.6.2,hdf5=1.10.4";
  :Source_File = "FIDUCEO_FCDR_L1C_MHS_METOPA_20170101101322_20170101115439_EASY_v4.1_fv2.0.1.nc";
  :Retrieval_Algorithm = "PNPR";
  :Author = "ISAC-CNR";
  :Date_of_Creation = "Thu Jul  2 23:59:20 2020";

Intermediate HOAPS output

The HOAPS algorithm output files are Level 2 files, which contain – among others – precipitation, geographical coordinates of each observations centre point grouped in the two dimensions time and scan position, similar to the PNPR-CLIM output (see above). For a complete list of geophysical variables accessible to users in HOAPS, see the respective list for level 3 data products in HOAPS in section 5.1 in [D2]. The HOAPS Level 2 data have been generated in the scope of EUMETSAT’s CMSAF and are not part of the C3S portfolio.

References

Aires, F., Prigent, C., Rossow, W. B., Rothstein, M. (2001). A new neural network approach including first guess for retrieval of atmospheric water vapor, cloud liquid water path, surface temperature, and emissivities over land from satellite microwave observations. J. Geophys. Res. Atmos., 106, 14887–14907.

Amante, C., and B.W. Eakins, (2009). ETOPO1 1 Arc-Minute Global Relief Model: Procedures, Data Sources and Analysis (NOAA Technical Memorandum NESDIS NGDC-24). National Geophysical Data Center, NOAA. https://doi.org/10.7289/V5C8276M [May 2020]

Andersson, A., Fennig, K., Klepp, C., Bakan, S., Graßl, H., and Schulz, J. (2010). The Hamburg Ocean Atmosphere Parameters and Fluxes from Satellite Data – HOAPS-3, Earth Syst. Sci. Data, 2, 215–234. https://doi.org/10.5194/essd-2-215-2010

Bauer, P., Lopez, P., Benedetti, A., Salmond, D., & Moreau, E. (2006a). Implementation of 1D+4D-Var Assimilation of Precipitation-affected Microwave Radiances at ECMWF. Q. J. Roy. Meteor. Soc., 132, 2277–2306.

Bauer, P., Moreau, E., Chevallier, F., & O’Keeffe, U. (2006b). Multiple-scattering microwave radiative transfer for data assimilation applications. Q. J. R. Meteorol. Soc., 132, 1259–1281.

Bennartz, R. (2000). Optimal Convolution of AMSU-B to AMSU-A. J. Atmos. Oceanic Technol., 17, 1215–1225. https://doi.org/10.1175/1520-0426(2000)017<1215:OCOABT>2.0.CO;2

Bishop, C. M. (1995). Neural Networks for pattern recognition. Oxford: Clarendon Press.

Blackwell, W. J., Chen, F. W. (2005). Neural Network Applications in High Resolution Atmospheric Remote Sensing. Linc. Lab. J., 15, 299–322.

Brogniez, H., English, S., Mahfouf, J. F., Behrendt, A., Berg, W., Boukabara, S., Buehler, S. A., Chambon, P., Gambacorta, A., et al. (2016). A review of sources of systematic errors and uncertainties in observations and simulations at 183GHz. Atmos. Meas. Tech., European Geosciences Union, 9(5), 2207–2221. https://doi.org/10.5194/amt-9-2207-2016

Burgdorf, M., Hans, I., Prange, M., Lang, T., & Buehler, S. A. (2018). Inter-channel uniformity of a microwave sounder in space, Atmos. Meas. Tech., 11, 4005–4014. https://doi.org/10.5194/amt-11-4005-2018

Burns, B. A., Wu, X., and Diak, G. R. (1997). Effects of precipitation and cloud ice on brightness temperatures in AMSU moisture channels, IEEE T. Geosci. Remote, 35(6), 1429–1437. https://doi.org/10.1109/36.649797

Casella, D., do Amaral, L. M. C., Dietrich, S., Marra, A. C., Sanò, P., Panegrossi, G. (2017). The Cloud Dynamics and Radiation Database Algorithm for AMSR2: Exploitation of the GPM Observational Dataset for Operational Applications. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., 10, 3985–4001.

Cohen, J. (1960). A coefficient of agreement for nominal scales. Educ. Psychol. Meas., 20, 37–46.

Fennig, K., Schröder, M., Hollmann, R. (2017). Fundamental Climate Data Record of Microwave Imager Radiances (Edition 3). Satellite Application Facility on Climate Monitoring. https://doi.org/10.5676/EUM_SAF_CM/FCDR_MWI/V003.

Fennig, K., Schröder, M., Andersson, A., & Hollmann, R. (2020). A Fundamental Climate Data Record of SMMR, SSM/I, and SSMIS brightness temperatures, Earth Syst. Sci. Data, 12, 647–681. https://doi.org/10.5194/essd-12-647-2020

Ferraro, R. R., Smith, E. A., Berg, W., Huffman, G. J. (1998). A screening methodology for passive microwave precipitation retrieval algorithms. J. Atmos. Sci., 55, 1583–1600.

Ferraro, R. R. (2004). The Status of the NOAA/NESDIS Operational AMSU Precipitation Algorithm (9). 2nd Workshop of the International Precipitation Working Group, Monterey.

Ferraro, R. R., Weng, F., Grody, N. C., Zhao, L., Meng, H., Kongoli, C., Pellegrino, P., Qiu, S., & Dean, C. (2005). NOAA operational hydrological products derived from the advanced microwave sounding unit, IEEE T. Geosci. Remote, 43(5), 1036–1049.

Funatsu, B. M., Claud, C., & Chaboureau, J.-P. (2007). Potential of Advanced Microwave Sounding Unit to identify precipitating systems and associated upper-level features in the Mediterranean region: Case studies. Journal of Geophysical Research: Atmospheres, 112(D17113). https://doi.org/10.1029/2006JD008297

Funatsu, B. M., Claud, C., & Chaboureau, J.-P. (2009). Comparison between the large-scale environments of moderate and intense precipitating systems in the Mediterranean region, Mon. Weather Rev., 137(11), 3933–3959. http://doi.org/10.1175/2009MWR2922.1

Grecu, M., & Anagnostou, E. N. (2001). Overland Precipitation Estimation from TRMM Passive Microwave Observations, J. Appl. Meteor., 40, 1367–1380. https://doi.org/10.1175/1520-0450(2001)040<1367:OPEFTP>2.0.CO;2

Grecu, M., & Olson, W. S. (2006). Bayesian Estimation of Precipitation from Satellite Passive Microwave Observations Using Combined Radar–Radiometer Retrievals. J. Appl. Meteorol. Climatol., 45, 416–433.

Grecu, M., Olson, W. S., Munchak, S. J., Ringerud, S., Liao, L., Haddad, Z. S., Kelley, B. L., & McLaughlin, S. F. (2016). The GPM combined algorithm. Journal of Atmospheric and Oceanic Technology, 33, 2225–2245. https://doi.org/10.1175/JTECH-D-16-0019.1

Hans, I., Burgdorf, M., John, V. O., Mittaz, J., & Buehler, S. A. (2017). Noise performance of microwave humidity sounders over their lifetime. Atmos. Meas. Tech., 10, 4927–4945. https://doi.org/10.5194/amt-10-4927-2017

Hans, I. (2018). Towards a new fundamental climate data record for microwave humidity sounders based on metrological best practice [Dissertation]. Hamburg. https://ediss.sub.uni-hamburg.de/volltexte/2018/9454/pdf/Dissertation.pdf

Hans, I., Burgdorf, M., Woolliams, E. (2019). FIDUCEO - Product user guide – Microwave FCDR release 4.1. http://cedadocs.ceda.ac.uk/1415/

Hollinger, J., Lo, R., Poe, G., Savage, R., & Peirce, J. (1987). Special Sensor Microwave/Imager User's Guide. Naval Reserach Laboratory, Washington DC, 177pp.

Hollinger, J. P., Peirce, J. L., & Poe, G. A. (1990). SSM/I Instrument Evaluation. IEEE Trans. Geosci. Remote Sens., 28, 781–790.

Hong, Y., Hsu, K.-L., Sorooshian, S., Gao, X. (2004). Precipitation estimation from remotely sensed imagery using an artificial neural network cloud classification system. J. Appl. Meteorol., 43, 1834–1853.

Hong, G., Heygster, G., Miao, J., & Kunzl, K. (2005). Detection of tropical deep convective clouds from AMSU-B vater vapour channels measurements. J. Geophys. Res., 110(D05205). https://doi.org/10.1029/2004JD004949

Hong, G., Heygster, G.,  Notholt, J., and Buehler, S.A.(2008). Interannual to Diurnal Variations in Tropical and Subtropical Deep Convective Clouds and Convective Overshooting from Seven Years of AMSU-B Measurements,  Journal of Climate,  vol.  21, n. 17,  4168–4189. https://doi.org/10.1175/2008JCLI1911.1.

Houze, R. A., Jr., McMurdie, L. A., Petersen, W. A., Schwaller, M. R., Baccus, W., Lundquist, J. D., Mass, C. F., Nijssen, B., Rutledge, S. A., Hudak, D. R., Tanelli, S., Mace, G. G., Poellot, M. R., Lettenmaier, D. P., Zagrodnik, J. P., Rowe, A. K., DeHart, J. C., Madaus, L. E., Barnes, H. C., & Chandrasekar, V. (2017). The Olympic Mountains Experiment (OLYMPEX), Bulletin of the American Meteorological Society, 98(10), 2167–2188. https://doi.org/10.1175/BAMS-D-16-0182.1

Kacimi, S., Viltard, N., Kirstetter, P.-E. (2013). A new methodology for rain identification from passive microwave data in the Tropics using neural networks. Q. J. R. Meteorol. Soc, 139, 912–922.

Kim, J., Ou, M., Park, J., Morris, K. R., Schwaller, M. R., & Wolff, D. B. (2014). Global Precipitation Measurement (GPM) Ground Validation (GV) Prototype in the Korean Peninsula, Journal of Atmospheric and Oceanic Technology, 31(9), 1902–1921. https://doi.org/10.1175/JTECH-D-13-00193.1

Kirstetter, P.-E., Viltard, N., Gosset, M. (2013). An error model for instantaneous satellite rainfall estimates: Evaluation of BRAIN-TMI over West Africa. Q. J. R. Meteorol. Soc., 139, 894–911.

Kummerow, C., Barnes, W., Kozu, T., Shiue, J., Simpson, J. (1998). The Tropical Rainfall Measuring Mission (TRMM) Sensor Package. Journal of Atmospheric and Oceanic Technology, 15(3), 809–817. https://doi.org/10.1175/1520-0426(1998)015<0809:TTRMMT>2.0.CO;2

Kummerow, C. D., Ringerud, S., Crook, J., Randel, D., Berg, W. (2011). An Observationally Generated A Priori Database for Microwave Rainfall Retrievals. J. Atmos. Ocean. Technol., 28, 113–130.

Kunkee, D. B., Poe, G. A., Boucher, D. J., Swadley, S. D., Hong, Y., Wessel, J. E., & Uliana, E. A. (2008). Design and Evaluation of the First Special Sensor Microwave Imager/Sounder, IEEE Trans. Geosci. Remote Sens., 46, 863–883.

Kuo, K., W.S. Olson, B.T. Johnson, M. Grecu, L. Tian, T.L. Clune, B.H. van Aartsen, A.J. Heymsfield, L. Liao, and R. Meneghini (2016). The Microwave Radiative Properties of Falling Snow Derived from Nonspherical Ice Particle Models. Part I: An Extensive Database of Simulated Pristine Crystals and Aggregate Particles, and Their Scattering Properties. J. Appl. Meteor. Climatol., 55, 691–708. https://doi.org/10.1175/JAMC-D-15-0130.1

Laviola, S., Levizzani, V. (2011). The 183-WLS fast rain rate retrieval algorithm. Part I: Retrieval design. Atmos. Res., 99, 443–461. https://doi.org/10.1016/j.atmosres.2010.11.013

Le, M., Chandrasekar, V. (2019). Ground Validation of Surface Snowfall Algorithm in GPM Dual-Frequency Precipitation Radar, Journal of Atmospheric and Oceanic Technology, 36(4), 607-619. https://doi.org/10.1175/JTECH-D-18-0098.1

Liou, Y.-A., Tzeng, Y. C., Chen, K. S. (1999). A neural-network approach to radiometric sensing of land-surface parameters. IEEE Trans. Geosci. Remote Sens., 37, 2718–2724.

Liu, G., and Seo, E. K. (2013). Detecting snowfall over land by satellite high-frequency microwave observations: The lack of scattering signature and a statistical approach, J. Geophys. Res. Atmos., 118, 1376–1387. https://doi.org/10.1002/jgrd.50172

Mahesh, C., Prakash, S., Sathiyamoorthy, V., Gairola, R. M. (2011). Artificial neural network based microwave precipitation estimation using scattering index and polarization corrected temperature. Atmos. Res., 102, 358–364.

McCann, D. W. (1992). A neural network short-term forecast of significant thunderstorms. Weather Forecast., 7, 525–534.

Merchant, C. J., Paul, F., Popp, T., Ablain, M., Bontemps, S., Defourny, P., Hollmann, R., Lavergne, T., Laeng, A., de Leeuw, G., Mittaz, J., Poulsen, C., Povey, A. C., Reuter, M., Sathyendranath, S., Sandven, S., Sofieva, V. F., & Wagner, W. (2017). Uncertainty information in climate data records from Earth observation. Earth Syst. Sci. Data, 9, 511–527. https://doi.org/10.5194/essd-9-511-2017

Merchant, C., Holl, G., Mittaz, J., Phipps, R., Taylor, M., Quast, R., Govaerts, Y., John, V., Ruethrich, F., Roebeling, R., Burgdorf, M., Hans, I., Woolliams, E., Block, T. (2019). D4_7 Scientific Basis of FIDUCEO FCDRs (Ref. Ares(2019)5471518 - 29/08/2019). http://cedadocs.ceda.ac.uk/1411/1/Attachment_0-13.pdf

Mugnai, A., Casella, D., Cattani, E., Dietrich, S., Laviola, S., Levizzani, V., Panegrossi, G., Petracca, M., Sanò, P., Di Paola, F., et al (2013). Precipitation products from the hydrology SAF. Nat. Hazards Earth Syst. Sci., 13, 1959–1981.

Northrop Grumman Corporation. (2002). Algorithm and Data User Manual (ADUM) for the Special Sensor Microwave Imager/Sounder (SSMIS) (Technical Report, Report 12621). Northrop Grumman Corporation Space Systems Division, Azusa, California.

Panegrossi, G., Dietrich, S., Marzano, F. S., Mugnai, A., Smith, E. A., Xiang, X., Tripoli, G. J., Wang, P. K., Poiares Baptista, J. P. V. (1998). Use of Cloud Model Microphysics for Passive Microwave-Based Precipitation Retrieval: Significance of Consistency between Model and Measurement Manifolds. J. Atmos. Sci., 55, 1644–1673.

Panegrossi, G., Sanò, P., Bagaglini, L., Casella, D., Cattani, E., Konrad, H., Niedorf, A., Schröder, M., Mikalsen, A. C., & Hollmann, R. (2020). Development of a microwave-based precipitation climate data record for the Copernicus Climate Change Service (EGU2020-18636). EGU General Assembly 2020, Online, 4–8 May 2020. https://doi.org/10.5194/egusphere-egu2020-18636

Qiu, S., Pellegrino, P., Ferraro, R., & Zhao, L. (2005). The improved AMSU rain-rate algorithm and its evaluation for a cool season event in the Western United States. Weather Forecast., 20(5), 761–774. http://doi.org/10.1175/WAF880.1

Reynolds, R. W., Smith, T.M., Liu, C., Chelton, D. B., Casey, K. S., & Schlax, M. G. (2007). Daily High-Resolution-Blended Analyses for Sea Surface Temperature. J. Climate, 20, 5473–5496. http://doi.org/10.1175/2007JCLI1824.1

Sanò, P., Casella, D., Mugnai, A., Schiavon, G., Smith, E. A., and Tripoli, G. J (2013). Transitioning from CRD to CDRD in bayesian retrieval of rainfall from satellite passive microwave measurements: Part 1. Algorithm description and testing, IEEE T. Geosci. Remote, 51(7), 4119–4143. https://doi.org/10.1109/TGRS.2012.2227332

Sanò, P., Panegrossi, G., Casella, D., Di Paola, F., Milani, L., Mugnai, A., Petracca, M., Dietrich, S. (2015). The Passive microwave Neural network Precipitation Retrieval (PNPR) algorithm for AMSU/MHS observations: Description and application to European case studies. Atmos. Meas. Tech., 8, 837–857.

Sanò, P., Panegrossi, G., Casella, D., Marra, A.C., Di Paola, F., Dietrich, S. (2016). The new Passive microwave Neural network Precipitation Retrieval (PNPR) algorithm for the cross-track scanning ATMS radiometer: Description and verification study over Europe and Africa using GPM and TRMM spaceborne radars. Atmos. Meas. Tech., 9, 5441–5460.

Schwaller, M., and Morris, K. (2011). A Ground Validation Network for the Global Precipitation Measurement Mission. Journal of Atmospheric and Oceanic Technology, 28, 301-319. https://doi.org/10.1175/2010JTECHA1403.1

Seto, S., Kubota, T., Takahashi, N., Iguchi, T., Oki, T. (2008). Advanced rain/no-rain classification methods for microwave radiometer observations over land. J. Appl. Meteorol. Climatol., 47, 3016–3029.

Skofronick-Jackson, G. and Johnson, B. T. (2011). Surface and atmospheric contributions to passive microwave brightness temperatures for falling snow events, J. Geophys. Res., 116, D02213-1–D02213-16

Speirs, P., Gabella, M., & Berne, A. (2017). A Comparison between the GPM Dual-Frequency Precipitation Radar and Ground-Based Radar Precipitation Rate Estimates in the Swiss Alps and Plateau, Journal of Hydrometeorology, 18(5), 1247–1269. https://doi.org/10.1175/JHM-D-16-0085.1

Staelin, D. H., & Chen, F. W. (2000). Precipitation observations near 54 and 183 GHz using the NOAA-15 satellite. IEEE T. Geosci. Remote, 38(5), 2322-2332. https://doi.org/10.1109/36.868889

Sudradjat, A., Wang, N.-Y., Gopalan, K., Ferraro, R. R. (2011). Prototyping a generic, unified land surface classification and screening methodology for GPM-era microwave land precipitation retrieval algorithms. J. Appl. Meteorol. Climatol., 50, 1200–1211.

Surussavadee, C., & Staelin, D.H. (2008). Global millimeter-wave precipitation retrievals trained with a cloud-resolving numerical weather prediction model, Part I: Retrieval design. IEEE Trans. Geosci. Remote Sens., 46, 99–108.

Tang, G., Wen, Y., Gao, J., Long, D., Ma, Y., Wan, W., Hong Y. (2017). Similarities and differences between three coexisting spaceborne radars in global rainfall and snowfall estimation. Water Resour. Res., 53, 3835–3853. https://doi.org/10.1002/2016WR019961

Tang, L., Zhang, J., Simpson, M., Arthur, A., Grams, H., Wang, Y., & Langston, C. (2020). Updates on the Radar Data Quality Control in the MRMS Quantitative Precipitation Estimation System, Journal of Atmospheric and Oceanic Technology, 37(9), 1521–1537. https://doi.org/10.1175/JTECH-D-19-0165.1

Tao, Y., Gao, X., Hsu, K., Sorooshian, S., & Ihler, A. (2016). A Deep Neural Network Modeling Framework to Reduce Bias in Satellite Precipitation Products, Journal of Hydrometeorology, 17(3), 931–945. https://doi.org/10.1175/JHM-D-15-0075.1

Tapiador, F.J., Navarro, A., Levizzani, V., García-Ortega, E., Huffman, G.J., Kidd, C., Kucera, P.A., Kummerow, C. D., Masunaga, H., Petersen, W. A., et al. (2017). Global precipitation measurements for validating climate models. Atmos. Res., 197, 1–20.

Wang, J. R., Wilheit, T. T., & Chang, L. A. (1989). Retrieval of total precipitable water using radiometric measurements near 92 and 183 GHz, J. Appl. Meteorol., 28(2), 146–154. https://doi.org/10.1175/1520-0450(1989)028%3C0146:ROTPWU%3E2.0.CO;2

Wang, J. R., Zhan, J., & Racette, P. (1997). Storm-associated microwave radiometric signatures in the frequency range of 90–220 GHz, J. Atmos. Ocean. Tech., 14(1), 13–31. https://doi.org/10.1175/1520-0426(1997)014<0013:SAMRSI>2.0.CO;2

Wentz, F. J. (1991). User’s Manual for SSM/I Antenna Temperature Tapes Revision 1 (Technical Report 120191). Remote Sensing Systems, Santa Rosa, California.

Zhang, J. et al. (2011). National Mosaic and Multi-Sensor QPE (NMQ) system: Description, results, and future plans. Bull. Amer. Meteor. Soc., 92, 1321–1338. https://doi.org/10.1175/2011BAMS-D-11-00047.1

Zhang, J., Howard, K., Langston C., Kaney, B., Qi, Y., Tang, L., Grams, H., Wang, Y., Cocks, S., Martinaitis, S., Arthur, A., Cooper, K., Brogden, J., Kitzmiller, D. (2016). Multi-Radar Multi- Sensor (MRMS) Quantitative Precipitation Estimation: Initial Operating Capabilities. Bull. Amer. Met. Soc., 97, 621–637. https://doi.org/10.1175/BAMS-D-14-00174.1


This document has been produced in the context of the Copernicus Climate Change Service (C3S).

The activities leading to these results have been contracted by the European Centre for Medium-Range Weather Forecasts, operator of C3S on behalf of the European Union (Delegation agreement signed on 11/11/2014). All information in this document is provided "as is" and no guarantee or warranty is given that the information is fit for any particular purpose.

The users thereof use the information at their sole risk and liability. For the avoidance of all doubt , the European Commission and the European Centre for Medium - Range Weather Forecasts have no liability in respect of this document, which is merely representing the author's view.

Related articles