Contributors: B. Calmettes (CLS), JF. Crétaux (CLS), L. Carrea (University of Reading), C.J. Merchant (University of Reading)

Issued by: B. Calmettes

Date: 21/04/2021

Ref:C3S_312b_Lot4.D2.LK.3-v3.0_LWL_Product_Quality_Assurance_Document_i1.0.docx

Official reference number service contract: 2018/C3S_312b_Lot4_EODC/SC2

Table of Contents

History of modifications

Issue

Date

Description of modification

Author

i0.1

31/01/2021

The present document was modified based on the document with deliverable ID: C3S_312b_Lot4.D2.LK.3-v2.0_202001_Product_Quality_Assurance_Document_LWL_v1.0
Revision of document to CDR v3.0, revision of available products (section 1.2) and parameters and units (section 1.3). Removal of Hydrosat dataset (section 2.1) inclusion of new in-situ dataset with data on Switzerland's lakes (section 2.2 ), revision of section 3.6 and formatting of tables.

BC

V1.0

22/02/2021

Reviewed Document. Revision of change log, Updated References, refresh of all cross references, acceptance of all formatting changes, updated ToC. To Assimila for review.

RK

List of datasets covered by this document

Deliverable ID

Product title

Product type (CDR, ICDR)

Version number

Delivery date

D3.LK.4-v2.0

Lake Water Level)

CDR

V3.0

31/01/2021

Related documents

Reference ID

Document

D1

System Quality Assurance Document v3.0 (D1.LK.3-v3.0)

D2

Algorithm Theoretical Basis Document v3.0 (D1.LK.4-v3.0)

D3

Product Quality Assessment Report v3.0 (D2.LK.4-v3.0)

D4

Product User Guide and Specification v3.0 (D3.LK.6-v3.0)

Acronyms

Acronym

Definition

ATBD

Algorithm Theoretical Basis Document

C3S

Copernicus Climate Change Service

CCI

Climate Change Initiative

CDR

Climate Data Records

CF

Climate Forecasting

CLS

Collecte Localisation Satellites

CNES

Centre National d'Etude Spatiale

DEM

Digital Elevation Model

ECMWF

European Center for Medium-range Weather Forecasts

ECV

Essential Climate Variable

EODC

Earth Observation Data Center

GCOS

Global Climate Observing System

GFO

Geosat-Follow-On

GGM02C

GRACE Gravity Model 02 (combination model)

HAL

Cluster High Performance of CNES

HR

High Rate

ICDR

Intermediate Climate Data Record

LEGOS

Laboratoire d'Etudes en Géophysique et Océanographie Spatiales

LK

Lake

LRM

Low Resolution Mode

LWL

Lake Water Level

PQAD

Product Quality Assessment Document

PQAR

Product Quality Assessment Report

PUGS

Product User Guide and Specification

SAR

Synthetic Aperture Radar

SARAL

Satellite with Argos and Altika

SRAL

SAR Radar Altimeter

TCDR

Thematic CDR

TOPEX

TOPography EXperiment

USDA

United States Department of Agriculture

WSH

Water Surface Height

General definitions

Accuracy: The closeness between the measured value and the true quantity value.
Precision: Closeness between measured values obtained by replicate measurements on the same object under similar conditions
Bias: Estimate of a systematic error

Scope of the document

This document is the Product Quality Assurance Document (PQAD). It describes the dataset and validation methods as well as the strategies used for validation and characterisation of the C3S Lake Water Level product. It is a self-contained document which gathers all validation methods and analyses conducted to assess the quality of the C3S Lake Water Level product.

Executive summary

The C3S Lake production system (C3S ECV LK) provides an operational service, generating lake surface water temperature and lake water level climate datasets for a wide variety of users within the climate change community. The present document covers the lake water level system.

The Product Quality Assurance includes the definition and description of the datasets, validation methods and strategies used for the validation and characterisation of the accuracy and stability of the Lake Water Level product. This document is applicable to the Climate Data Record version 3.0 of the produced in January 2021 (product version v3.0). This Quality assurance work consists of two parts: ( i) validation of the generated data by analysing the error generated by the instruments and processing and (ii) comparison of generated products with external data from other altimetry products and in-situ data.

This document describes the methodology at the third version of the C3S LWL product.

1. Validated products

1.1. Product Specifications

Presently, this section relies on statements for the Lake ECV from GCOS, published literature, experience from other CDR projects, and requirements emerging from the definition of the service. The user requirements are indicated in Table 1.

Table 1: User Requirements for Water Level as described in GCOS

Content of the dataset

Content of the main file

The data file shall contain the following information on separate layers:

  • Water Level value
  • A measure of the uncertainty

Spatial and temporal features

Spatial coverage

The product shall be distributed globally based on a harmonized identification of the products. The area of the lakes must be at least 1kmx1km.

Temporal coverage

Times series of 10 years minimum are required.

Temporal resolution

  • A monthly composite of the product shall be distributed.
  • A 10-day composite of the product shall be distributed.

Data uncertainties

Threshold

15 cm

Target

3cm for large lakes, 10 cm for the remainder

Format requirements

Format

NetCDF, CF Convention

A detailed description of the product generation is provided in the Algorithm Theoretical Basis Document (ATBD) [D2], with further information on the product given in the Product User Guide and Specification (PUGS) [D4].

1.2. Available products

The C3S lakes products comprise a long-term climate data record (CDR). The time series has been computed since late 1992 to 2020. The altimetry data is provided by the missions:

  • TOPEX/Poseidon from 1992 to 2002
  • Jason-1 from 2001 to 2013
  • Jason-2 from 2008 to 2015
  • Jason-3 from 2016 to Present
  • Envisat/RA-2 from 2002 to 2012
  • SARAL/Altika: from 2013 to Present
  • Geosat Follow On (GFO): from 1998 to 2008
  • Sentinel-3A/SRAL from 2016 to Present
  • Sentinel-3B from 2018 to Present

This current document is applicable to the Quality Assessment activities performed on the dataset generated in January 2021 (CDR v3.0).

1.3. Parameters and units

The products in this C3S version are shared with Theia-land program supported by CNES. The research leading to the current version of the product has received funding from CNES, LEGOS and CLS as well as the Copernicus Climate Change program permitting an increase in the spatial coverage.
The LWL product, in netCDF4 format, contains:

  • The measure of the absolute height expressed in meters of the reflecting water surface beneath the satellite with respect to a vertical datum (GGM02C).
  • The associated uncertainty expressed in meters.

Additional information is also included in the output file, concerning both the lake (name, country, basin, latitude, longitude) and the processing (processing mode, processing level).

2. Description of validating datasets

External altimetry-based data and in situ data could provide an accurate external estimate of water level, though it is not necessarily collocated in time and space, and also contains specific errors.
With this method, the lake water level products can be compared with external data in terms of root mean square error (rms) and correlation. Such results will be presented in the PQAR associated with v3.0 products.

2.1. Altimetry based datasets

2.1.1. G-Realm

The G-Realm dataset (https://ipad.fas.usda.gov/cropexplorer/global_reservoir) is produced by The United States Department of Agriculture (USDA) in cooperation with the National Aeronautics and Space Administration and the University of Maryland. This database utilises radar altimetry from different satellites (Topex/Jason, Envisat) to produce lake water level over global inland water bodies.
The result of the process is a time series of height variation. The accuracy is expected to be better than 10cm for the largest lakes and 20 cm for smaller lakes. Currently, only lakes larger than 100 km2 are available.

2.1.2. Dahiti

Dahiti is a global database. It currently contains water level series from 886 water bodies (https://dahiti.dgfi.tum.de/en). The processing for generating the Dahiti products, based on an extended outlier detection and Kalman filtering, is described in Schwatke et al. 2015.

2.2. In-situ datasets

Hydrolare (hydrolare.net), the International Data Centre on Hydrology of Lakes And Reservoirs provides data on mean monthly water level of nearly 1200 water bodies. The centre improves data collection, processing, analysing and monitoring lakes and reservoirs. It functions on the basis of free of charge dissemination of information for governmental, scientific, educational, projecting public and commercial institutions.
Other sources of in-situ information are being explored. Among them we can mention:

3. Description of product validation methodology

3.1. Overall procedure

The validation exercise consists of validating the local changes of water level as measured by altimetry. The uncertainties or errors in the products are two-fold: measurement errors and processing errors.

The following description of errors is inspired from [Bercher, 2008].

3.2. Instrument characteristics

Water Surface Height estimates are obtained by measurements of the satellite radar echo return time. The instrument monitors the reflected radar signal over a recording window (e.g. approximately 4.10-7 seconds for Jason-3). This window can be determined by the on-board tracking system based on the analysis of the last range measurements to approximate the position of the useful information in the incoming waveform – this is known as close loop mode. It can also be imposed by the use of a Digital Elevation Model (DEM) stored onboard the satellite – this is known as open loop mode.

In close loop mode, in case of rapid and unexpected changes of the topography, the tracking system may not position the recording window accurately and fail to measure the range (“unhook”). In this case, the system misses between 1-3 seconds of measurements [Chelton et al., 2001]. This ultimately leads to the loss of 10 to 120 high frequency measurements, depending on the sampling rate of the altimeter, or 5 to 20 km. The open loop mode allows to avoid such problems provided an accurate enough DEM is uploaded.

Both Sentinel-3A and Jason-3 use these 2 modes. Switching between the modes is defined by patches positions uploaded onboard. The initial Jason-3 DEM included about 250 lakes from the Hydroweb product. It has been updated on 08/30/2017 to about 350 lakes after Biancamaria et al (2017) performed validation showing high quality measurements were obtained over DEM stations.

The present product contains both lakes for which open and close loop acquisition modes are used. An evaluation of the products has not been performed as a function of the acquisition mode, nevertheless the editing algorithm [ATBD, D2] ensures that the accepted Water Level values are compatible with the a priori elevation provided in the lake parameter files. Therefore, such a selection acts as a posteriori use of DEM. This ensures that altimeter hooking is detected on possible different targets. Such measurements are rejected by the water level processing chain.

Other parameters can also affect the water level retrieved by the altimeter. In the context of hydrology, the presence of land surfaces in the altimeter footprint may result in a significant error in the range estimate from which water level is computed. This effect is particularly significant when land surfaces contain echogenic targets (buildings, emerged lands, humid vegetation, dams…etc.). The surface of the footprint, thus, plays an important role.

All altimeters are not similarly affected by surface specificities or canopy penetration of the signal. As an example, it depends on the altimeter frequency band (e.g. C, Ka, Ku) and the technology of the altimeter. Two technologies are currently used. As a result, the products are obtained either for standard altimetry measurements (low resolution mode, LRM, for Jason-3) or Synthetic Aperture Radar measurements (SAR, for Sentinel-3A). Lakes time series can contain measurements derived from both technologies, provided both Sentinel-3A/Sentinel-3B  and Jason-3 sub-satellite track pass over the considered lake.

3.3. Processing characteristics

Though altimeters are perfectly fitted to the observation of sea level, their performance decreases over inland waters, and particularly small lakes. The smaller the ratio between water and emerged land surfaces in the footprint, the more complex the waveforms. This requires dedicated retracking algorithms which are not yet able to reach the quality of the retracking algorithm over ocean.

Instrumental, propagation and geophysical corrections are also used in the computation of water level and may contain errors. The most notable is the geoid correction, locally leading to uncertainties up to 35cm [Jekeli et Dumrongchai, 2003].

Consequently, several criteria are applied to remove segments of satellite passes (transects) which are not compliant with the quality requirements. The validation and quality assessment metrics presented in this document only refer to these pre-selected transects.

3.4. Validation approach

The main problem of water level validation is that one tries to quantify errors smaller than natural variability.

Thus, one needs to process separately the errors of different nature (see sections 3.2 and.3.3). Are they constant in time (for a given lake, but variable on another target), or in space (for a given pass/day, all targets "see" the same error), or both (bias), or neither (random)? Above all, does using a given methodology (difference at crossovers, in situ) to separate errors and/or natural variability achieve better results than performing an absolute validation (statistics)?

The validation methods presented below are, thus, based on absolute and relative approaches.

The first absolute validation method quantifies the dispersion of consecutive water level high frequency measurements for the same target by computing their standard deviation. NB: consecutive measurements are then averaged to compute the lake products. The more precise the product is, the smaller the dispersion is expected to be, provided that one can accurately correct for the natural spatial slope over the transect.

Relative validation methods are based on a comparison of lake products with external water level data. These are either equivalent altimetry data or in situ data.

3.5. Absolute Assessment

The validation exercise consists of validating the local changes of water level as measured by altimetry. The uncertainties or errors in the products are two-fold: measurement errors and processing errors. The following description of errors is inspired from [Bercher, 2008].

Four performance indicators are chosen to assess the quality of lake products:

  • Dispersion: mean valid transect dispersion
  • High-frequency variations: standard deviation of residuals from a high-pass Lanczos filter with an arbitrary 1-month cut-off period.
  • Mean time step: average time between two valid measures
  • The missing values: this is the percent of lake water level values that can't be estimated due to different reasons: quality of the signal, shift of the ground trajectory, fast change in the level that activates the editing of the estimate.

These performance indicators will be calculated for each lake (140 lakes in the current version) on two time periods: the full time series of ~25 year for most lakes and the last 10 years. These last indicators give the performance of recent and future quality products.

Lakes products can contain altimeter data from multiple satellites tracks as well as different missions. Transects (intersections between satellite tracks and lakes) are on average longer in big lakes. So, the precision of the level estimation is also a function of the size of the lake.

Since LWL products are derived from multiple missions, other interesting indicators involve the comparison of the performance between missions. Missing values and the uncertainty per mission will be calculated.

3.6. Relative Assessment: comparison to independent altimetry products

External altimetry products using different data processing are useful to assess the quality of the lake water level products. This data comes from two different products (G-REALM and Dahiti). These products use different datums at different dates, and the comparison is not straightforward. Several approaches are being considered:

  • Comparison of monthly anomalies
  • Comparison between measurements available at the same dates

3.7. Relative assessment: comparison to in situ data

In situ data provide an accurate external estimate of water level, though it is not necessarily collocated in time and space, and also contains specific errors.

With this method, the lake water level products can be compared with in situ data in terms of root mean square difference and correlation [Crétaux et al, 2016].

4. Summary of validation results

This section provides some keys results. The detailed results will be included in the Quality Assessment report [D3].

For the lake products, the time series quality is assessed by studying the dispersion characteristics as well as providing an overview of the series completeness (Table 2 contains the values evaluated in 2017). The second column presents the average of the dispersion of the LWL measurements over lake transects whereas the third column is the standard deviation. The last dispersion value indicated in column 4 is the value of the standard deviation over the last transect in the operational product. Its comparison with column 2 and 3 values allows checking that the accuracy is coherent with these of the historical time series. The fifth column references the longest period without data over the period covered by the time series (last column) and the sixth one the mean interval between two points.

Table 2: Absolute validation of lake level products.

Lake Name

mean dispersion [cm]

std. dev. of dispersion [cm]

last dispersion [cm]

maximum number of days without data

mean number of days without data

time series length [years]

Alakol

14

8

11

72

24

7.8

Albert

19

17

36

77

29

25.6

Amadjuak

12

7

10

73

12

28.3

Aqqikol-Hu

9

3

5

56

27

4.8

Argentino

7

11

0

156

13

28.2

Athabasca

8

7

15

66

7

28.3

Ayakkum

16

31

9

766

40

25.5

Aydarkul

31

24

4

86

26

25.6

Aylmer

13

15

2

97

12

28.2

Bagre

31

29

2

99

14

12.4

Baikal

5

4

5

140

4

28.3

Baker

10

8

6

123

15

28.3

Balbina

12

22

7

140

15

28.2

Balkhash

6

5

6

159

9

28.3

Bangweulu

24

29

3

140

34

25.5

Bankim

41

16

5

51

9

12.5

Beysehir

11

12

10

164

17

28.3

Birch

31

36

9

91

28

4.8

Bodensee

32

27

3

54

29

4.8

Bogoria

18

20

7

56

28

4.9

Bosten

21

17

3

73

16

18.3

Bratskoye

10

13

4

100

10

28.3

Cahora_Bassa

26

30

29

73

17

18.3

Caribou

15

9

5

66

15

28.3

Caspian

4

3

0

64

6

28.3

Cedar

15

13

6

66

16

28.3

Cerros-Colorados

14

12

2

56

27

4.8

Chapala

9

11

3

89

26

9.8

Chardarya

16

22

1

99

13

28.3

Chishi

26

17

10

51

24

4.8

Chocon

14

11

3

37

26

7.8

Chukochye

18

18

4

56

23

4.8

Claire

16

7

10

56

25

4.9

Dagze-Co

16

26

3

358

43

27.9

Des_Bois

7

5

3

126

12

28.3

Dogaicoring-Q

6

8

1

912

42

18.2

Dorgon

19

10

2

52

10

12.5

Dorsoidong-Co

19

10

8

140

33

7.8

Dubawnt

15

8

12

40

8

12.5

Edouard

15

11

9

256

33

25.6

Erie

3

3

4

95

8

28.3

Faber

30

22

8

52

23

4.9

Fort_Peck

14

20

0

226

25

28.3

Grande_Trois

14

24

3

70

11

28.3

Greatslave

8

6

4

124

5

28.3

Guri

26

23

42

178

22

28.3

Gyaring-Co

11

8

8

54

27

4.8

Har

12

13

0

137

18

28.3

Hoh-Xil-Hu

8

4

9

80

28

4.9

Hongze

38

27

16

95

20

28.3

Hottah

31

18

14

69

11

12.5

Hovsgol

10

12

5

219

20

28.2

Hulun

14

14

5

73

13

28.3

Huron

3

2

2

69

8

28.3

Iliamna

46

48

12

78

13

12.5

Illmen

33

39

4

102

27

25.6

Issykkul

4

5

2

95

12

28.3

Kabele

20

16

7

56

28

4.9

Kabwe

22

13

11

56

28

4.8

Kainji

31

27

18

113

23

28.1

Kairakum

26

17

4

81

17

12.5

Kapchagayskoye

15

17

4

187

16

28.2

Kara_Bogaz_Gol

3

3

2

38

12

28.3

Karasor

11

7

5

108

25

4.8

Kariba

7

14

4

99

14

28.3

Kasba

13

13

4

98

10

18.2

Khanka

8

8

2

164

17

21.0

Kivu

19

13

7

314

39

25.6

Kokonor

15

15

10

247

32

25.6

Kossou

83

46

13

137

29

4.8

Krasnoyarskoye

22

27

0

109

11

18.3

Kremenchutska

11

17

3

116

12

28.3

Kumskoye

19

18

6

54

12

12.5

Kuybyshevskoye

14

18

5

122

15

28.3

Kyoga

11

12

6

106

15

28.3

Ladoga

6

3

5

89

7

28.3

Lagoa_Do_Patos

7

9

0

307

15

28.3

Langa-Co

6

6

6

70

11

11.7

Langano

14

15

4

29

27

4.8

Leman

10

8

4

54

19

4.6

Lixiodain-Co

16

23

9

804

55

23.7

Mai-Ndombe

22

22

9

54

19

4.8

Malawi

8

6

6

89

10

28.3

Manitoba

8

6

3

121

11

21.0

Michigan

4

3

4

65

8

28.3

Migriggyangzham

13

20

1

358

22

27.9

Mossoul

23

36

16

179

24

28.3

Mweru

4

4

5

59

15

28.2

Namco

8

10

3

151

31

25.6

Nasser

14

13

6

65

11

28.3

Nezahualcoyoti

52

51

3

248

49

25.6

Ngangze

10

12

1

250

17

28.3

Ngoring-Co

14

17

6

179

21

28.3

Nicaragua

4

3

1

62

14

28.3

Novosibirskoye

68

112

2

270

20

28.2

Nueltin

21

21

5

99

15

28.2

Oahe

52

57

16

384

34

25.6

Onega

6

4

4

120

9

28.3

Ontario

3

3

2

63

9

28.3

Opinac

16

14

4

78

17

28.3

Peipus

6

6

2

55

13

28.2

Ranco

8

2

4

56

27

4.8

Rukwa

4

7

0

189

16

28.2

Rybinskoye

8

9

5

91

11

28.3

Saint_Jean

31

40

4

136

19

28.3

Sakakawea

16

27

24

198

15

28.2

Saksak

19

18

5

190

20

28.1

San_Martin

43

32

3

33

17

4.8

Saratovskoye

13

11

4

97

17

28.3

Sarykamish

4

3

8

103

18

28.3

Sasykkol

9

11

6

79

13

12.4

Segozerskoye

60

46

5

90

11

12.5

Soungari

22

31

1

186

18

28.2

Superior

3

3

2

66

5

28.3

Tana

6

5

4

51

16

28.3

Tanganika

11

9

44

62

12

28.2

Tangra-Yumco

17

16

27

215

28

25.5

Tchad

20

17

3

139

16

28.3

Tchany

31

20

30

106

19

28.2

Tharthar

7

13

3

64

15

28.3

Todos_Los_Santos

21

24

17

1408

20

28.3

Toktogul

32

39

4

1007

54

25.6

 

Concerning in-situ comparisons, one example is presented in Figure 1 (performed by LEGOS). Altimetry results, given by Sentinel-3A and Jason missions and different retracking algorithms, are compared to in situ data from 2016 to mid-2017 on the Issykkul lake. Whatever retracking algorithm is used for Sentinel-3A, the root mean square (rms) values are lower than the Jason ones. On this lake, the Sentinel-3A results show that the use of ICE-1 retracking (see [D2]) instead of the ocean one (model fit according to a Brownian model) enables to be more consistent with in situ data. NB: Ocean retracking will be mostly used except for several lakes with small areas for which ICE-1 is used (this choice is permanent).

Figure 1: Comparison between Sentinel-3A, Jason and in situ data on the Issykkul lake

A generalisation of this kind of comparisons will allow a better characterisation of the altimetry errors over each lake. Inter-calibration between altimetry missions can also be checked.

References

Bercher, N. (2008). Précision de l'altimétrie satellitaire radar sur les cours d'eau: Développement d'une méthode standard de quantification de la qualité des produits alti-hydrologiques et applications (Doctoral dissertation).

Sylvain Biancamaria, Thomas Schaedele, Denis Blumstein, Frédéric Frappart, François Boy, et al.. Validation of Jason-3 tracking modes over French rivers. Remote Sensing of Environment, Elsevier, 2018, 209, pp.77-89. 10.1016/j.rse.2018.02.037 Chelton, D. B., Esbensen, S. K., Schlax, M. G., Thum, N., Freilich, M. H., Wentz, F. J., ... & Schopf, P. S. (2001). Observations of coupling between surface wind stress and sea surface temperature in the eastern tropical Pacific. Journal of Climate, 14(7), 1479-1498.

Crétaux, J. F., Abarca-del-Río, R., Berge-Nguyen, M., Arsen, A., Drolon, V., Clos, G., & Maisongrande, P. (2016). Lake volume monitoring from space. Surveys in Geophysics, 37(2), 269-305.

Jekeli, C., & Dumrongchai, P. (2003). On monitoring a vertical datum with satellite altimetry and water-level gauge data on large lakes. Journal of Geodesy, 77(7-8), 447-453.

Schwatke, C., Dettmering, D., Bosch, W., and Seitz, F.: DAHITI – an innovative approach for estimating water level time series over inland waters using multi-mission satellite altimetry, Hydrol. Earth Syst. Sci., 19, 4345-4364, https://doi.org/10.5194/hess-19-4345-2015, 2015


This document has been produced in the context of the Copernicus Climate Change Service (C3S).

The activities leading to these results have been contracted by the European Centre for Medium-Range Weather Forecasts, operator of C3S on behalf of the European Union (Delegation Agreement signed on 11/11/2014 and Contribution Agreement signed on 22/07/2021). All information in this document is provided "as is" and no guarantee or warranty is given that the information is fit for any particular purpose.

The users thereof use the information at their sole risk and liability. For the avoidance of all doubt , the European Commission and the European Centre for Medium - Range Weather Forecasts have no liability in respect of this document, which is merely representing the author's view.

Related articles