Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This page describes the way the anomaly signal and uncertainty of the ensemble forecasts in the sub-seasonal and seasonal products are determined using the climatology as reference. This includes also how the expected forecast anomaly category (amongst 5 or 7 pre-defined categories) and the uncertainty category (divided in 3 categories low/medium/high) of the ensemble forecasts are determined. This is a generic procedure, which is the same for both EFAS and GloFAS, as it is executed the same way for each river pixel, regardless of the resolution, and also the same for the sub-seasonal and seasonal products, as it works in the exact same way regardless of whether it is weekly mean values, as in the sub-seasonal, or monthly mean values, as in the seasonal.

...

From the climate sample, 99 climate percentiles are determined, which represent the range of equally likely (1% chance) segments of the river discharge magnitude that occurred in the 20-year climatological sample (both sub-seasonal and seasonal is currently based on 20 years). Figure 1 shows an example generic climate distribution, either based on weekly means or monthly means, with  with the percentiles represented along the y-axis. Only the deciles (every 10%), the quartiles (25%, 50% and 75%), of which the middle (50%) is also called median, and few of the extreme percentiles are indicated near the minimum and maximum of the climatological range shown by black crosses. Each of these percentiles have an equivalent river discharge value magnitude along the x-axis. From one percentile to the next, the river discharge value magnitude range is divided into 100 equally likely bins (separated by the percentiles), some of which is indicated in Figure 1percentiles), such as bin1 of values below the 1st percentile, bin2 of values between the 1st and 2nd percentiles or bin 100 of river discharge values above the 99th percentiles, etc. In Figure 1, only the deciles (every 10%), the quartiles (25%, 50% and 75%), of which the middle (50%) is also called median, and few of the extreme percentiles are indicated near the minimum and maximum of the climatological range shown by black crosses. 


Figure 1. Schematic of the forecast anomaly categories, defined by the climatological distribution.

...


Table 1: Definition and description of the 7-anomaly categories. The possible value ranges in the 'Rank column' are inclusive at the start and exclusive at the end, so for example for the category of 'Extreme low' the possible ranks are 1, 2, 3, ... and 10. For the graphical products (maps and hydrographs), the middle three categories ('Bit low', 'Normal' and 'Bit high') are combined into one extended 'Near normal' category. The categories are colour-coded as they appear on the web products.

...

The sub-seasonal or seasonal forecasts have 51 ensemble members each. The members are all checked for climatological extremity and placed in one of the 100 climate bins (modelled climate conditions for this time of year, location and lead time); the corresponding allocated bin corresponds to the the anomaly or extremity level of the ensemble member, called hereafter rank (from 1 to 100). For example, rank 1 means the forecast value is below the 1st climate percentile (i.e. extremely anomalously low, less than the value that happened in the climatological period only 1% of the time), rank 2 means the value is between the 1st and 2nd climate percentiles (i.e. slightly less extremely low), etc., and finally rank 100 means the forecast value is above the 99th climate percentile (i.e. extremely high as higher than 99% of all the considered reforecastsclimatological distribution).

Figure 2 shows the process of determining the ranks for rank of each ensemble member. In this example, the lowest member gets the rank of 54 (red r54 on the graph in Figure 2) by moving vertically until crossing the climatological distribution and then moving horizontally to the y-axis to determine the two bounding percentiles and thus the right percentile bin. In this case, the lowest ensemble member value is between the 53rd and 54th percentile, which results in bin-54. Then all ensemble members, similarly, get a bin number, the 2nd lowest values with bin-60 and so on until the largest ensemble member value getting bin-97, as the river discharge value is between the 96th and 97th percentiles. 


Figure 2. Schematic of the forecast extremity ranking of the 51 ensemble members and the 7-anomaly categories in the context of the climatological distribution.

The probability of the forecast to be within one of the 7-anomaly category is calculated by counting of the ensemble members in each category and then dividing by 51, the total number of members. In the example of Figure 2, there is no member in the 3 low anomaly categories, while the 'Normal' category has 2 members, resulting in a 3.9% probability, the 'Bit high' category has 13 members, with a probability of 27.5%, the 'High' category has 17 members, with a probability of 33.3%, and finally the 'Extreme high' category has 18 ensemble members, with a 35.3% probability. The inset table in Figure 2 shows the number the number of ensemble members with the corresponding probabilities in the 7 categories, but also shows the climatological 'size' in terms of probabilities of the 7 categories. For ease of interpretation, the 7 categories are displayed here with different colours. This highlights, e.g., that the normal flow category's 3.9% probability is much lower than the climatologically expected probability of 20%, however, the three high flow categories have each much higher probabilities than the climatological reference probability, especially the extreme high category, where the forecast probability (35.3%) is more than double the corresponding climatological probability (15%). In addition, the extended 'Near normal' category would have 15 members with 31.4% probability, which is lower than the climatological probability of 50%.  

...

Expand
titleRank computation for 0-value singularity explained

For the forecast rank computation in the 0-value singularity case, a special solution was developed. All the 0 ensemble member values (all below 0.1 m3/s) get an evenly-representing rank assigned from any of the percentiles that have 0 values (i.e. below 0.1 m3/s) in the model climatology. In practice, this will mean, the 'rank-undefined' section of the ensemble forecast is going to be spread evenly across the 'rank-undefined' section of the climatology during the rank computation.

Figure 3 demonstrates the process on an idealised example, where the lowest 77 percentiles are 0 in the climatology and 23 out of 51 ensemble members are also 0 (see Figure 3a). The 23 ensemble members with 0 value then are spread across the 0-value range of the climatology from 1 to 77 (see Figure 3b). This way the ranks of the 23 members will be assigned from 1 to 77 with equal as possible spacing in between (see Figure 3c). Finally, the remaining non-zero ensemble members also get their ranks in the usual way, as described above in Figure 2. Finally, the schematic of ranks of all 51 members are provided in Figure 3d.

a)Image Modified

b)Image Modified

c)Image Modified

d)Image Modified

Figure 3. Schematic of the forecast extremity ranking calculation for areas with 0 river discharge values.

In the extreme case of all climate percentiles being 0, which happen over river pixels of the driest places of the world, such as the Sahara, the ensemble forecast member ranks can either be 100 for any non-zero value, regardless of the magnitude of the river discharge, or the evenly spread ranks from 1 to 100, as a representation of the totally 0 climatology. In the absolute most extreme case of all 99 climate percentiles being 0 and all 51 members being 0 in the forecast, the ranks of the forecast will be from 1 to 100 in equal representation. This means, this forecast will be a perfect representation of the climatological distribution, or with another word a forecast showing climatologically expected probabilities for all anomaly categories from Extreme low to Extreme high.

...

Panel
titleExamples with no 0-value section in the climatology and 5 ensemble forecast rank groups only

Few examples are shown here, when there is no 0-value in the climatology, so all ensemble forecast members can be ranked without any issue. For simplicity, 5 groups are used in the forecast only. The table below shows the numbers and the related average ranks for the 5 groups, with the rank-mean, rank-std and expected anomaly and uncertainty categories determined from those cases. For example, in the first row, 10 ensemble members are in the first group, which will all have the rank of 40. Then 10 members will be in the 2nd group with the rank of 45, and so on. The rank-mean of this simplified forecast distribution will be very close to 50 (mean of 40-45-50-55-60 with almost the same population in each group) and the rank-std will be about 7. This puts this forecast case into the 'Normal' expected anomaly category (rank-mean between 40 and 60) and the 'Low' uncertainty category (rank-set below 10).

The even distribution is represented first below, for which it is shown that by shifting the same rank distribution up or down does not change the standard deviation (and uncertainty). This is true for any variety of rank distributions. Also, after 'narrowing' the rank distribution, the mean does not change, but the uncertainty drops markedly. Moreover, in a similar manner, by adding extreme members (i.e. 1 or 100 or near that), even if only with very few members (2 in this example below), the uncertainty can be increased quite substantially.

Expand
titleTable with no 0-value section in climatology. Click here to expand...
Number of ensemble members in each groupCommon rank in each groupRank-meanExpected forecast anomaly categoryRank-stdForecast uncertainty category
N1N2N3N4N5R1R2R3R4R5
1010111010404550556050.0Near normal (40-60)7.00Low uncertainty
1010111010304050607050.0Near normal (40-60)14.00Medium uncertainty
1010111010103050709050.0Near normal (40-60)28.00High uncertainty














1010111010606570758070.0Bit high (60-75)7.00Low uncertainty
1010111010506070809070.0Bit high (60-75)14.00Medium uncertainty














01031100
455055
50.0Near normal (40-60)3.13Low uncertainty
01031100
405060
50.0Near normal (40-60)6.26Low uncertainty
01031100
305070
50.0Near normal (40-60)12.52Medium uncertainty














21027102145505510050.03Near normal (40-60)14.21Medium uncertainty
21027102140506010050.03Near normal (40-60)15.21Medium uncertainty
21027102130507010050.0Near normal (40-60)18.64Medium uncertainty
21027102120508010050.0Near normal (40-60)23.34High uncertainty
21027102110509010050.0Near normal (40-60)28.62High uncertainty
Panel
titleExamples with some percentage of 0-value section in the climatology

In these examples, again for simplicity reasons, the climatological and forecast values will only be in one of 2 categories, either 0-value or non 0-value. This way, the main impact of the 0/non-0 value issue can be demonstrated. In the tables below, the numbers and the related average ranks are given for the two groups of 0 and non-0 ensemble members, with the rank-mean, rank-std and expected anomaly and uncertainty categories determined from those cases.

There are 4 tables, with 10%, 30%, 70%  and 100% of 0-value in the climatology (i.e increasingly dry climate). For example, in the 7th row of the 1st table with 10% of 0 in the climatology, 11 ensemble members are 0-value and the remaining 40 are greater than 0. The average rank for the 0-value members are 5.5 (as this is given by the method of handling the 0-value issue with equal representation, explained above), while the average rank for the non-zero members is given as an example of 11. The related rank-mean is then 9.81, making this forecast into the 'Extreme low' expected category while the rank-std is 2.26, with low uncertainty category.

These tables demonstrate the complex interaction between the dryness of the climatology and ensemble forecasts, reflected in the forecast rank-mean and rank-std values and the subsequent expected anomaly and uncertainty categories. They also demonstrate, how less likely it becomes to have negative anomalies as the climate becomes drier and drier.

Expand
titleTable with 10% of 0-value in climatology. Click here to expand...
Number of 0 membersNumber of non-0 membersAverage rank of 0 membersAverage rank of non-0 membersRank-meanForecast anomaly categoryRank-stdUncertainty category
051NA (no member to rank)11 (the lowest possible rank for a non-zero member if 1-10 percentiles in the climatology are 0)(0 * 5.5 + 51 * 11)/51 = 11Low (10-25)0Low uncertainty
051NA20(0 * 5.5 + 51 * 20)/51 = 20Low (10-25)0Low uncertainty
051NA50(0 * 5.5 + 51 * 50)/51 = 50Near normal (40-60)0Low uncertainty
051NA70(0 * 5.5 + 51 * 70)/51 = 70Bit high (60-75)0Low uncertainty
051NA100(0 * 5.5 + 51 * 100)/51 = 100Extreme high (90<)0Low uncertainty








11405.511(11 * 5.5 + 40 * 11)/51 = 9.81Extreme low (<10)2.26Low uncertainty
11405.520(11 * 5.5 + 40 * 20)/51 = 16.87Low (10-25)5.96Low uncertainty
11405.550(11 * 5.5 + 40 * 50)/51 = 40.40Near normal (40-60)18.3Medium uncertainty
11405.570(11 * 5.5 + 40 * 70)/51 = 56.08Near normal (40-60)26.52High uncertainty
11405.5100(11 * 5.5 + 40 * 100)/51 = 79.61High (75-90)38.86High uncertainty








21305.511

(21 * 5.5 + 30 * 11)/51 = 8.73

Extreme low (<10)2.70Low uncertainty
21305.520(21 * 5.5 + 30 * 20)/51 = 14.02Low (10-25)7.13Low uncertainty
21305.550(21 * 5.5 + 30 * 50)/51 = 31.67Bit low (25-40)21.90High uncertainty
21305.570(21 * 5.5 + 30 * 70)/51 = 43.44Near normal (40-60)31.74High uncertainty
21305.5100(21 * 5.5 + 30 * 50)/51 = 61.08Bit high (60-75)46.50High uncertainty








36155.511

(36 * 5.5 + 15 * 11)/51 = 7.11

Extreme low (<10)2.50Low uncertainty
36155.520(36 * 5.5 + 15 * 20)/51 = 9.76Extreme low (<10)6.60Low uncertainty
36155.550(36 * 5.5 + 15 * 50)/51 = 18.58Low (10-25)20.27High uncertainty
36155.570(36 * 5.5 + 15 * 70)/51 = 24.47Low (10-25)29.38High uncertainty
36155.5100(36 * 5.5 + 15 * 50)/51 = 33.29Bit low (25-40)43.05High uncertainty








5105.5NA (no member to rank)

(51 * 5.5 + 0)/51 = 5.5

Extreme low (<10)0Low uncertainty
Expand
titleTable with 10% 0-value in climatology. Click here to expand...
Number of 0 membersNumber of non-0 membersAverage rank of 0 membersAverage rank of non-0 membersRank-meanForecast anomaly categoryRank-stdUncertainty category
051NA (no member to rank)31 (the lowest possible rank for a non-zero member if 1-30 percentiles in the climatology are 0)(0 * 15.5 + 51 * 31)/51 = 31Bit low (25-40)0Low uncertainty
051NA50(0 * 15.5 + 51 * 50)/51 = 50Near normal (40-60)0Low uncertainty
051NA70(0 * 15.5 + 51 * 70)/51 = 70Bit high (60-75)0Low uncertainty
051NA100(0 * 15.5 + 51 * 100)/51 = 100Extreme high (90<)0Low uncertainty








114015.531(11 * 15.5 + 40 * 31)/51 = 27.65Bit low (25-40)6.37Low uncertainty
114015.550(11 * 15.5 + 40 * 50)/51 = 42.55Near normal (40-60)14.18Medium uncertainty
114015.570(11 * 15.5 + 40 * 70)/51 = 58.24Near normal (40-60)22.41High uncertainty
114015.5100(11 * 15.5 + 40 * 100)/51 = 81.77High (75-90)34.75High uncertainty








213015.531

(21 * 15.5 + 30 * 31)/51 = 24.61

Low (10-25)7.62Low uncertainty
213015.550(21 * 15.5 + 30 * 50)/51 = 35.79Bit low (25-40)16.97Medium uncertainty
213015.570(21 * 15.5 + 30 * 70)/51 = 47.55Near normal (40-60)26.82High uncertainty
213015.5100

(21 * 15.5 + 30 * 100)/51 = 65.20

Bit high (60-75)41.58High uncertainty








361515.531

(36 * 15.5 + 15 * 31)/51 = 20.05

Low (10-25)7.06Low uncertainty
361515.550(36 * 15.5 + 15 * 50)/51 = 25.64Bit low (25-40)15.71Medium uncertainty
361515.570(36 * 15.5 + 15 * 70)/51 = 31.52Bit low (25-40)24.83High uncertainty
361515.5100(36 * 15.5 + 15 * 100)/51 = 40.35Near normal (40-60)38.50High uncertainty








51015.5NA (no member to rank)

(51 * 15.5 + 0)/51 = 15.5

Low (10-25)0Low uncertainty
Expand
titleTable with 70% of 0-value in climatology. Click here to expand...
Number of 0 membersNumber of non-0 membersAverage rank of 0 membersAverage rank of non-0 membersRank-meanForecast anomaly categoryRank-stdUncertainty category
051NA (no member to rank)71 (the lowest possible rank for a non-zero member if 1-70 percentiles in the climatology are 0)(0 * 35.5 + 51 * 71)/51 = 71Bit high (60-75)0Low uncertainty
051NA100(0 * 35.5 + 51 * 100)/51 = 100Extreme high (90<)0Low uncertainty
114035.571(11 * 35.5 + 40 * 71)/51 = 63.34Bit high (60-75)14.60Medium uncertainty
114035.5100(11 * 35.5 + 40 * 100)/51 = 86.08High (75-90)26.52High uncertainty
213035.571

(21 * 35.5 + 30 * 71)/51 = 56.38

Near normal (40-60)17.47Medium uncertainty
213035.5100

(21 * 35.5 + 30 * 100)/51 = 73.44

Bit high (60-75)31.74High uncertainty
361535.571

(36 * 35.5 + 15 * 71)/51 = 45.94

Near normal (40-60)16.17Medium uncertainty
361535.5100(36 * 35.5 + 15 * 100)/51 = 54.47Near normal (40-60)29.38High uncertainty
51035.5NA (no member to rank)

(51 * 35.5 + 0)/51 = 35.5

Bit low (25-40)0Low uncertainty
Expand
titleTable with 100% of 0-value in climatology. Click here to expand...
Number of 0 membersNumber of non-0 membersAverage rank of 0 membersAverage rank of non-0 membersRank-meanForecast anomaly categoryRank-stdUncertainty category
051NA (no member to rank)100 (the lowest possible rank for a non-zero member if 1-99 percentiles in the climatology are 0)(0 * 50.5 + 51 * 100)/51 = 100Extreme high (90<)0Low uncertainty
114050.5100(11 * 50.5 + 40 * 100)/51 = 89.32High (75-90)20.35High uncertainty
213035.5100

(21 * 50.5 + 30 * 100)/51 = 79.61

High (75-90)24.36High uncertainty
361535.5100(36 * 50.5 + 15 * 100)/51 = 65.05Bit high (60-75)22.55High uncertainty
51035.5NA (no member to rank)

(51 * 50.5 + 0)/51 = 50.5

Near normal (40-60)0Low uncertainty