Page History

This page describes the way the anomaly and uncertainty of the ensemble forecasts in the sub-seasonal and seasonal products are determined using the climatology as reference. This includes also how the dominant forecast anomaly category (the 'dominant' of the 7 predefined ones) and the uncertainty category (of the 3 predefined ones) of the ensemble forecasts are determined. This is a generic procedure, which is the same for both EFAS and GloFAS, as it is executed the same way for each river pixel, regardless of the resolution, and also the same for the sub-seasonal and seasonal products, as it works in the exact same way regardless of whether it is weekly mean values, as in the sub-seasonal, or monthly mean values, as in the seasonal.

...

Climatological percentiles and forecast anomaly categories

Currently, From the sub-seasonal climate sample uses 660 reforecast values, while the seasonal uses 500 values. From the climate sample 99 climate percentiles are determined, which represent equally likely (1% chance) segments of the river discharge value range that occurred in the 20-year climatological sample (both sub-seasonal and seasonal is currently based on 20 years). Figure 1 shows an example idealised generic climate distribution, either based on weekly means or monthly means, with the percentiles represented along the y-axis. Only the deciles (every 10%), the quartiles (25%, 50% and 75%), of which the middle (50%) is also called median, and few of the extreme percentiles are indicated near the minimum and maximum of the climatological range indicated by black crosses. Each of these percentiles have an equivalent river discharge value along the x-axis. From one percentile to the next, the river discharge value range is divided into 100 equally likely bins (separated by the percentiles), some of which is indicated in Figure 1, such as bin1 of values below the 1st percentile, bin2 of values between the 1st and 2nd percentiles or bin 100 of river discharge values above the 99th percentiles, etc.

...

In the extreme case of all climate percentiles being 0, which happen over river pixels of the driest places of the world, such as the Sahara, the ensemble forecast member ranks can either be 100 for any non-zero value, regardless of the magnitude of the river discharge, or the evenly spread ranks from 1 to 100, as a representation of the totally 0 climatology. In the absolute most extreme case of all 99 climate percentiles being 0 and all 51 members being 0 in the forecast, the ranks of the forecast will be from 1 to 100 in equal representation. This means, this forecast will be a perfect representation of the climatological distribution, or with another word a perfectly 'normal' condition.

...

Forecast anomaly category computation for the whole ensemble

...

The ensemble forecasts have 51 members, which will be assigned an extremity rank each. Using these 51 ranks, the forecasts will be put in one of the 7 anomaly categories (as described in Table 1). This is done based on the arithmetic mean of the 51 ensemble member rank values (rank-mean) (see Figure 4). This rank-mean will also be a number between 1 and 100, but this time a real (not integer) number. If the anomaly is 50.5, that is exactly the normal (median) condition, i.e. no anomaly whatsoever. If the anomaly is below 50.5, then drier than normal conditions are forecast, if above 50.5, then wetter than normal. The lower/higher the anomaly value is below/above 50.5, the drier/wetter the conditions are predicted to be. The lowest/highest possible value is 1/100, if all ensemble members are 1/100 (the most extremely dry/wet). Then, based on this rank-mean, we define the dominant forecast anomaly category (one of the 7 categories in Table 1) for the ensemble forecast, by placing the rank-mean into the right categories, as defined in Table 1 above. For example, all rank-mean values from 40.0 to 60.0, interpreted as 40.0<= <60.0, will be assigned to 'Near normal', or category-4.
The ensemble forecast anomaly was not based on the most probable of the 7 anomaly categories, as that would make it prone to jumpiness. For example, in the super uncertain case of 6, 8, 7, 7, 7, 9, 7 members being in each of the 7 anomaly categories, the forecast category (the dominant one) would be the 'High' category (cat-6), as that has the most members (9). However, it is likely that nearby river pixels could easily be only slightly different with 7, 9, 7, 7, 7, 7, 7 members in each category, in which case the dominant anomaly forecast anomaly category would be the 'Low' category (cat-2), as now that has the most (again 9) members. It is worth mentioning that very uncertain cases are especially likely to happen at longer ranges. These two forecasts are only slightly different in terms of distribution, but the ensemble forecast anomaly categories would be almost the complete opposite of each other, making the signal look possibly very jumpy geographically. With the mean-rank definition we avoid this and simply assign the 'Near normal' category (cat-4) for both these forecasts, as the mean of the ranks are certainly very close to each other and both will be quite near the median.

Figure 24. Schematic of the forecast extremity ranking of the 51 ensemble members and the calculation of the dominant forecast anomaly category for the whole ensemble.

Forecast uncertainty category computation for the whole ensemble

...

In addition to the dominant forecast anomaly computation for the whole ensemble, as one of 7 predefined categories, the forecast uncertainty is also represented in the sub-seasonal and seasonal products, namely on the new river network and basin summary products. The forecast uncertainty is defined by the standard deviation (std) of the ensemble member ranks (rank-std). If the ensemble member ranks cluster well, and the spread of the ranks is low, then the forecast uncertainty will be low and conversely the confidence will be high.
...
For forecast uncertainty, three uncertainty categories are defined based on the rank-std value of the ensemble forecasts, using the easy to remember generic category split values of 10 and 20. (Table 2). These work well enough for all lead times and geographical areas.
Uncertainty categories Name Rank STD
Cat-1 Low uncertainty 0-10
Cat-2 Medium uncertainty 10-20
Cat-3 High uncertainty 20<
Table 2: Uncertainty categories defined by the standard deviation of the ensemble member ranks.

Examples to help

...

interpreting the methodology of the rank computation and the selection of the

...

forecast anomaly and the uncertainty categories

Below, there are examples with simplified rank distributions and specific cases of very dry rivers (0 values) that will demonstrate how the dominant forecast anomaly category and uncertainty category computation generation work.

In these examples, for simplicity reason, a fixed portion of the climatological and/or ensemble forecast distribution is 0, while the non-zero ensemble members have just one other rank for the very dry cases and 5 for the non-zero cases, with members having the same rank in each group. This way, the computation methodology can be demonstrated in a simple way that is easier to interpret.

...

Example No-2/3/4/5 then demonstrate the complexity of the dry cases, when some portion or all of the climatological percentiles are 0. It is generally noticeable that as the percentage of the zero climate percentiles increase, it gets more and more difficult to end up with negative anomalies. The absolute lowest possible rank-mean values are when all ensemble forecast members are 0 and their average rank is determined by the 0-value section of the climatology. So, for the option of 10% of climatology being 0, the minimum possible rank is 5.5, while for say 30% it will be 15.5 and for the option of all 99 percentiles being zero, the rank-mean will be 50.5. The extent of which the rank-mean increases depends on how many of the ensemble members will be non-zero and with which actual rank (determined in the non-zero section of the climatology). For example, one of the most extreme cases is when all 99 climatological percentiles are 0 and none of the ensemble forecast members are 0. For this super unlikely to occur event, the rank-mean will always be 100 (and the dominant anomaly forecast anomaly category 'Extreme high'), regardless of the actual ensemble member values. So, even if all the forecast ensemble member river discharge is values are very low, say from 0.12 to 0.23, the forecast rank-mean will still be 100 and the dominant forecast anomaly category 'Extreme high'.

...

Number of members in each group					Rank in each group					Rank-mean	Dominant Forecast anomaly category	Rank-std	Uncertainty category
N1	N2	N3	N4	N5	R1	R2	R3	R4	R5	Rank-mean	Dominant Forecast anomaly category	Rank-std	Uncertainty category
10	10	11	10	10	40	45	50	55	60	50.0	Near normal (40-60)	7.00	Low uncertainty
10	10	11	10	10	30	40	50	60	70	50.0	Near normal (40-60)	14.00	Medium uncertainty
10	10	11	10	10	10	30	50	70	90	50.0	Near normal (40-60)	28.00	High uncertainty

10	10	11	10	10	60	65	70	75	80	70.0	Bit high (60-75)	7.00	Low uncertainty
10	10	11	10	10	50	60	70	80	90	70.0	Bit high (60-75)	14.00	Medium uncertainty

0	10	31	10	0		45	50	55		50.0	Near normal (40-60)	3.13	Low uncertainty
0	10	31	10	0		40	50	60		50.0	Near normal (40-60)	6.26	Low uncertainty
0	10	31	10	0		30	50	70		50.0	Near normal (40-60)	12.52	Medium uncertainty

2	10	27	10	2	1	45	50	55	100	50.03	Near normal (40-60)	14.21	Medium uncertainty
2	10	27	10	2	1	40	50	60	100	50.03	Near normal (40-60)	15.21	Medium uncertainty
2	10	27	10	2	1	30	50	70	100	50.0	Near normal (40-60)	18.64	Medium uncertainty
2	10	27	10	2	1	20	50	80	100	50.0	Near normal (40-60)	23.34	High uncertainty
2	10	27	10	2	1	10	50	90	100	50.0	Near normal (40-60)	28.62	High uncertainty

...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Dominant Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	11 (the lowest possible rank for a non-zero member if 1-10 percentiles in the climatology are 0)	(0 * 5.5 + 51 * 11)/51 = 11	Low (10-25)	0	Low uncertainty
0	51	NA	20	(0 * 5.5 + 51 * 20)/51 = 20	Low (10-25)	0	Low uncertainty
0	51	NA	50	(0 * 5.5 + 51 * 50)/51 = 50	Near normal (40-60)	0	Low uncertainty
0	51	NA	70	(0 * 5.5 + 51 * 70)/51 = 70	Bit high (60-75)	0	Low uncertainty
0	51	NA	100	(0 * 5.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty

11	40	5.5	11	(11 * 5.5 + 40 * 11)/51 = 9.81	Extreme low (<10)	2.26	Low uncertainty
11	40	5.5	20	(11 * 5.5 + 40 * 20)/51 = 16.87	Low (10-25)	5.96	Low uncertainty
11	40	5.5	50	(11 * 5.5 + 40 * 50)/51 = 40.40	Near normal (40-60)	18.3	Medium uncertainty
11	40	5.5	70	(11 * 5.5 + 40 * 70)/51 = 56.08	Near normal (40-60)	26.52	High uncertainty
11	40	5.5	100	(11 * 5.5 + 40 * 100)/51 = 79.61	High (75-90)	38.86	High uncertainty

21	30	5.5	11	(21 * 5.5 + 30 * 11)/51 = 8.73	Extreme low (<10)	2.70	Low uncertainty
21	30	5.5	20	(21 * 5.5 + 30 * 20)/51 = 14.02	Low (10-25)	7.13	Low uncertainty
21	30	5.5	50	(21 * 5.5 + 30 * 50)/51 = 31.67	Bit low (25-40)	21.90	High uncertainty
21	30	5.5	70	(21 * 5.5 + 30 * 70)/51 = 43.44	Near normal (40-60)	31.74	High uncertainty
21	30	5.5	100	(21 * 5.5 + 30 * 50)/51 = 61.08	Bit high (60-75)	46.50	High uncertainty

36	15	5.5	11	(36 * 5.5 + 15 * 11)/51 = 7.11	Extreme low (<10)	2.50	Low uncertainty
36	15	5.5	20	(36 * 5.5 + 15 * 20)/51 = 9.76	Extreme low (<10)	6.60	Low uncertainty
36	15	5.5	50	(36 * 5.5 + 15 * 50)/51 = 18.58	Low (10-25)	20.27	High uncertainty
36	15	5.5	70	(36 * 5.5 + 15 * 70)/51 = 24.47	Low (10-25)	29.38	High uncertainty
36	15	5.5	100	(36 * 5.5 + 15 * 50)/51 = 33.29	Bit low (25-40)	43.05	High uncertainty

51	0	5.5	NA (no member to rank)	(51 * 5.5 + 0)/51 = 5.5	Extreme low (<10)	0	Low uncertainty

...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Dominant Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	31 (the lowest possible rank for a non-zero member if 1-30 percentiles in the climatology are 0)	(0 * 15.5 + 51 * 31)/51 = 31	Bit low (25-40)	0	Low uncertainty
0	51	NA	50	(0 * 15.5 + 51 * 50)/51 = 50	Near normal (40-60)	0	Low uncertainty
0	51	NA	70	(0 * 15.5 + 51 * 70)/51 = 70	Bit high (60-75)	0	Low uncertainty
0	51	NA	100	(0 * 15.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty

11	40	15.5	31	(11 * 15.5 + 40 * 31)/51 = 27.65	Bit low (25-40)	6.37	Low uncertainty
11	40	15.5	50	(11 * 15.5 + 40 * 50)/51 = 42.55	Near normal (40-60)	14.18	Medium uncertainty
11	40	15.5	70	(11 * 15.5 + 40 * 70)/51 = 58.24	Near normal (40-60)	22.41	High uncertainty
11	40	15.5	100	(11 * 15.5 + 40 * 100)/51 = 81.77	High (75-90)	34.75	High uncertainty

21	30	15.5	31	(21 * 15.5 + 30 * 31)/51 = 24.61	Low (10-25)	7.62	Low uncertainty
21	30	15.5	50	(21 * 15.5 + 30 * 50)/51 = 35.79	Bit low (25-40)	16.97	Medium uncertainty
21	30	15.5	70	(21 * 15.5 + 30 * 70)/51 = 47.55	Near normal (40-60)	26.82	High uncertainty
21	30	15.5	100	(21 * 15.5 + 30 * 100)/51 = 65.20	Bit high (60-75)	41.58	High uncertainty

36	15	15.5	31	(36 * 15.5 + 15 * 31)/51 = 20.05	Low (10-25)	7.06	Low uncertainty
36	15	15.5	50	(36 * 15.5 + 15 * 50)/51 = 25.64	Bit low (25-40)	15.71	Medium uncertainty
36	15	15.5	70	(36 * 15.5 + 15 * 70)/51 = 31.52	Bit low (25-40)	24.83	High uncertainty
36	15	15.5	100	(36 * 15.5 + 15 * 100)/51 = 40.35	Near normal (40-60)	38.50	High uncertainty

51	0	15.5	NA (no member to rank)	(51 * 15.5 + 0)/51 = 15.5	Low (10-25)	0	Low uncertainty

...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Dominant Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	71 (the lowest possible rank for a non-zero member if 1-70 percentiles in the climatology are 0)	(0 * 35.5 + 51 * 71)/51 = 71	Bit high (60-75)	0	Low uncertainty
0	51	NA	100	(0 * 35.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty
11	40	35.5	71	(11 * 35.5 + 40 * 71)/51 = 63.34	Bit high (60-75)	14.60	Medium uncertainty
11	40	35.5	100	(11 * 35.5 + 40 * 100)/51 = 86.08	High (75-90)	26.52	High uncertainty
21	30	35.5	71	(21 * 35.5 + 30 * 71)/51 = 56.38	Near normal (40-60)	17.47	Medium uncertainty
21	30	35.5	100	(21 * 35.5 + 30 * 100)/51 = 73.44	Bit high (60-75)	31.74	High uncertainty
36	15	35.5	71	(36 * 35.5 + 15 * 71)/51 = 45.94	Near normal (40-60)	16.17	Medium uncertainty
36	15	35.5	100	(36 * 35.5 + 15 * 100)/51 = 54.47	Near normal (40-60)	29.38	High uncertainty
51	0	35.5	NA (no member to rank)	(51 * 35.5 + 0)/51 = 35.5	Bit low (25-40)	0	Low uncertainty

...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Dominant Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	100 (the lowest possible rank for a non-zero member if 1-99 percentiles in the climatology are 0)	(0 * 50.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty
11	40	50.5	100	(11 * 50.5 + 40 * 100)/51 = 89.32	High (75-90)	20.35	High uncertainty
21	30	35.5	100	(21 * 50.5 + 30 * 100)/51 = 79.61	High (75-90)	24.36	High uncertainty
36	15	35.5	100	(36 * 50.5 + 15 * 100)/51 = 65.05	Bit high (60-75)	22.55	High uncertainty
51	0	35.5	NA (no member to rank)	(51 * 50.5 + 0)/51 = 50.5	Near normal (40-60)	0	Low uncertainty

...

Page tree

Versions Compared

Old Version 19

New Version 20

Key