Page History

This page describes the way the anomaly signal and uncertainty of the ensemble forecasts in the sub-seasonal and seasonal products are determined using the climatology as reference. This includes also how the expected forecast anomaly category (amongst 5 or 7 pre-defined categories) and the uncertainty category (divided in 3 categories low/medium/high) of the ensemble forecasts are determined. This is a generic procedure, which is the same for both EFAS and GloFAS, as it is executed the same way for each river pixel, regardless of the resolution, and also the same for the sub-seasonal and seasonal products, as it works in the exact same way regardless of whether it is weekly mean values, as in the sub-seasonal, or monthly mean values, as in the seasonal.

...

From the climate sample, 99 climate percentiles are determined, which represent the range of equally likely (1% chance) segments of the river discharge magnitude that occurred in the 20-year climatological sample (both sub-seasonal and seasonal is currently based on 20 years). Figure 1 shows an example generic climate distribution, either based on weekly means or monthly means, with with the percentiles represented along the y-axis. Only the deciles (every 10%), the quartiles (25%, 50% and 75%), of which the middle (50%) is also called median, and few of the extreme percentiles are indicated near the minimum and maximum of the climatological range shown by black crosses. Each of these percentiles have an equivalent river discharge value magnitude along the x-axis. From one percentile to the next, the river discharge value magnitude range is divided into 100 equally likely bins (separated by the percentiles), some of which is indicated in Figure 1percentiles), such as bin1 of values below the 1st percentile, bin2 of values between the 1st and 2nd percentiles or bin 100 of river discharge values above the 99th percentiles, etc. In Figure 1, only the deciles (every 10%), the quartiles (25%, 50% and 75%), of which the middle (50%) is also called median, and few of the extreme percentiles are indicated near the minimum and maximum of the climatological range shown by black crosses.

Figure 1. Schematic of the forecast anomaly categories, defined by the climatological distribution.

...

Table 1: Definition and description of the 7-anomaly categories. The possible value ranges in the 'Rank column' are inclusive at the start and exclusive at the end, so for example for the category of 'Extreme low' the possible ranks are 1, 2, 3, ... and 10. For the graphical products (maps and hydrographs), the middle three categories ('Bit low', 'Normal' and 'Bit high') are combined into one extended 'Near normal' category. The categories are colour-coded as they appear on the web products.

...

The sub-seasonal or seasonal forecasts have 51 ensemble members each. The members are all checked for climatological extremity and placed in one of the 100 climate bins (modelled climate conditions for this time of year, location and lead time); the corresponding allocated bin corresponds to the the anomaly or extremity level of the ensemble member, called hereafter rank (from 1 to 100). For example, rank 1 means the forecast value is below the 1st climate percentile (i.e. extremely anomalously low, less than the value that happened in the climatological period only 1% of the time), rank 2 means the value is between the 1st and 2nd climate percentiles (i.e. slightly less extremely low), etc., and finally rank 100 means the forecast value is above the 99th climate percentile (i.e. extremely high as higher than 99% of all the considered reforecastsclimatological distribution).

Figure 2 shows the process of determining the ranks for rank of each ensemble member. In this example, the lowest member gets the rank of 54 (red r54 on the graph in Figure 2) by moving vertically until crossing the climatological distribution and then moving horizontally to the y-axis to determine the two bounding percentiles and thus the right percentile bin. In this case, the lowest ensemble member value is between the 53rd and 54th percentile, which results in bin-54. Then all ensemble members, similarly, get a bin number, the 2nd lowest values with bin-60 and so on until the largest ensemble member value getting bin-97, as the river discharge value is between the 96th and 97th percentiles.

Figure 2. Schematic of the forecast extremity ranking of the 51 ensemble members and the 7-anomaly categories in the context of the climatological distribution.

The probability of the forecast to be within one of the 7-anomaly category is calculated by counting of the ensemble members in each category and then dividing by 51, the total number of members. In the example of Figure 2, there is no member in the 3 low anomaly categories, while the 'Normal' category has 2 members, resulting in a 3.9% probability, the 'Bit high' category has 13 members, with a probability of 27.5%, the 'High' category has 17 members, with a probability of 33.3%, and finally the 'Extreme high' category has 18 ensemble members, with a 35.3% probability. The inset table in Figure 2 shows the number the number of ensemble members with the corresponding probabilities in the 7 categories, but also shows the climatological 'size' in terms of probabilities of the 7 categories. For ease of interpretation, the 7 categories are displayed here with different colours. This highlights, e.g., that the normal flow category's 3.9% probability is much lower than the climatologically expected probability of 20%, however, the three high flow categories have each much higher probabilities than the climatological reference probability, especially the extreme high category, where the forecast probability (35.3%) is more than double the corresponding climatological probability (15%). In addition, the extended 'Near normal' category would have 15 members with 31.4% probability, which is lower than the climatological probability of 50%.

...

Expand

title	Rank computation for 0-value singularity explained

For the forecast rank computation in the 0-value singularity case, a special solution was developed. All the 0 ensemble member values (all below 0.1 m3/s) get an evenly-representing rank assigned from any of the percentiles that have 0 values (i.e. below 0.1 m3/s) in the model climatology. In practice, this will mean, the 'rank-undefined' section of the ensemble forecast is going to be spread evenly across the 'rank-undefined' section of the climatology during the rank computation.

Figure 3 demonstrates the process on an idealised example, where the lowest 77 percentiles are 0 in the climatology and 23 out of 51 ensemble members are also 0 (see Figure 3a). The 23 ensemble members with 0 value then are spread across the 0-value range of the climatology from 1 to 77 (see Figure 3b). This way the ranks of the 23 members will be assigned from 1 to 77 with equal as possible spacing in between (see Figure 3c). Finally, the remaining non-zero ensemble members also get their ranks in the usual way, as described above in Figure 2. Finally, the schematic of ranks of all 51 members are provided in Figure 3d.

a)Image Modified	b)Image Modified
c)Image Modified	d)Image Modified

Figure 3. Schematic of the forecast extremity ranking calculation for areas with 0 river discharge values.

In the extreme case of all climate percentiles being 0, which happen over river pixels of the driest places of the world, such as the Sahara, the ensemble forecast member ranks can either be 100 for any non-zero value, regardless of the magnitude of the river discharge, or the evenly spread ranks from 1 to 100, as a representation of the totally 0 climatology. In the absolute most extreme case of all 99 climate percentiles being 0 and all 51 members being 0 in the forecast, the ranks of the forecast will be from 1 to 100 in equal representation. This means, this forecast will be a perfect representation of the climatological distribution, or with another word a forecast showing climatologically expected probabilities for all anomaly categories from Extreme low to Extreme high.

...

Panel

title	Examples with no 0-value section in the climatology and 5 ensemble forecast rank groups only

Few examples are shown here, when there is no 0-value in the climatology, so all ensemble forecast members can be ranked without any issue. For simplicity, 5 groups are used in the forecast only. The table below shows the numbers and the related average ranks for the 5 groups, with the rank-mean, rank-std and expected anomaly and uncertainty categories determined from those cases. For example, in the first row, 10 ensemble members are in the first group, which will all have the rank of 40. Then 10 members will be in the 2nd group with the rank of 45, and so on. The rank-mean of this simplified forecast distribution will be very close to 50 (mean of 40-45-50-55-60 with almost the same population in each group) and the rank-std will be about 7. This puts this forecast case into the 'Normal' expected anomaly category (rank-mean between 40 and 60) and the 'Low' uncertainty category (rank-set below 10).

The even distribution is represented first below, for which it is shown that by shifting the same rank distribution up or down does not change the standard deviation (and uncertainty). This is true for any variety of rank distributions. Also, after 'narrowing' the rank distribution, the mean does not change, but the uncertainty drops markedly. Moreover, in a similar manner, by adding extreme members (i.e. 1 or 100 or near that), even if only with very few members (2 in this example below), the uncertainty can be increased quite substantially.

Expand

title	Table with no 0-value section in climatology. Click here to expand...

Number of ensemble members in each group					Common rank in each group					Rank-mean	Expected forecast anomaly category	Rank-std	Forecast uncertainty category
N1	N2	N3	N4	N5	R1	R2	R3	R4	R5	Rank-mean	Expected forecast anomaly category	Rank-std	Forecast uncertainty category
10	10	11	10	10	40	45	50	55	60	50.0	Near normal (40-60)	7.00	Low uncertainty
10	10	11	10	10	30	40	50	60	70	50.0	Near normal (40-60)	14.00	Medium uncertainty
10	10	11	10	10	10	30	50	70	90	50.0	Near normal (40-60)	28.00	High uncertainty

10	10	11	10	10	60	65	70	75	80	70.0	Bit high (60-75)	7.00	Low uncertainty
10	10	11	10	10	50	60	70	80	90	70.0	Bit high (60-75)	14.00	Medium uncertainty

0	10	31	10	0		45	50	55		50.0	Near normal (40-60)	3.13	Low uncertainty
0	10	31	10	0		40	50	60		50.0	Near normal (40-60)	6.26	Low uncertainty
0	10	31	10	0		30	50	70		50.0	Near normal (40-60)	12.52	Medium uncertainty

2	10	27	10	2	1	45	50	55	100	50.03	Near normal (40-60)	14.21	Medium uncertainty
2	10	27	10	2	1	40	50	60	100	50.03	Near normal (40-60)	15.21	Medium uncertainty
2	10	27	10	2	1	30	50	70	100	50.0	Near normal (40-60)	18.64	Medium uncertainty
2	10	27	10	2	1	20	50	80	100	50.0	Near normal (40-60)	23.34	High uncertainty
2	10	27	10	2	1	10	50	90	100	50.0	Near normal (40-60)	28.62	High uncertainty

Panel

title	Examples with some percentage of 0-value section in the climatology

In these examples, again for simplicity reasons, the climatological and forecast values will only be in one of 2 categories, either 0-value or non 0-value. This way, the main impact of the 0/non-0 value issue can be demonstrated. In the tables below, the numbers and the related average ranks are given for the two groups of 0 and non-0 ensemble members, with the rank-mean, rank-std and expected anomaly and uncertainty categories determined from those cases.

There are 4 tables, with 10%, 30%, 70% and 100% of 0-value in the climatology (i.e increasingly dry climate). For example, in the 7th row of the 1st table with 10% of 0 in the climatology, 11 ensemble members are 0-value and the remaining 40 are greater than 0. The average rank for the 0-value members are 5.5 (as this is given by the method of handling the 0-value issue with equal representation, explained above), while the average rank for the non-zero members is given as an example of 11. The related rank-mean is then 9.81, making this forecast into the 'Extreme low' expected category while the rank-std is 2.26, with low uncertainty category.

These tables demonstrate the complex interaction between the dryness of the climatology and ensemble forecasts, reflected in the forecast rank-mean and rank-std values and the subsequent expected anomaly and uncertainty categories. They also demonstrate, how less likely it becomes to have negative anomalies as the climate becomes drier and drier.

Expand

title	Table with 10% of 0-value in climatology. Click here to expand...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	11 (the lowest possible rank for a non-zero member if 1-10 percentiles in the climatology are 0)	(0 * 5.5 + 51 * 11)/51 = 11	Low (10-25)	0	Low uncertainty
0	51	NA	20	(0 * 5.5 + 51 * 20)/51 = 20	Low (10-25)	0	Low uncertainty
0	51	NA	50	(0 * 5.5 + 51 * 50)/51 = 50	Near normal (40-60)	0	Low uncertainty
0	51	NA	70	(0 * 5.5 + 51 * 70)/51 = 70	Bit high (60-75)	0	Low uncertainty
0	51	NA	100	(0 * 5.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty

11	40	5.5	11	(11 * 5.5 + 40 * 11)/51 = 9.81	Extreme low (<10)	2.26	Low uncertainty
11	40	5.5	20	(11 * 5.5 + 40 * 20)/51 = 16.87	Low (10-25)	5.96	Low uncertainty
11	40	5.5	50	(11 * 5.5 + 40 * 50)/51 = 40.40	Near normal (40-60)	18.3	Medium uncertainty
11	40	5.5	70	(11 * 5.5 + 40 * 70)/51 = 56.08	Near normal (40-60)	26.52	High uncertainty
11	40	5.5	100	(11 * 5.5 + 40 * 100)/51 = 79.61	High (75-90)	38.86	High uncertainty

21	30	5.5	11	(21 * 5.5 + 30 * 11)/51 = 8.73	Extreme low (<10)	2.70	Low uncertainty
21	30	5.5	20	(21 * 5.5 + 30 * 20)/51 = 14.02	Low (10-25)	7.13	Low uncertainty
21	30	5.5	50	(21 * 5.5 + 30 * 50)/51 = 31.67	Bit low (25-40)	21.90	High uncertainty
21	30	5.5	70	(21 * 5.5 + 30 * 70)/51 = 43.44	Near normal (40-60)	31.74	High uncertainty
21	30	5.5	100	(21 * 5.5 + 30 * 50)/51 = 61.08	Bit high (60-75)	46.50	High uncertainty

36	15	5.5	11	(36 * 5.5 + 15 * 11)/51 = 7.11	Extreme low (<10)	2.50	Low uncertainty
36	15	5.5	20	(36 * 5.5 + 15 * 20)/51 = 9.76	Extreme low (<10)	6.60	Low uncertainty
36	15	5.5	50	(36 * 5.5 + 15 * 50)/51 = 18.58	Low (10-25)	20.27	High uncertainty
36	15	5.5	70	(36 * 5.5 + 15 * 70)/51 = 24.47	Low (10-25)	29.38	High uncertainty
36	15	5.5	100	(36 * 5.5 + 15 * 50)/51 = 33.29	Bit low (25-40)	43.05	High uncertainty

51	0	5.5	NA (no member to rank)	(51 * 5.5 + 0)/51 = 5.5	Extreme low (<10)	0	Low uncertainty

Expand

title	Table with 10% 0-value in climatology. Click here to expand...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	31 (the lowest possible rank for a non-zero member if 1-30 percentiles in the climatology are 0)	(0 * 15.5 + 51 * 31)/51 = 31	Bit low (25-40)	0	Low uncertainty
0	51	NA	50	(0 * 15.5 + 51 * 50)/51 = 50	Near normal (40-60)	0	Low uncertainty
0	51	NA	70	(0 * 15.5 + 51 * 70)/51 = 70	Bit high (60-75)	0	Low uncertainty
0	51	NA	100	(0 * 15.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty

11	40	15.5	31	(11 * 15.5 + 40 * 31)/51 = 27.65	Bit low (25-40)	6.37	Low uncertainty
11	40	15.5	50	(11 * 15.5 + 40 * 50)/51 = 42.55	Near normal (40-60)	14.18	Medium uncertainty
11	40	15.5	70	(11 * 15.5 + 40 * 70)/51 = 58.24	Near normal (40-60)	22.41	High uncertainty
11	40	15.5	100	(11 * 15.5 + 40 * 100)/51 = 81.77	High (75-90)	34.75	High uncertainty

21	30	15.5	31	(21 * 15.5 + 30 * 31)/51 = 24.61	Low (10-25)	7.62	Low uncertainty
21	30	15.5	50	(21 * 15.5 + 30 * 50)/51 = 35.79	Bit low (25-40)	16.97	Medium uncertainty
21	30	15.5	70	(21 * 15.5 + 30 * 70)/51 = 47.55	Near normal (40-60)	26.82	High uncertainty
21	30	15.5	100	(21 * 15.5 + 30 * 100)/51 = 65.20	Bit high (60-75)	41.58	High uncertainty

36	15	15.5	31	(36 * 15.5 + 15 * 31)/51 = 20.05	Low (10-25)	7.06	Low uncertainty
36	15	15.5	50	(36 * 15.5 + 15 * 50)/51 = 25.64	Bit low (25-40)	15.71	Medium uncertainty
36	15	15.5	70	(36 * 15.5 + 15 * 70)/51 = 31.52	Bit low (25-40)	24.83	High uncertainty
36	15	15.5	100	(36 * 15.5 + 15 * 100)/51 = 40.35	Near normal (40-60)	38.50	High uncertainty

51	0	15.5	NA (no member to rank)	(51 * 15.5 + 0)/51 = 15.5	Low (10-25)	0	Low uncertainty

Expand

title	Table with 70% of 0-value in climatology. Click here to expand...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	71 (the lowest possible rank for a non-zero member if 1-70 percentiles in the climatology are 0)	(0 * 35.5 + 51 * 71)/51 = 71	Bit high (60-75)	0	Low uncertainty
0	51	NA	100	(0 * 35.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty
11	40	35.5	71	(11 * 35.5 + 40 * 71)/51 = 63.34	Bit high (60-75)	14.60	Medium uncertainty
11	40	35.5	100	(11 * 35.5 + 40 * 100)/51 = 86.08	High (75-90)	26.52	High uncertainty
21	30	35.5	71	(21 * 35.5 + 30 * 71)/51 = 56.38	Near normal (40-60)	17.47	Medium uncertainty
21	30	35.5	100	(21 * 35.5 + 30 * 100)/51 = 73.44	Bit high (60-75)	31.74	High uncertainty
36	15	35.5	71	(36 * 35.5 + 15 * 71)/51 = 45.94	Near normal (40-60)	16.17	Medium uncertainty
36	15	35.5	100	(36 * 35.5 + 15 * 100)/51 = 54.47	Near normal (40-60)	29.38	High uncertainty
51	0	35.5	NA (no member to rank)	(51 * 35.5 + 0)/51 = 35.5	Bit low (25-40)	0	Low uncertainty

Expand

title	Table with 100% of 0-value in climatology. Click here to expand...

Number of 0 members	Number of non-0 members	Average rank of 0 members	Average rank of non-0 members	Rank-mean	Forecast anomaly category	Rank-std	Uncertainty category
0	51	NA (no member to rank)	100 (the lowest possible rank for a non-zero member if 1-99 percentiles in the climatology are 0)	(0 * 50.5 + 51 * 100)/51 = 100	Extreme high (90<)	0	Low uncertainty
11	40	50.5	100	(11 * 50.5 + 40 * 100)/51 = 89.32	High (75-90)	20.35	High uncertainty
21	30	35.5	100	(21 * 50.5 + 30 * 100)/51 = 79.61	High (75-90)	24.36	High uncertainty
36	15	35.5	100	(36 * 50.5 + 15 * 100)/51 = 65.05	Bit high (60-75)	22.55	High uncertainty
51	0	35.5	NA (no member to rank)	(51 * 50.5 + 0)/51 = 50.5	Near normal (40-60)	0	Low uncertainty

Page tree

Versions Compared

Old Version 56

New Version 57

Key