Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Data format

VBar-delimited format

Data file consists of score values and corresponding metadata in an ASCII format.

The first line contains a tag setting the version of the file format:

#version=1.0

There is currently only one version of the format: 1.0.

Ensuing lines contain data records. Each record has the following format:

centre | model_id | yyyymm | time | forecast_step | station_id | latitude | longitude | station_elevation | model_orography_elevation | parameter |

...

 score | event | sample_size | score_mean_value

...

 

Record format

Data file consists of score values and corresponding metadata in an ASCII format.

Every score value is described by the full set of key attributes, like its parameter, station id, month, step etc. Parameters describing one score value at one station are organised into a record. Each record corresponds to one score value. A record is a collection of pairs key=value separated by commas. A record spans one line. Value of the key which is not given in the current record is inherited from the previous record, except for the value parameter v which has to be present in each record.

Each record has the following format:

centre=centre, model=model_id, d=yyyymm, t=time, st=station_id, lat=latitude, lon=longitude, se=station_elevation, me=model_orography_elevation, par=parameter, sc=score, ev=event, n=sample_size, v=mean_value

If the value is not available the record either should not be given at all or the value should be set to NIL (v=nil); every record must contain the key v (value of key v is not inheritable from previous record).

Parameter keys

...

  • centre (4-characters string) is the WMO identifier of the originating centre (ammc, cwao, ecmf, edzw, egrr, kwbc, lfpw, rjtd, rksl, rums etc);
  • model_id (variable-length stringa string, not containing a comma or vertical bar) is free model identifier assigned by the originating centre (to distinguish between potentially different models provided by the centre);

  • yyyymm is the month of the mean, where yyyy is the year and mm is the month (01-12);
  • time is the validity time (in hours UTC) of the forecasts verified;
  • forecast_step is the length of the forecast (in hours);
  • station_id (a number) is the WMO ID of the observation station verifying the forecasts;
  • latitude is the latitude of the observation station verifying the forecasts;
  • longitude is the longitude of the observation station verifying the forecasts;
  • station_elevation is the elevation of the observation station above the mean sea level in meters;
  • model_orography_elevation is the elevation of the model orography at the observation location;
  • parameter is the verified model output parameter:

...

  • sample_size is number of observations used to compute the monthly mean at the given station;
  • score_mean_value is the value or values of the score mean; in case of contingency table these are the 4 values delimited by comma