The filesystems available are HOME, PERM, HPCPERM and SCRATCH, and are completely isolated from those in other ECMWF platforms in Reading such as ECGATE or the Cray HPCF.

Filesystems from those platforms are not cross-mounted either. This means that if you need to use data from another ECMWF platform such as ECGATE or the Cray HPCF, you will need to so transfer it first using scp or rsync. See HPC2020: File transfers for more information.

File System

Suitable for ...

TechnologyFeaturesQuota
HOMEpermanent files, e. g. profile, utilities, sources, libraries, etc. NFS

It is backed up.

Snapshots available. See HPC2020: Recovering data from snapshots

Throttled I/O bandwidth from parallel compute nodes (less performance)


10GB

PERMpermanent files without the need for automated backups, smaller input files for serial or small processing, etc.

NFS

No backup

Snapshots available.  See HPC2020: Recovering data from snapshots

Throttled I/O bandwidth from parallel compute nodes (less performance)

DO NOT USE IN PARALLEL APPLICATIONS

DO NOT USE FOR JOB STANDARD OUTPUT/ERROR

500 GB

HPCPERMpermanent files without the need for automated backups,  bigger input files for parallel model runs, climate files, std output, etc.Lustre

No Backup

No snapshots

No automatic deletion

  • 100 GB for users without HPC access
  • 1 TB for users with HPC access
SCRATCHall temporary (large) files. Main storage for your jobs and experiments input and output files.Lustre

Automatic deletion after 30 days of last access - implemented since 27 March 2023

No snapshots

No backup

  • 50 TB for users with HPC access
  • 2 TB for users without HPC access

SCRATCHDIR

Big temporary data for an individual session or job, not as fast as TMPDIR but higher capacity. Files accessible from all cluster.Lustre

Deleted at the end of session or job

Created per session/ job as a subdirectory in SCRATCH

part of SCRATCH quota

TMPDIR

Fast temporary data for an individual session or job, small files only. Local to every node.

SSD on shared (GPIL) nodes 

(*f QoSs)

Deleted at the end of session or job

Created per session/ job


3 GB per session/job by default.

Customisable up to 40 GB with 

--gres=ssdtmp:<size>G


RAM on exclusive parallel compute nodes

(*p QoSs)

no limit (maximum memory of the node)

Environment variables

Those filesystems can be conveniently referenced from your session and scripts using the environment variables of the same name: $HOME, $PERM, $HPCPERM$SCRATCH, $SCRATCHDIR and $TMPDIR.

$TEMP, which in the past was an alias to $SCRATCH, has been deprecated and is no longer defined. Please use $SCRATCH instead

More capacity for $TMPDIR?

When running on the shared GPIL nodes (*f QoSs), you may request a bigger space in the SSD-backed TMPDIR with the extra SBATCH option:

#SBATCH --gres=ssdtmp:<size>G

With <size> being a number up to 40 GB. If that is still not enough for you, you may point your TMPDIR to SCRATCHDIR:

export TMPDIR=$SCRATCHDIR


How much have I used?

You can check your current usage and limits with the "quota" command.

Filesystem structure

You will notice your filesystems have now a flat name structure. If you port any scripts or code that had paths to filesystems hardcoded from older platforms, please make sure you update them. Where possible, try and use the environment variables provided, which should work on both sides pointing to the right location in each case:

BeforeAfter
/home/group/user or /home/ms/group/user/home/user
/perm/group/user or /perm/ms/group/user/perm/user
-/hpcperm/user
/scratch/group/user or /scratch/ms/group/user/scratch/user

Special directories on RAM

Some special directories are not disk-based but actually mapped into the node's main memory. There are no limits set on exclusive nodes running on parallel queues. However, when running on shared nodes (GPILs) on fractional or interactive queues, this could lead to a single application or user exhausting all the memory of the node and thus impacting others. This is why the following limits are set:

DirectoryNew limit
/tmp428 GB (80% of available memory) per user's session
/var/tmp428 GB (80% of available memory) per user's session
/dev/shm428 GB (80% of available memory) per user's session
$XDG_RUNTIME_DIR64 MB per user's session

Users should instead use the general purpose file systems available, and in particular, $TMPDIR or $SCRATCHDIR for temporary storage per session or job.

Automatic cleanup

Any data left on those spaces will be automatically deleted at the end of the job or session.

Special Filesystems

See hereafter a list of the specialised filesystems on Atos:

DirectoryContentComment
/ec/vol/msbackupBackup of the Conventional observations for the four days.
One file per synoptical cycle.
Available both on ecs and hpc.

2 Comments

  1. It's interesting I can write far more to /var/tmp RAM than to $TMPDIR SSD. If I'm on a shared node, retrieving GRIB from MARS and processing it to create a much smaller permanent file, which do you suggest I use for the temporary GRIB storage? It sounds like it would be fastest to use /var/tmp but would it be inconsiderate use of RAM?

    1. Luke Jones , I would encourage to use the $TMPDIR if sizes are small. As stated above, when running on the shared GPIL nodes (*f QoSs), you may request a bigger space in the SSD-backed TMPDIR with the extra SBATCH option:

      #SBATCH --gres=ssdtmp:<size>G

      With <size> being a number up to 40 GB. If that is still not enough for you, you may point your TMPDIR to SCRATCHDIR:

      export TMPDIR=$SCRATCHDIR

      We do not recommend using those special directories on RAM because of the impact they may have on others sharing the node with you.