Warning |
---|
TEMS is to be retired at the end of October 2021. See more information here. |
TEMS is a test platform where users and system administrators can prepare the migration to the final Atos systems.
This means that TEMS is, by nature, evolving over time and may not be as feature-complete as production systems.
Tip |
---|
If you find a problem or a missing feature that you think should be present and that is not listed here, please let us know by reporting it as a "Problem on computing" through the ECMWF Support Portal, mentioning "Atos" in the summary. |
Missing Features
Comprehensive software stack
We have provided a basic software stack that should satisfy most users, but some software packages or libraries you require may not be present. If that is the case, let us know by reporting it as a "Problem on computing" through the ECMWF Support Portal, mentioning "TEMS" in the summary.
The Atos HPCF is not an operational platform yet, and many features or elements may be added gradually as the complete setup is finalised. Here is a list of the known limitations, missing features and issues.
End of job information
A basic report is provided at the end of the job with information about its execution.
No Format |
---|
[ECMWF-INFO -ecepilog] --------------------------------------------------------------------------------------------- |
[ECMWF-INFO -ecepilog] This is the ECMWF job Epilogue. |
[ECMWF-INFO -ecepilog] +++ Please report issues using the Support portal +++ |
[ECMWF-INFO -ecepilog] +++ https://support.ecmwf.int +++ |
[ECMWF-INFO -ecepilog] --------------------------------------------------------------------------------------------- |
[ECMWF-INFO -ecepilog] |
[ECMWF-INFO -ecepilog] Run at 2021-09-28T06:21:25 on aa |
[ECMWF-INFO -ecepilog] Job Name : eci |
[ECMWF-INFO -ecepilog] Job ID : 1009559 |
[ECMWF-INFO -ecepilog] Submitted : 2021-09-28T06:05:23 |
[ECMWF-INFO -ecepilog] Dispatched : 2021-09-28T06:05:23 |
[ECMWF-INFO -ecepilog] Completed : 2021-09-28T06:21:25 |
[ECMWF-INFO -ecepilog] Waiting in the queue : 0.0 |
[ECMWF-INFO -ecepilog] Runtime : 962 |
[ECMWF-INFO -ecepilog] Exit Code : 0:0 |
[ECMWF-INFO -ecepilog] State : COMPLETED |
[ECMWF-INFO -ecepilog] Account : myaccount |
[ECMWF-INFO -ecepilog] Queue : nf |
[ECMWF-INFO -ecepilog] Owner : user |
[ECMWF-INFO -ecepilog] STDOUT : slurm-1009559.out |
[ECMWF-INFO -ecepilog] STDERR : slurm-1009559.out |
[ECMWF-INFO -ecepilog] Nodes : 1 |
[ECMWF-INFO -ecepilog] Logical CPUs : 8 |
[ECMWF-INFO -ecepilog] SBU : 220.722460 units |
[ECMWF-INFO -ecepilog] |
Alternatively, you may use sacct to get some of the statistics from SLURM once the job has finished.
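As an illustration of the sacct route, here is a minimal Python sketch that queries the accounting records of a finished job and turns them into dictionaries. The chosen fields (JobID, JobName, State, Elapsed, ExitCode) and the helper names `build_sacct_command`, `parse_sacct_output` and `job_statistics` are assumptions made for this example, not something prescribed by the platform; adjust the field list to the statistics you need.

```python
import subprocess

# Fields requested from sacct. These are standard sacct format fields,
# but the selection itself is only an example.
FIELDS = ["JobID", "JobName", "State", "Elapsed", "ExitCode"]


def build_sacct_command(job_id):
    """Return the sacct command line for one job as a list of arguments."""
    return [
        "sacct",
        "-j", str(job_id),
        "--format=" + ",".join(FIELDS),
        "--parsable2",  # '|'-separated fields, no trailing delimiter
        "--noheader",   # do not print the header row
    ]


def parse_sacct_output(text):
    """Turn 'sacct --parsable2 --noheader' output into a list of dicts."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip empty lines
        values = line.split("|")
        records.append(dict(zip(FIELDS, values)))
    return records


def job_statistics(job_id):
    """Run sacct for a finished job and return one dict per job step."""
    result = subprocess.run(
        build_sacct_command(job_id),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if sacct fails
    )
    return parse_sacct_output(result.stdout)
```

On a node where sacct is available, `job_statistics(<jobid>)` with your own job ID should return one record for the job itself plus one per job step (for example the batch step). The parsing assumes that job names do not contain the `|` character.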
SSD disks on GPIL nodes
The GPIL nodes have local SSDs with about 950 GB of capacity. These have not been mounted yet, as we need to develop a service model for their use.
Known issues
Intel MKL
...
greater than 19.0.5: performance issues on AMD chips
Recent versions of MKL do not use the AVX2 kernels for certain operations on non-Intel chips, such as the AMD Rome on our HPCF. The consequence is a significant drop in performance.