SLURM HPC Plugin#
https://slurm.schedmd.com/documentation.html
This HPC environment can be selected via --execenv=hpc.slurm. In this HPC
environment, SIERRA will run experiments spread across multiple nodes allocated
by the SLURM scheduler. The following table describes the SLURM-SIERRA
interface. Some SLURM environment variables are used by SIERRA to configure
experiments during stage 1,2; if they are not defined SIERRA will throw an
error.
Environment variable |
SIERRA context |
Command line override |
|---|---|---|
Used to transfer environment variables into the GNU parallel environment. |
N/A |
|
Used to set the shell used by GNU parallel to execute all commands
in. Overwritten by SIERRA to |
N/A |
|
Exported by SIERRA via |
N/A |
|
Exported by SIERRA via |
N/A |
|
Exported by SIERRA via |
N/A |
|
Used to set # threads per experimental node for each allocated compute node. |
N/A |
|
Used to set # parallel jobs per allocated compute node. |
||
Obtaining the list of nodes allocated to a job which SIERRA can direct GNU parallel to use for experiments. |
N/A |
|
Creating the UUID nodelist file passed to GNU parallel, guaranteeing no collisions (i.e., simultaneous SIERRA invocations sharing allocated nodes if multiple jobs are started from the same directory). |
N/A |