Prepare SLURM scripts for Hmsc-HPC model fitting
Source:R/mod_SLURM.R
, R/mod_SLURM_refit.R
Mod_SLURM.Rd
The mod_SLURM
function generates SLURM job submission scripts for fitting
Hmsc-HPC models in an HPC environment. Additionally, mod_SLURM_refit
creates SLURM scripts for refitting models that failed or were not previously
fitted.
Usage
mod_SLURM(
model_dir = NULL,
job_name = NULL,
cat_job_info = TRUE,
ntasks = 1L,
cpus_per_task = 1L,
gpus_per_node = 1L,
memory_per_cpu = "64G",
job_runtime = NULL,
HPC_partition = "small-g",
env_file = ".env",
path_Hmsc = NULL,
command_prefix = "Commands2Fit",
SLURM_prefix = "Bash_Fit",
SLURM_path_out = NULL
)
mod_SLURM_refit(
model_dir = NULL,
n_array_jobs = 210L,
job_name = NULL,
memory_per_cpu = "64G",
job_runtime = NULL,
HPC_partition = "small-g",
env_file = ".env",
cat_job_info = TRUE,
ntasks = 1L,
cpus_per_task = 1L,
gpus_per_node = 1L,
SLURM_prepare = TRUE,
path_Hmsc = NULL,
refit_prefix = "Commands2Refit",
SLURM_prefix = "Bash_Refit"
)
Arguments
- model_dir
Character. Path to the root directory of the fitted model.
- job_name
Character. Name of the submitted job(s).
- cat_job_info
Logical. If
TRUE
, additional bash commands are included to print job-related information. Default:TRUE
.- ntasks
Integer. Number of tasks to allocate for the job (
#SBATCH --ntasks
). Default: 1.- cpus_per_task
Integer. Number of CPU cores allocated per task (
#SBATCH --cpus-per-task
). Default: 1.- gpus_per_node
Integer. Number of GPUs requested per node (
#SBATCH --gpus-per-node
). Default: 1.- memory_per_cpu
Character. Memory allocation per CPU core. Example: "32G" for 32 gigabytes. Defaults to "64G".
- job_runtime
Character. Maximum allowed runtime for the job. Example: "01:00:00" for one hour. Required — if not provided, the function throws an error.
- HPC_partition
Character. Name of the SLURM partition to submit the job to. Default: "small-g", for running the array jobs on the GPU.
- env_file
Character. Path to the environment file containing paths to data sources. Defaults to
.env
.- path_Hmsc
Character. Path to the Hmsc-HPC installation.
- command_prefix
Character.Prefix for the bash commands used in job execution. Default: "
Commands2Fit
".- SLURM_prefix
Character. Prefix for the generated SLURM script filenames.
- SLURM_path_out
Character. Directory where SLURM script(s) will be saved. If
NULL
(default), the function derives the path frommodel_dir
.- n_array_jobs
Integer. Number of jobs per SLURM script file. In LUMI HPC, there is a limit of 210 submitted jobs per user for the
small-g
partition. This argument is used to split the jobs into multiple SLURM scripts if needed. Default: 210. See LUMI documentation for more details.- SLURM_prepare
Logical. Whether to prepare SLURM command files. If
TRUE
(default), the SLURM commands will be saved to disk using the mod_SLURM function.- refit_prefix
Character. Prefix for files containing commands to refit failed or incomplete models.