I want to find out how much memory my jobs are using on a cluster that uses the SLURM scheduler. When I run the sacct command, the output does not include any information about memory usage. The man page for sacct lists a long and somewhat confusing array of options, and it is hard to tell which ones I need.
ANSWER: It’s useful to know that SLURM reports memory usage as RSS (resident set size), the portion of a process's memory that is held in RAM. The sacct man page lists four RSS fields that you can request with the --format option:
AveRSS – Average resident set size of all tasks in job
MaxRSS – Maximum resident set size of all tasks in job
MaxRSSNode – The node on which the maxrss occurred
MaxRSSTask – The task ID where the maxrss occurred
For example,
sacct --format="AveRSS,MaxRSS,MaxRSSNode"
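This displays the average and maximum memory footprint of the tasks in each of your jobs, along with the node on which each maximum occurred. Note that by default sacct lists your jobs from midnight of the current day onward, not only the ones that are currently running; add -j followed by a job ID to look at one specific job. The RSS values for a job step are generally only filled in once that step has finished, so for a step that is still running the sstat command is usually the better tool.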
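The output is easier to read if each value is tied to a job step and shown in one unit. The following is a minimal sketch rather than an official recipe: rss_to_mib and job_max_rss are hypothetical helper names, and the unit handling assumes that sacct suffixes are binary multiples (K = 1024 bytes) and that a bare number is a byte count, which you may want to check against your site's SLURM version.

```shell
#!/bin/sh
# Convert a sacct memory value such as "2048K", "512M" or "1.5G" to MiB.
# Assumption: K/M/G/T are binary multiples and a bare number is in bytes.
rss_to_mib() {
    printf '%s\n' "$1" | LC_ALL=C awk '
        {
            value = $1 + 0                               # leading numeric part
            unit  = toupper(substr($1, length($1), 1))   # last character
            if      (unit == "K") mib = value / 1024
            else if (unit == "M") mib = value
            else if (unit == "G") mib = value * 1024
            else if (unit == "T") mib = value * 1024 * 1024
            else                  mib = value / (1024 * 1024)
            printf "%.1f\n", mib
        }'
}

# Print "<step> <MaxRSS> MiB" for every step of one job that has a MaxRSS.
# -n drops the header; -P gives pipe-delimited output that is easy to split.
job_max_rss() {
    sacct -j "$1" -n -P --format=JobID,MaxRSS |
    while IFS='|' read -r step rss; do
        if [ -n "$rss" ]; then
            printf '%s %s MiB\n' "$step" "$(rss_to_mib "$rss")"
        fi
    done
}
```

After sourcing these functions, job_max_rss followed by a job ID prints one line per job step. Lines with an empty MaxRSS (typically the overall job allocation line) are skipped.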