Oscar Slurm Runaway Jobs

Please see below for details on the current Slurm runaway jobs issue on Oscar.

What’s the issue?

  • Slurm jobs are not being cleared from the queue after they complete (runaway jobs). This means that even if your previous job has finished, slurmctld will still show it as being in a RUNNING state
  • This affects users who have too many runaway jobs: their new jobs will remain pending in the queue due to QOS limits
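
To check whether you may be affected, you can list the jobs Slurm still reports as RUNNING for your account and compare them with the jobs you know have finished. The following is a minimal sketch, not an official CCV tool. It assumes the standard squeue command is available on the login node, and the helper names parse_squeue and list_running_jobs are made up for this illustration.

```python
import subprocess


def parse_squeue(text):
    """Parse 'jobid|state|elapsed' lines into a list of tuples."""
    jobs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        job_id, state, elapsed = line.split("|", 2)
        jobs.append((job_id, state, elapsed))
    return jobs


def list_running_jobs(user):
    """Ask Slurm which of the user's jobs it currently considers RUNNING.

    Flags used: -u (user), -t (state filter), -h (no header),
    -o (output format: %i job ID, %T state, %M time used).
    """
    result = subprocess.run(
        ["squeue", "-u", user, "-t", "RUNNING", "-h", "-o", "%i|%T|%M"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_squeue(result.stdout)
```

Calling list_running_jobs with your username returns one (job ID, state, elapsed time) tuple per job. A job in that list that you know has already finished is likely a runaway job. Please include its job ID when you contact support.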

What’s the cause?

  • We are working to identify the root cause of this problem and will send an update soon

Please email support@ccv.brown.edu with any issues, questions, or concerns.

Sincerely,
CCV User Services

Oscar Users,

This issue is now resolved.

It was caused by a bug in the current Slurm version. The bug is fixed in the new version of Slurm that we will upgrade to during the winter maintenance.

We apologize for the inconvenience.

Sincerely,
CCV User Services