Commit 8aeb812

bradbeam authored and kulkarnisamr committed
fix: Backport slurmctld segfault patch
Signed-off-by: Brad Beam <[email protected]>
1 parent 2511e8c commit 8aeb812

2 files changed

+17
-0
lines changed

Lines changed: 12 additions & 0 deletions
@@ -0,0 +1,12 @@
+diff --git a/src/slurmctld/job_mgr.c b/src/slurmctld/job_mgr.c
+index ec5cbfa81a..fc46c38956 100644
+--- a/src/slurmctld/job_mgr.c
++++ b/src/slurmctld/job_mgr.c
+@@ -16289,6 +16289,7 @@ extern bool job_epilog_complete(uint32_t job_id, char *node_name,
+ 	 * really started. Very rare obviously.
+ 	 */
+ 	if ((IS_JOB_PENDING(job_ptr) && (!IS_JOB_COMPLETING(job_ptr))) ||
++	    ((!job_ptr->node_bitmap_cg) && (!IS_JOB_COMPLETING(job_ptr))) ||
+ 	    (job_ptr->node_bitmap == NULL)) {
+ #ifndef HAVE_FRONT_END
+ 		uint32_t base_state = NODE_STATE_UNKNOWN;

images/patches/slurm/README.md

Lines changed: 5 additions & 0 deletions
@@ -204,3 +204,8 @@ Notes from `pthead_detch` man
 In SLURM when a job fails due to not being able to meet the segment size requirements, the reason is `FAIL_BAD_CONSTRAINTS`. When a job is in this state, it is set to priority = 0, which is a held state. The scheduler will skip evaluating the job on future runs.
 
 This patch is to change it so that jobs that fail for unmet segment size requirements to not hold the job. So that if there are topology changes to the cluster, that can satisfy the job requirements, the job can still schedule. This will set the job reason to `Reason=Resources` instead of `Reason=BadConstraints`.
+
+### 0024-slurmctld-segfault-epilog.patch
+
+This patch backports the following commits to address an issue with slurmctld segfaults
+[SlurmCommit](https://github.com/SchedMD/slurm/commit/3d9f9647b416c47a238600e08b2887e4b8b93c2e)
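
Reviewer note: the backported change adds one clause to the early-exit condition in `job_epilog_complete()`. The sketch below models only the shape of that condition so the affected case is easy to see. Everything in it is a hypothetical stand-in: `struct mock_job`, the `MOCK_*` constants, and the helper functions are invented for illustration and are not Slurm's real types or macros. That the extra clause keeps later code from running with a NULL `node_bitmap_cg` is an inference from the patch title, not something the diff states.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration; NOT Slurm's real definitions. */
enum {
    MOCK_PENDING    = 0x0000,
    MOCK_RUNNING    = 0x0001,
    MOCK_STATE_BASE = 0x00ff,
    MOCK_COMPLETING = 0x8000
};

struct mock_job {
    unsigned state;       /* base state plus optional COMPLETING flag */
    void *node_bitmap;    /* nodes allocated to the job */
    void *node_bitmap_cg; /* nodes still completing */
};

static bool is_pending(const struct mock_job *j)
{
    return (j->state & MOCK_STATE_BASE) == MOCK_PENDING;
}

static bool is_completing(const struct mock_job *j)
{
    return (j->state & MOCK_COMPLETING) != 0;
}

/*
 * Same boolean shape as the patched condition: returns true when the
 * early branch would be taken. The middle clause is the one the patch adds.
 */
static bool takes_early_path(const struct mock_job *j)
{
    return (is_pending(j) && !is_completing(j)) ||
           (!j->node_bitmap_cg && !is_completing(j)) ||
           (j->node_bitmap == NULL);
}
```

In this model, a job that is running, has `node_bitmap` set, has no `node_bitmap_cg`, and is not completing takes the early branch only because of the added clause. The other two clauses alone would let it fall through.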
