This update for slurm and pdsh fixes the following issues:
slurm was updated to version 24.11.1 using package slurm2411:
Security issues fixed:
Important remarks:
slurmdbd
(Slurm DataBase Daemon) you must update this first.slurmdbd
will work with Slurm daemons of version 23.02 and above.
You will not need to update all clusters at the same time, but it is very
important to update slurmdbd
first and having it running before updating
any other clusters making use of it.Highlights of changes:
/run/slurmdbd
(bsc#1236928)AccountUtilizationByUser
is able to be filtered by QOS.slurmd -C
switch/hpe_slingshot
.slurmctld
: Changed incoming RPC handling to dedicated thread pool with
asynchronous handling of I/O that can be configured via conmgr_*
entries
under SlurmctldParameters
in slurm.conf
.Configuration File Changes (see appropriate man page for details)
SchedulerParameters=bf_allow_magnetic_slot
option. It allows jobs in
magnetic reservations to be planned by backfill scheduler.TopologyParam=TopoMaxSizeUnroll=#
to allow --nodes=<min>-<max>
for
topology/block
.DataParserParameters
slurm.conf
parameter to allow setting default
value for CLI --json
and --yaml
arguments.switch/hpe_slingshot
now requires enable_stepmgr
.slurm.conf
under
SlurmctldParameters
:conmgr_max_connections
: Defaults to 150 connections.conmgr_threads
: Defaults to 64 threads for slurmctld.conmgr_use_poll
: Defaults is to use epoll in Linux.conmgr_connect_timeout
: Defaults to MessageTimeout
.conmgr_read_timeout
: Defaults to MessageTimeout
.conmgr_wait_write_delay
: Defaults to MessageTimeout
.conmgr_write_timeout
: Defaults to MessageTimeout. SlurmctldParamters=ignore_constraint_validation
to ignore
constraint/feature
validation at submission.SchedulerParameters=bf_topopt_enable
option to enable experimental hook
to control backfill.Command Changes (see man pages for details):
--cpu-bind=rank
.'%b'
as a file name pattern for the array task id modulo 10.sacct
: Respect --noheader
for --batch-script
and --env-vars
.sacctmgr ping
command to query status of slurmdbd
.sbcast
: Add --nodelist
option to specify where files are transmitted tosbcast
: Add --no-allocation
option to transmit files to nodes outside
of a job allocation.slurmdbd
: Add -u
option. This is used to determine if restarting the DBD
will result in database conversion.salloc --get-user-env
.scontrol
: Add --json
/--yaml
support to listpids
.scontrol
: Add liststeps
.scontrol
: Add listjobs
.scontrol show topo
: Show aggregated block sizes when using topology/block.API Changes:
burst_buffer/lua
call slurm.job_info_to_string()
.job_submit/lua
: Add assoc_qos
attribute to job_desc
to display all
potential QOS's for a job's association.job_submit/lua
: Add slurm.get_qos_priority()
function to retrieve the
given QOS's priority.SLURMRESTD Changes:
data_parser/v0.0.42+prefer_refs
flag to default.data_parser/v0.0.42+minimize_refs
flag to inline single referenced
schemas in the OpenAPI schema to get default behavior of
data_parser/v0.0.41
.JOB_INFO
field minimum_switches
to required_switches
to reflect the actual behavior.ACCOUNT_CONDITION
field assocation
to association
(typo).slurmdb/v0.0.42/jobs pid
field deprecated.For details on the changes in this version update, consult Slurm 24.11 changelog
pdsh was updated from version 2.34 to 2.35:
-d
option to log errorspdsh-slurm_24_11
compatible with Slurm 24.11{ "binaries": [ { "slurm_24_11-config": "24.11.1-3.5.3", "slurm_24_11-auth-none": "24.11.1-3.5.3", "pdsh-genders": "2.35-7.44.1", "pdsh-slurm_22_05": "2.35-7.44.1", "slurm_24_11-sql": "24.11.1-3.5.3", "pdsh-machines": "2.35-7.44.1", "pdsh-dshgroup": "2.35-7.44.1", "slurm_24_11-slurmdbd": "24.11.1-3.5.3", "slurm_24_11-config-man": "24.11.1-3.5.3", "slurm_24_11-lua": "24.11.1-3.5.3", "slurm_24_11-cray": "24.11.1-3.5.3", "libnss_slurm2_24_11": "24.11.1-3.5.3", "perl-slurm_24_11": "24.11.1-3.5.3", "libpmi0_24_11": "24.11.1-3.5.3", "pdsh-slurm_24_11": "2.35-7.44.4", "slurm_24_11-torque": "24.11.1-3.5.3", "slurm_24_11-pam_slurm": "24.11.1-3.5.3", "slurm_24_11-webdoc": "24.11.1-3.5.3", "pdsh-netgroup": "2.35-7.44.1", "pdsh": "2.35-7.44.1", "slurm_24_11-doc": "24.11.1-3.5.3", "slurm_24_11-munge": "24.11.1-3.5.3", "pdsh-slurm_23_02": "2.35-7.44.1", "slurm_24_11-sview": "24.11.1-3.5.3", "libslurm42": "24.11.1-3.5.3", "slurm_24_11-plugins": "24.11.1-3.5.3", "slurm_24_11": "24.11.1-3.5.3", "pdsh-slurm_20_11": "2.35-7.44.1", "slurm_24_11-node": "24.11.1-3.5.3", "slurm_24_11-devel": "24.11.1-3.5.3" } ] }
{ "binaries": [ { "slurm_24_11-config": "24.11.1-3.5.3", "slurm_24_11-auth-none": "24.11.1-3.5.3", "pdsh-genders": "2.35-7.44.1", "pdsh-slurm_22_05": "2.35-7.44.1", "slurm_24_11-sql": "24.11.1-3.5.3", "pdsh-machines": "2.35-7.44.1", "pdsh-dshgroup": "2.35-7.44.1", "slurm_24_11-slurmdbd": "24.11.1-3.5.3", "slurm_24_11-config-man": "24.11.1-3.5.3", "slurm_24_11-lua": "24.11.1-3.5.3", "slurm_24_11-cray": "24.11.1-3.5.3", "libnss_slurm2_24_11": "24.11.1-3.5.3", "perl-slurm_24_11": "24.11.1-3.5.3", "libpmi0_24_11": "24.11.1-3.5.3", "pdsh-slurm_24_11": "2.35-7.44.4", "slurm_24_11-torque": "24.11.1-3.5.3", "slurm_24_11-pam_slurm": "24.11.1-3.5.3", "slurm_24_11-webdoc": "24.11.1-3.5.3", "pdsh-netgroup": "2.35-7.44.1", "pdsh": "2.35-7.44.1", "slurm_24_11-doc": "24.11.1-3.5.3", "slurm_24_11-munge": "24.11.1-3.5.3", "pdsh-slurm_23_02": "2.35-7.44.1", "slurm_24_11-sview": "24.11.1-3.5.3", "libslurm42": "24.11.1-3.5.3", "slurm_24_11-plugins": "24.11.1-3.5.3", "slurm_24_11": "24.11.1-3.5.3", "pdsh-slurm_20_11": "2.35-7.44.1", "slurm_24_11-node": "24.11.1-3.5.3", "slurm_24_11-devel": "24.11.1-3.5.3" } ] }
{ "binaries": [ { "slurm_24_11-config": "24.11.1-3.5.3", "slurm_24_11-auth-none": "24.11.1-3.5.3", "pdsh-genders": "2.35-7.44.1", "pdsh-slurm_22_05": "2.35-7.44.1", "slurm_24_11-sql": "24.11.1-3.5.3", "pdsh-machines": "2.35-7.44.1", "pdsh-dshgroup": "2.35-7.44.1", "slurm_24_11-slurmdbd": "24.11.1-3.5.3", "slurm_24_11-config-man": "24.11.1-3.5.3", "slurm_24_11-lua": "24.11.1-3.5.3", "slurm_24_11-cray": "24.11.1-3.5.3", "libnss_slurm2_24_11": "24.11.1-3.5.3", "perl-slurm_24_11": "24.11.1-3.5.3", "libpmi0_24_11": "24.11.1-3.5.3", "pdsh-slurm_24_11": "2.35-7.44.4", "slurm_24_11-torque": "24.11.1-3.5.3", "slurm_24_11-pam_slurm": "24.11.1-3.5.3", "slurm_24_11-webdoc": "24.11.1-3.5.3", "pdsh-netgroup": "2.35-7.44.1", "pdsh": "2.35-7.44.1", "slurm_24_11-doc": "24.11.1-3.5.3", "slurm_24_11-munge": "24.11.1-3.5.3", "pdsh-slurm_23_02": "2.35-7.44.1", "slurm_24_11-sview": "24.11.1-3.5.3", "libslurm42": "24.11.1-3.5.3", "slurm_24_11-plugins": "24.11.1-3.5.3", "slurm_24_11": "24.11.1-3.5.3", "pdsh-slurm_20_11": "2.35-7.44.1", "slurm_24_11-node": "24.11.1-3.5.3", "slurm_24_11-devel": "24.11.1-3.5.3" } ] }
{ "binaries": [ { "slurm_24_11-config": "24.11.1-3.5.3", "slurm_24_11-auth-none": "24.11.1-3.5.3", "pdsh-genders": "2.35-7.44.1", "pdsh-slurm_22_05": "2.35-7.44.1", "slurm_24_11-sql": "24.11.1-3.5.3", "pdsh-machines": "2.35-7.44.1", "pdsh-dshgroup": "2.35-7.44.1", "slurm_24_11-slurmdbd": "24.11.1-3.5.3", "slurm_24_11-config-man": "24.11.1-3.5.3", "slurm_24_11-lua": "24.11.1-3.5.3", "slurm_24_11-cray": "24.11.1-3.5.3", "libnss_slurm2_24_11": "24.11.1-3.5.3", "perl-slurm_24_11": "24.11.1-3.5.3", "libpmi0_24_11": "24.11.1-3.5.3", "pdsh-slurm_24_11": "2.35-7.44.4", "slurm_24_11-torque": "24.11.1-3.5.3", "slurm_24_11-pam_slurm": "24.11.1-3.5.3", "slurm_24_11-webdoc": "24.11.1-3.5.3", "pdsh-netgroup": "2.35-7.44.1", "pdsh": "2.35-7.44.1", "slurm_24_11-doc": "24.11.1-3.5.3", "slurm_24_11-munge": "24.11.1-3.5.3", "pdsh-slurm_23_02": "2.35-7.44.1", "slurm_24_11-sview": "24.11.1-3.5.3", "libslurm42": "24.11.1-3.5.3", "slurm_24_11-plugins": "24.11.1-3.5.3", "slurm_24_11": "24.11.1-3.5.3", "pdsh-slurm_20_11": "2.35-7.44.1", "slurm_24_11-node": "24.11.1-3.5.3", "slurm_24_11-devel": "24.11.1-3.5.3" } ] }
{ "binaries": [ { "slurm_24_11-config": "24.11.1-3.5.3", "slurm_24_11-auth-none": "24.11.1-3.5.3", "pdsh-genders": "2.35-7.44.1", "pdsh-slurm_22_05": "2.35-7.44.1", "slurm_24_11-sql": "24.11.1-3.5.3", "pdsh-machines": "2.35-7.44.1", "pdsh-dshgroup": "2.35-7.44.1", "slurm_24_11-slurmdbd": "24.11.1-3.5.3", "slurm_24_11-config-man": "24.11.1-3.5.3", "slurm_24_11-lua": "24.11.1-3.5.3", "slurm_24_11-cray": "24.11.1-3.5.3", "libnss_slurm2_24_11": "24.11.1-3.5.3", "perl-slurm_24_11": "24.11.1-3.5.3", "libpmi0_24_11": "24.11.1-3.5.3", "pdsh-slurm_24_11": "2.35-7.44.4", "slurm_24_11-torque": "24.11.1-3.5.3", "slurm_24_11-pam_slurm": "24.11.1-3.5.3", "slurm_24_11-webdoc": "24.11.1-3.5.3", "pdsh-netgroup": "2.35-7.44.1", "pdsh": "2.35-7.44.1", "slurm_24_11-doc": "24.11.1-3.5.3", "slurm_24_11-munge": "24.11.1-3.5.3", "pdsh-slurm_23_02": "2.35-7.44.1", "slurm_24_11-sview": "24.11.1-3.5.3", "libslurm42": "24.11.1-3.5.3", "slurm_24_11-plugins": "24.11.1-3.5.3", "slurm_24_11": "24.11.1-3.5.3", "pdsh-slurm_20_11": "2.35-7.44.1", "slurm_24_11-node": "24.11.1-3.5.3", "slurm_24_11-devel": "24.11.1-3.5.3" } ] }
{ "binaries": [ { "slurm_24_11-config": "24.11.1-3.5.3", "slurm_24_11-auth-none": "24.11.1-3.5.3", "pdsh-genders": "2.35-7.44.1", "pdsh-slurm_22_05": "2.35-7.44.1", "slurm_24_11-sql": "24.11.1-3.5.3", "pdsh-machines": "2.35-7.44.1", "pdsh-dshgroup": "2.35-7.44.1", "slurm_24_11-slurmdbd": "24.11.1-3.5.3", "slurm_24_11-config-man": "24.11.1-3.5.3", "slurm_24_11-lua": "24.11.1-3.5.3", "slurm_24_11-cray": "24.11.1-3.5.3", "libnss_slurm2_24_11": "24.11.1-3.5.3", "perl-slurm_24_11": "24.11.1-3.5.3", "libpmi0_24_11": "24.11.1-3.5.3", "pdsh-slurm_24_11": "2.35-7.44.4", "slurm_24_11-torque": "24.11.1-3.5.3", "slurm_24_11-pam_slurm": "24.11.1-3.5.3", "slurm_24_11-webdoc": "24.11.1-3.5.3", "pdsh-netgroup": "2.35-7.44.1", "pdsh": "2.35-7.44.1", "slurm_24_11-doc": "24.11.1-3.5.3", "slurm_24_11-munge": "24.11.1-3.5.3", "pdsh-slurm_23_02": "2.35-7.44.1", "slurm_24_11-sview": "24.11.1-3.5.3", "libslurm42": "24.11.1-3.5.3", "slurm_24_11-plugins": "24.11.1-3.5.3", "slurm_24_11": "24.11.1-3.5.3", "pdsh-slurm_20_11": "2.35-7.44.1", "slurm_24_11-node": "24.11.1-3.5.3", "slurm_24_11-devel": "24.11.1-3.5.3" } ] }