Skip to content

SLURM install guide: slurmd won't start #93

@wlandau

Description

@wlandau

I am using the Debian section of your SLURM install guide to install SLURM on a single-node Ubuntu 16.04 desktop. In my case, the master and worker nodes are the same. I do not have a cluster, but I want to run some small examples and fix some code.

I am stuck at sudo /etc/init.d/slurmd start. The log shows

[2017-10-29T21:57:31.074] slurmctld version 15.08.7 started on cluster cluster              
[2017-10-29T21:57:31.075] layouts: no layout to initialize                                  
[2017-10-29T21:57:31.075] fatal: Frontend not configured correctly in slurm.conf.  See man slurm.conf look for frontendname.   

slurmd -C shows:

ClusterName=(null) NodeName=Haggunenon CPUs=4 Boards=1 SocketsPerBoard=1 CoresPerSocket=4 ThreadsPerCore=1 RealMemory=16004 TmpDisk=223645
UpTime=0-00:29:17

I also submitted posts to Stack Overflow:

And here is my /etc/slurm-llnl/slurm.conf. I generated it from /usr/share/doc/slurmctld/slurm-wlm-configurator.html, supplying the information from slurmd -C. I also changed the user name to wlandau (my own user name) and set ControlMachine and NodeName to Haggunenon (the hostname).

# slurm.conf file generated by configurator easy.html.
# Put this file on all nodes of your cluster.
# See the slurm.conf man page for more information.
#
ControlMachine=Haggunenon
#ControlAddr=
#
#MailProg=/bin/mail
MpiDefault=none
#MpiParams=ports=#-#
ProctrackType=proctrack/pgid
ReturnToService=1
SlurmctldPidFile=/var/run/slurm-llnl/slurmctld.pid
#SlurmctldPort=6817
SlurmdPidFile=/var/run/slurm-llnl/slurmd.pid
#SlurmdPort=6818
SlurmdSpoolDir=/var/lib/slurm-llnl/slurmd
SlurmUser=wlandau
#SlurmdUser=root
StateSaveLocation=/var/lib/slurm-llnl/slurmctld
SwitchType=switch/none
TaskPlugin=task/none
#
#
# TIMERS
#KillWait=30
#MinJobAge=300
#SlurmctldTimeout=120
#SlurmdTimeout=300
#
#
# SCHEDULING
FastSchedule=1
SchedulerType=sched/backfill
#SchedulerPort=7321
SelectType=select/linear
#
#
# LOGGING AND ACCOUNTING
AccountingStorageType=accounting_storage/none
ClusterName=cluster
#JobAcctGatherFrequency=30
JobAcctGatherType=jobacct_gather/none
#SlurmctldDebug=3
SlurmctldLogFile=/var/log/slurm-llnl/slurmctld.log
#SlurmdDebug=3
SlurmdLogFile=/var/log/slurm-llnl/slurmd.log
#
#
# COMPUTE NODES
NodeName=Haggunenon CPUs=4 RealMemory=16004 CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN
PartitionName=DEFAULT Nodes=Haggunenon Default=YES MaxTime=INFINITE State=UP

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions