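I thought I had configured things so that one job occupies only one CPU thread, but scontrol show node | grep CPU
still reports CPUAlloc=2. I have tried adjusting both SelectType=select/cons_tres and SelectTypeParameters=CR_Core_Memory, but I still haven't figured it out. Could someone more experienced please take a look?
I seem to have fallen into a hole and can't climb out on my own.
The script is as follows: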
#!/bin/bash
#SBATCH --job-name=cpu-stress
#SBATCH -N 1
#SBATCH --ntasks=1
#SBATCH --ntasks-per-core=1
#SBATCH --cpus-per-task=1
stress -c 1 -t 30s
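While the script above is running, the CPUAlloc value can be pulled out of the scontrol show node output directly instead of being read by eye. The following is only a minimal sketch: the function name node_cpu_alloc is hypothetical, and it assumes nothing more than the usual Key=Value layout of scontrol output.

```shell
# Hypothetical helper: read `scontrol show node` text on stdin and
# print the value of every CPUAlloc= field, one per line.
node_cpu_alloc() {
    grep -o 'CPUAlloc=[0-9]*' | cut -d= -f2
}

# Usage on a live cluster (requires Slurm, so it is left commented out here):
#   scontrol show node ictest99 | node_cpu_alloc
```

A result of 2 while a single one-CPU job is running would reproduce the symptom described in this post.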
——————————————————————————
Test results are as follows: if --ntasks is not specified, the command runs twice; if it is specified, it runs only once.
root@ictest99:/usr/local/etc# srun --cpus-per-task=1 hostname
ictest99
ictest99
root@ictest99:/usr/local/etc# srun --cpus-per-task=1 --ntasks=1 hostname
ictest99
root@ictest99:/usr/local/etc# srun --ntasks-per-core=1 hostname
ictest99
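The doubled output of the first command matches what the srun documentation describes: when --cpus-per-task is given without --ntasks, srun starts as many tasks per node as the allocation can hold. Below is a minimal sketch of that arithmetic, assuming the allocation contains 2 CPUs (as NumCPUs=2 in the logs suggests). The name implied_tasks is hypothetical and is not a Slurm command.

```shell
# Hypothetical helper: number of tasks srun would start on one node when only
# --cpus-per-task is given, modeled as (allocated CPUs) / (CPUs per task).
#   $1 = CPUs in the allocation, $2 = value of --cpus-per-task
implied_tasks() {
    echo $(( $1 / $2 ))
}

implied_tasks 2 1   # 2 allocated CPUs with --cpus-per-task=1 gives 2 tasks
```

With --ntasks=1 given explicitly, the task count is fixed at 1 regardless of how many CPUs the allocation holds, which agrees with the second command printing the hostname once.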
Here are the logs:
yq@ictest99:~$ srun -J test --ntasks-per-core=1 hostname
ictest99
scontrol show job 237
JobId=237 JobName=test
UserId=yq(1000) GroupId=yq(1000) MCS_label=N/A
Priority=1 Nice=0 Account=(null) QOS=normal
JobState=COMPLETED Reason=None Dependency=(null)
Requeue=1 Restarts=0 BatchFlag=0 Reboot=0 ExitCode=0:0
RunTime=00:00:00 TimeLimit=UNLIMITED TimeMin=N/A
SubmitTime=2024-03-08T10:50:14 EligibleTime=2024-03-08T10:50:14
AccrueTime=Unknown
StartTime=2024-03-08T10:50:14 EndTime=2024-03-08T10:50:14 Deadline=N/A
SuspendTime=None SecsPreSuspend=0 LastSchedEval=2024-03-08T10:50:14 Scheduler=Main
Partition=cpu AllocNode:Sid=localhost:692604
ReqNodeList=(null) ExcNodeList=(null)
NodeList=ictest99
BatchHost=ictest99
NumNodes=1 NumCPUs=2 NumTasks=1 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
ReqTRES=cpu=1,mem=63000M,node=1,billing=1
AllocTRES=cpu=2,node=1,billing=2
Socks/Node=* NtasksPerN:B:S:C=0:0:*:1 CoreSpec=*
MinCPUsNode=1 MinMemoryNode=0 MinTmpDiskNode=0
Features=(null) DelayBoot=00:00:00
OverSubscribe=OK Contiguous=0 Licenses=(null) Network=(null)
Command=hostname
WorkDir=/home/yq
Power=
——————————————————————————————————————
srun -J test02 --cpus-per-task=1 --ntasks=1 hostname
ictest99
scontrol show job 239
JobId=239 JobName=test02
UserId=yq(1000) GroupId=yq(1000) MCS_label=N/A
Priority=1 Nice=0 Account=(null) QOS=normal
JobState=COMPLETED Reason=None Dependency=(null)
Requeue=1 Restarts=0 BatchFlag=0 Reboot=0 ExitCode=0:0
RunTime=00:00:00 TimeLimit=UNLIMITED TimeMin=N/A
SubmitTime=2024-03-08T10:51:44 EligibleTime=2024-03-08T10:51:44
AccrueTime=Unknown
StartTime=2024-03-08T10:51:44 EndTime=2024-03-08T10:51:44 Deadline=N/A
SuspendTime=None SecsPreSuspend=0 LastSchedEval=2024-03-08T10:51:44 Scheduler=Main
Partition=cpu AllocNode:Sid=localhost:692604
ReqNodeList=(null) ExcNodeList=(null)
NodeList=ictest99
BatchHost=ictest99
NumNodes=1 NumCPUs=2 NumTasks=1 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
ReqTRES=cpu=1,mem=63000M,node=1,billing=1
AllocTRES=cpu=2,node=1,billing=2
Socks/Node=* NtasksPerN:B:S:C=0:0:*:* CoreSpec=*
MinCPUsNode=1 MinMemoryNode=0 MinTmpDiskNode=0
Features=(null) DelayBoot=00:00:00
OverSubscribe=OK Contiguous=0 Licenses=(null) Network=(null)
Command=hostname
WorkDir=/home/yq
Power=
TresPerTask=cpu:1
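In both job records, ReqTRES asks for cpu=1 while AllocTRES grants cpu=2. Below is a minimal sketch for extracting those two numbers from scontrol show job output so they can be compared at a glance. The helper name tres_cpu is hypothetical, and the parsing assumes the Key=Value layout seen in the logs above.

```shell
# Hypothetical helper: print the cpu count of the TRES field named in $1
# (ReqTRES or AllocTRES), reading `scontrol show job` text on stdin.
tres_cpu() {
    sed -n "s/.*$1=cpu=\([0-9][0-9]*\).*/\1/p"
}

# Sample lines copied from job 239 in this post.
job_text='ReqTRES=cpu=1,mem=63000M,node=1,billing=1
AllocTRES=cpu=2,node=1,billing=2'

req=$(printf '%s\n' "$job_text" | tres_cpu ReqTRES)
alloc=$(printf '%s\n' "$job_text" | tres_cpu AllocTRES)
echo "requested=$req allocated=$alloc"

# On a live cluster: scontrol show job 239 | tres_cpu AllocTRES
```

If the node is defined with two hardware threads per core (an assumption, since the node definition is in the attached configuration and not shown here), then allocating a whole core under CR_Core_Memory would account for the second CPU.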
If any further tests are needed, I can post the results at any time. Thanks, everyone.
Attachment: 集群.7z (7.13 KB), cluster configuration