计算化学公社

Title: Installing the Torque queue management system on Rocky 9.5

Author: kkkil2    Time: 2025-2-8 10:32
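I recently started using Rocky Linux and installed the latest release, 9.5. On this release torque-server, torque-mom and torque-scheduler can be installed with dnf (version 6.1.3), but I could not find detailed instructions on how to configure them afterwards. So I uninstalled them and built Torque by hand, following an installation guide written for Ubuntu (https://www.jianshu.com/p/3076655cca3c).
Step by step: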

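0. Download the source package, then unpack it and change into the source directory:
wget http://wpfilebase.s3.amazonaws.com/torque/torque-6.1.2.tar.gz
tar xzf torque-6.1.2.tar.gz && cd torque-6.1.2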

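1. Install the build dependencies. The package names in the Ubuntu guide (g++, libxml2-dev, zlib1g-dev, libboost-all-dev, libssl-dev) are Debian names; on Rocky the usual equivalents are:
dnf install make gcc gcc-c++ libxml2-devel zlib-devel boost-devel openssl-devel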

2. Configure (replace c8336 with your own hostname, which the hostname command prints):
./configure --prefix=/opt/torque/6.1.2 --with-scp --with-default-server=c8336 --disable-gcc-warnings

3. Edit the file: vi ./src/lib/Libattr/req.cpp
On line 1057, change '\0' to "\0".
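
The edit in step 3 can also be made without opening an editor. The following is a minimal sketch, assuming GNU sed (as shipped with Rocky) and assuming that line 1057 of your copy of req.cpp really contains the '\0' literal; the function name patch_req_cpp is made up for illustration.

```shell
# Hypothetical helper: on one line of FILE, replace the character literal
# '\0' with the string literal "\0". A backup is kept as FILE.orig.
patch_req_cpp() {
    file=$1
    line=${2:-1057}          # line number from step 3; override if your source differs
    cp "$file" "$file.orig"
    # Inside double quotes, \\\\ reaches sed as \\ (one literal backslash).
    sed -i "${line}s/'\\\\0'/\"\\\\0\"/" "$file"
}

# Usage (from the top of the torque-6.1.2 source tree):
#   sed -n 1057p src/lib/Libattr/req.cpp     # inspect the line first
#   patch_req_cpp src/lib/Libattr/req.cpp
```

Restricting the substitution to a single line number keeps other '\0' literals in the file untouched, which is why it is worth printing the line before patching.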

4.make -j64 CFLAGS='-fpermissive'

5.make install

6. Copy the service unit files into the systemd directory: cp contrib/systemd/{pbs_{server,sched,mom},trqauthd}.service /etc/systemd/system/

7. Set the paths:
TORQUE=/opt/torque/6.1.2
echo "TORQUE=$TORQUE" >> /etc/profile
echo "export PATH=\$PATH:$TORQUE/bin:$TORQUE/sbin" >> /etc/profile
source /etc/profile
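
Appending to /etc/profile as in step 7 adds the same lines again every time the commands are rerun. One alternative, sketched here under the assumption that your login shells source /etc/profile.d/*.sh (the stock /etc/profile on Rocky does), is to write the settings to a dedicated file. The file name torque.sh and the function name are hypothetical.

```shell
# Hypothetical helper: write the Torque environment settings to OUTFILE,
# overwriting it, so reruns do not accumulate duplicate lines.
write_torque_profile() {
    prefix=$1                               # e.g. /opt/torque/6.1.2
    outfile=${2:-/etc/profile.d/torque.sh}  # assumed location
    {
        printf 'TORQUE=%s\n' "$prefix"
        printf 'export PATH="$PATH:$TORQUE/bin:$TORQUE/sbin"\n'
    } > "$outfile"
}

# Usage (as root):
#   write_torque_profile /opt/torque/6.1.2
#   . /etc/profile.d/torque.sh
```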

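8. Run the setup script under bash:
chmod +w torque.setup
vi ./torque.setup
# change sh to bash on the first line
sudo ./torque.setup c8336 # replace c8336 with your own hostname
If a warning is shown during the run, it can be ignored; answer y: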
initializing TORQUE (admin: c8336)
Warning: program compiled against libxml 210 using older 209

You have selected to start pbs_server in create mode.
If the server database exists it will be overwritten.
do you wish to continue y/(n)?y
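
The manual edit of torque.setup in step 8 can be scripted as well. This is a minimal sketch, assuming GNU sed and assuming the first line of torque.setup is a shebang ending in /sh (for example #!/bin/sh); if it looks different, the command changes nothing. The function name is made up for illustration.

```shell
# Hypothetical helper: point the script's first line at bash instead of sh.
# Only line 1 is touched, and only if it ends in "/sh".
use_bash_shebang() {
    sed -i '1s|/sh$|/bash|' "$1"
}

# Usage (in the source tree, before running torque.setup):
#   chmod +w torque.setup
#   use_bash_shebang torque.setup
#   head -n 1 torque.setup      # confirm the result
```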

9. Kill the pbs and trqauthd processes:
ps -e | grep trqauthd
ps -e | grep pbs
kill -9 <PID>

10.for i in trqauthd pbs_server pbs_mom pbs_sched; do sudo service $i start; done

11. vi /var/spool/torque/server_priv/nodes
c8336 np=64 // add this line; do not put spaces around the equals sign. c8336 is the hostname, np is the number of CPU cores

12. vi /var/spool/torque/mom_priv/config
$pbsserver c8336 // add these two lines; c8336 is the hostname. MOM config directives begin with $
$logevent 255
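
Steps 11 and 12 can be scripted for a single-machine setup. The sketch below assumes the spool directory is /var/spool/torque as in this thread, takes the core count from nproc, and writes the MOM directives with a leading $ (the form Torque's MOM config normally uses). It overwrites both files rather than appending, and the function name is hypothetical.

```shell
# Hypothetical helper: write the nodes file and the MOM config for a
# single-machine setup. TORQUE_HOME defaults to /var/spool/torque.
write_torque_configs() {
    host=${1:-$(hostname)}
    cores=${2:-$(nproc)}
    home=${TORQUE_HOME:-/var/spool/torque}
    # nodes file: "<hostname> np=<cores>", no spaces around "="
    printf '%s np=%s\n' "$host" "$cores" > "$home/server_priv/nodes"
    # MOM config: directives written with a leading "$" (assumption, see above)
    printf '$pbsserver %s\n$logevent 255\n' "$host" > "$home/mom_priv/config"
}

# Usage (as root):
#   write_torque_configs              # uses the output of hostname and nproc
#   write_torque_configs c8336 64     # or pass the values explicitly
```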

13. Restart the services:
for i in trqauthd pbs_server pbs_mom pbs_sched; do sudo service $i restart; done

14.systemctl enable pbs_server
systemctl start pbs_server
systemctl enable pbs_sched
systemctl start pbs_sched
ps -ef |grep pbs
The pbs processes should now be listed.
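
Step 14 enables only pbs_server and pbs_sched, so pbs_mom and trqauthd would presumably not come up on their own after a reboot. A possible way to cover all four units in one go is sketched below; it assumes the unit names match the files copied in step 6, and the function name and the DRY_RUN switch are made up for illustration.

```shell
# Hypothetical helper: enable and start the four Torque units.
# With DRY_RUN=1 the systemctl commands are printed instead of executed.
enable_torque_units() {
    run=${DRY_RUN:+echo}
    $run systemctl daemon-reload          # pick up the unit files copied in step 6
    for unit in trqauthd pbs_server pbs_sched pbs_mom; do
        $run systemctl enable --now "$unit"
    done
}

# Usage (as root):
#   DRY_RUN=1; enable_torque_units    # preview the commands
#   DRY_RUN=;  enable_torque_units    # actually run them
```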

15. Create the queue:
pbs_server -t create
qmgr -c 'set server scheduling = true'
qmgr -c 'set server keep_completed = 10'
qmgr -c 'set server mom_job_sync = true'
qmgr -c 'create queue batch'
qmgr -c 'set queue batch queue_type = execution'
qmgr -c 'set queue batch started = true'
qmgr -c 'set queue batch enabled = true'
qmgr -c 'set queue batch resources_default.walltime = 1080:00:00'
qmgr -c 'set queue batch resources_default.nodes = 1'
qmgr -c 'set server default_queue = batch'
qmgr -c 'set server scheduling=true'
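
The qmgr settings in step 15 can also be generated for any queue name and fed to qmgr on standard input, which makes them easy to review before applying. This is a minimal sketch: the function name and its two parameters are hypothetical, the defaults mirror the values used in this thread, and the repeated scheduling line is emitted only once.

```shell
# Hypothetical helper: print the qmgr directives from step 15 for one
# execution queue. Nothing is sent to the server until piped into qmgr.
queue_directives() {
    queue=${1:-batch}
    walltime=${2:-1080:00:00}
    cat <<EOF
set server scheduling = true
set server keep_completed = 10
set server mom_job_sync = true
create queue $queue
set queue $queue queue_type = execution
set queue $queue started = true
set queue $queue enabled = true
set queue $queue resources_default.walltime = $walltime
set queue $queue resources_default.nodes = 1
set server default_queue = $queue
EOF
}

# Usage:
#   queue_directives batch 1080:00:00 | qmgr
#   qmgr -c 'print server'    # review the resulting configuration
```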



Author: Santz    Time: 2025-2-8 16:10
Isn't Slurm the nicer choice? torque-6.1.2 probably dates from 2018.




