计算化学公社

 找回密码 Forget password
 注册 Register
Views: 11408|回复 Reply: 2
打印 Print 上一主题 Last thread 下一主题 Next thread

[ORCA] ORCA集群上运行BSUB脚本求助

[复制链接 Copy URL]

47

帖子

0

威望

478

eV
积分
525

Level 4 (黑子)

跳转到指定楼层 Go to specific reply
楼主
已经安装好ORCA和Openmpi,可以运行计算。但是输出文件中总会出现如下的内容:Failed to register memory region (MR):

Hostname: n0106
Address:  e92a5000
Length:   4194304
Error:    Cannot allocate memory
--------------------------------------------------------------------------
--------------------------------------------------------------------------
Open MPI has detected that there are UD-capable Verbs devices on your
system, but none of them were able to be setup properly.  This may
indicate a problem on this system.

You job will continue, but Open MPI will ignore the "ud" oob component
in this run.

Hostname: n0106


这样的过程很耗时,浪费了计算的时间。
有没有前辈知道该怎么解决啊?
脚本内容如下:
#!/bin/bash
#BSUB -J NO2
#BSUB -n 44
#BSUB -q 1080Ti


INFILE="NO2.inp"
export LD_LIBRARY_PATH=/home/ceph/shangyl/ORCA/OPENMI/openmpi-2.0.2/openmpi/lib/openmpi:$LD_LIBRARY_PATH
export LD_LIBRARY_PATH=/home/ceph/shangyl/ORCA/OPENMI/openmpi-2.0.2/openmpi/lib:$LD_LIBRARY_PATH
export PATH=/home/ceph/shangyl/ORCA/OPENMI/openmpi-2.0.2/openmpi/bin:$PATH
export PATH=/home/ceph/shangyl/ORCA/ORCA/orca_4_0_0_linux_x86-64:$PATH
export ORCA_EXEC=/home/ceph/shangyl/ORCA/ORCA/orca_4_0_0_linux_x86-64/orca



#====================   Do NOT revise any lines if you do not know their meanings    =====================================================================
#=========================================================================================================================================================
#BSUB -o %J.out
#BSUB -e %J.err
export OMP_NUM_THREADS=12



CURDIR=$PWD
rm -rf $CURDIR/nodelist.$LSB_JOBID >& /dev/null

for i in `echo $LSB_HOSTS`
do
        echo $i >> $CURDIR/nodelist.$LSB_JOBID
done

sed -i "s@n@n@g" $CURDIR/nodelist.$LSB_JOBID

NPROCS=`cat $CURDIR/nodelist.$LSB_JOBID|wc -l`

uniq $CURDIR/nodelist.$LSB_JOBID > $CURDIR/nodelist-tmp.$LSB_JOBID
for i in `cat $CURDIR/nodelist-tmp.$LSB_JOBID`
do
        CORES=`cat $CURDIR/nodelist.$LSB_JOBID|grep $i|wc -l`
        echo "$i" >> $CURDIR/nodelist-tmp2.$LSB_JOBID
done
mv $CURDIR/nodelist-tmp2.$LSB_JOBID $CURDIR/nodelist.$LSB_JOBID
rm -rf $CURDIR/nodelist-tmp.$LSB_JOBID

#cp nodelist.$LSB_JOBID  "$CURDIR/${INFILE:0:${#INFILE}-4}.nodes"

$ORCA_EXEC $INFILE &>$LSB_JOBNAME.out

rm -rf $CURDIR/nodelist.$LSB_JOBID

2

帖子

0

威望

87

eV
积分
89

Level 2 能力者

2#
发表于 Post on 2021-4-27 22:59:22 | 只看该作者 Only view this author
遇到了相同额问题,请问楼主当时是怎么解决的

2425

帖子

1

威望

6196

eV
积分
8641

Level 6 (一方通行)

3#
发表于 Post on 2021-4-27 23:21:13 | 只看该作者 Only view this author
cuifl 发表于 2021-4-27 22:59
遇到了相同额问题,请问楼主当时是怎么解决的

https://users.open-mpi.narkive.c ... emory-openmpi-2-0-2

Put “oob=tcp” in your default MCA param file
High-Performance Computing for You
为您专属定制的高性能计算解决方案

更多讯息,请访问:
https://labitc.top
http://tophpc.top:8080
电邮: ask@hpc4you.top

本版积分规则 Credits rule

手机版 Mobile version|北京科音自然科学研究中心 Beijing Kein Research Center for Natural Sciences|京公网安备 11010502035419号|计算化学公社 — 北京科音旗下高水平计算化学交流论坛 ( 京ICP备14038949号-1 )|网站地图

GMT+8, 2026-2-20 21:47 , Processed in 0.180182 second(s), 26 queries , Gzip On.

快速回复 返回顶部 返回列表 Return to list