|
|
本帖最后由 牧生 于 2026-5-6 16:52 编辑
我跑一个任务,报MPI的问题,但我无法解决。
此外,不知道为何,我的9950X3D,架构被自动认作intel的Haswell
第一个
apptainer exec cp2k-mkl_2025.2-experimental.sif mpirun -np 1 cp2k.psmp -i OPT.inp -o OPT.out
SIRIUS 7.9.0, git hash: https://api.github.com/repos/ele ... git/ref/tags/v7.9.0
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
Proc: [[5169,1],0]
Errorcode: 1
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
prterun has exited due to process rank 0 with PID 0 on node jing calling
"abort". This may have caused other processes in the application to be
terminated by signals sent by prterun (as reported here).
--------------------------------------------------------------------------
第二个
apptainer exec cp2k-rocm_2026.1-gfx942.sif mpirun -np 6 cp2k.psmp -i OPT.inp -o OPT.out
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Inconsistency in warp sizes: Cuda/Hip indicates warp size = 64, while the gpu_properties files indicates warp_size = 32.
Inconsistency in warp sizes: Cuda/Hip indicates warp size = 64, while the gpu_properties files indicates warp_size = 32.
Inconsistency in warp sizes: Cuda/Hip indicates warp size = 64, while the gpu_properties files indicates warp_size = 32.
Inconsistency in warp sizes: Cuda/Hip indicates warp size = 64, while the gpu_properties files indicates warp_size = 32.
Inconsistency in warp sizes: Cuda/Hip indicates warp size = 64, while the gpu_properties files indicates warp_size = 32.
Inconsistency in warp sizes: Cuda/Hip indicates warp size = 64, while the gpu_properties files indicates warp_size = 32.
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
Proc: [[46349,1],0]
Errorcode: 1
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
prterun has exited due to process rank 0 with PID 0 on node jing calling
"abort". This may have caused other processes in the application to be
terminated by signals sent by prterun (as reported here).
--------------------------------------------------------------------------
第三个apptainer exec cp2k-opensource_2025.2-force-avx512.sif mpirun -np 6 cp2k.psmp -i OPT.inp -o OPT.out
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
Core not found: ZEN4
Core: Haswell
SIRIUS 7.9.0, git hash: https://api.github.com/repos/ele ... git/ref/tags/v7.9.0
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
Proc: [[33067,1],0]
Errorcode: 1
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
prterun has exited due to process rank 0 with PID 0 on node jing calling
"abort". This may have caused other processes in the application to be
terminated by signals sent by prterun (as reported here).
--------------------------------------------------------------------------
|
|