|
请问下楼主,我用您的测试集中的体系B在课题组服务器上跑,最高也只能跑91.220 ns/day,与您的最高性能300多ns/day差距较大是为啥?而且我的“-update gpu -bonded gpu”要比“-update gpu”速度快些(前者91.220 ns/day,后者76.52791.220 ns/day)。而且-ntomp=1要比-ntomp>1(8,12,16等)都快,请问下我的问题出在哪?谢谢楼主
我的机器参数:
CPU:AMD EPYC 7402 24-Core Processor
GPU: A100*8
gmx版本信息:
- gmx -version
- GROMACS version: 2022.6
- Precision: mixed
- Memory model: 64 bit
- MPI library: thread_mpi
- OpenMP support: enabled (GMX_OPENMP_MAX_THREADS = 128)
- GPU support: CUDA
- SIMD instructions: AVX2_256
- CPU FFT library: fftw-3.3.8-sse2-avx
- GPU FFT library: cuFFT
- RDTSCP usage: enabled
- TNG support: enabled
- Hwloc support: disabled
- Tracing support: disabled
- C compiler: /usr/bin/cc GNU 9.4.0
- C compiler flags: -mavx2 -mfma -pthread -Wno-missing-field-initializers -fexcess-precision=fast -funroll-all-loops -O3 -DNDEBUG
- C++ compiler: /usr/bin/c++ GNU 9.4.0
- C++ compiler flags: -mavx2 -mfma -pthread -Wno-missing-field-initializers -fexcess-precision=fast -funroll-all-loops -fopenmp -O3 -DNDEBUG
- CUDA compiler: /data/soft/cuda-sdk/12.1/bin/nvcc nvcc: NVIDIA (R) Cuda compiler driver;Copyright (c) 2005-2023 NVIDIA Corporation;Built on Tue_Feb__7_19:32:13_PST_2023;Cuda compilation tools, release 12.1, V12.1.66;Build cuda_12.1.r12.1/compiler.32415258_0
- CUDA compiler flags:-std=c++17;--generate-code=arch=compute_50,code=sm_50;--generate-code=arch=compute_52,code=sm_52;--generate-code=arch=compute_60,code=sm_60;--generate-code=arch=compute_61,code=sm_61;--generate-code=arch=compute_70,code=sm_70;--generate-code=arch=compute_75,code=sm_75;--generate-code=arch=compute_80,code=sm_80;--generate-code=arch=compute_86,code=sm_86;--generate-code=arch=compute_89,code=sm_89;--generate-code=arch=compute_90,code=sm_90;-Wno-deprecated-gpu-targets;--generate-code=arch=compute_53,code=sm_53;--generate-code=arch=compute_80,code=sm_80;-use_fast_math;-D_FORCE_INLINES;-mavx2 -mfma -pthread -Wno-missing-field-initializers -fexcess-precision=fast -funroll-all-loops -fopenmp -O3 -DNDEBUG
- CUDA driver: 12.20
- CUDA runtime: 12.10
复制代码
提交脚本:
- #!/bin/bash
- #An example.
- #SBATCH -J wyd-test
- #SBATCH -p normal # 使用指定的队列
- #SBATCH --qos=normalqos # 使用normal队列对应的QoS
- #SBATCH --gres=gpu:1 # 使用的GPU卡数
- gmx mdrun -pin on -ntmpi 1 -ntomp 1 -notunepme -bonded gpu -update gpu -v -deffnm B
复制代码
|
|