计算化学公社 (Computational Chemistry Commune)
Title:
GPU acceleration problem: the GPU is detected but does not run
Author:
gongyq
Time:
2022-3-19 16:18
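A question for the experts here: my GPU is an NVIDIA Quadro K4000 with 768 CUDA cores. I looked up its compute capability for GROMACS GPU support and it is 3, which should just barely meet the requirement. Why does GROMACS report the GPU as detected but non-functional when I run it?
My cmake configure command was: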
cmake .. -DGMX_BUILD_OWN_FFTW=ON -DREGRESSIONTEST_DOWNLOAD=ON -DGMX_GPU=CUDA -DCUDA_TOOLKIT_ROOT_DIR=/usr/local/cuda -DGMX_USE_OPENCL=on -DCMAKE_INSTALL_PREFIX=/usr/local/gromacs
The mdrun log is as follows:
Command line:
gmx mdrun -deffnm md_0_1 -v -pin on
GROMACS version: 2021.3
Verified release checksum is c5bf577cc74de0e05106b7b6426476abb7f6530be7b4a2c64f637d6a6eca8fcb
Precision: mixed
Memory model: 64 bit
MPI library: thread_mpi
OpenMP support: enabled (GMX_OPENMP_MAX_THREADS = 64)
GPU support: CUDA
SIMD instructions: AVX2_256
FFT library: fftw-3.3.8-sse2-avx-avx2-avx2_128
RDTSCP usage: enabled
TNG support: enabled
Hwloc support: disabled
Tracing support: disabled
C compiler: /usr/bin/cc GNU 9.3.0
C compiler flags: -mavx2 -mfma -Wno-missing-field-initializers -fexcess-precision=fast -funroll-all-loops -O3 -DNDEBUG
C++ compiler: /usr/bin/c++ GNU 9.3.0
C++ compiler flags: -mavx2 -mfma -Wno-missing-field-initializers -fexcess-precision=fast -funroll-all-loops -fopenmp -O3 -DNDEBUG
CUDA compiler: /usr/local/cuda/bin/nvcc nvcc: NVIDIA (R) Cuda compiler driver;Copyright (c) 2005-2022 NVIDIA Corporation;Built on Thu_Feb_10_18:23:41_PST_2022;Cuda compilation tools, release 11.6, V11.6.112;Build cuda_11.6.r11.6/compiler.30978841_0
CUDA compiler flags:-std=c++17;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_37,code=sm_37;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_52,code=sm_52;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_80,code=sm_80;-Wno-deprecated-gpu-targets;-gencode;arch=compute_86,code=sm_86;-gencode;arch=compute_35,code=compute_35;-gencode;arch=compute_53,code=compute_53;-gencode;arch=compute_80,code=compute_80;-use_fast_math;-D_FORCE_INLINES;-mavx2 -mfma -Wno-missing-field-initializers -fexcess-precision=fast -funroll-all-loops -fopenmp -O3 -DNDEBUG
CUDA driver: 11.40
CUDA runtime: 11.60
Running on 1 node with total 24 cores, 48 logical cores, 0 compatible GPUs
Hardware detected:
CPU info:
Vendor: Intel
Brand: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz
Family: 6 Model: 63 Stepping: 2
Features: aes apic avx avx2 clfsh cmov cx8 cx16 f16c fma htt intel lahf mmx msr nonstop_tsc pcid pclmuldq pdcm pdpe1gb popcnt pse rdrnd rdtscp sse2 sse3 sse4.1 sse4.2 ssse3 tdt x2apic
Hardware topology: Basic
Sockets, cores, and logical processors:
Socket 0: [ 0 24] [ 1 25] [ 2 26] [ 3 27] [ 4 28] [ 5 29] [ 6 30] [ 7 31] [ 8 32] [ 9 33] [ 10 34] [ 11 35]
Socket 1: [ 12 36] [ 13 37] [ 14 38] [ 15 39] [ 16 40] [ 17 41] [ 18 42] [ 19 43] [ 20 44] [ 21 45] [ 22 46] [ 23 47]
GPU info:
Number of GPUs detected: 1
#0: N/A, stat: non-functional
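One thing the log itself allows checking is whether the build contains GPU code for a device of the stated compute capability. The sketch below is illustrative and not part of the original post. It reads the `-gencode` entries from the "CUDA compiler flags" line and compares the lowest compiled target with the device. `DEVICE_CC`, `compiled_targets`, and `may_run_on` are hypothetical names, and treating the poster's "3" as compute capability 3.0 is an assumption.

```python
import re
from typing import List

# -gencode entries copied from the "CUDA compiler flags" line of the log above.
CUDA_FLAGS = (
    "-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_37,code=sm_37;"
    "-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_52,code=sm_52;"
    "-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;"
    "-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;"
    "-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_86,code=sm_86;"
    "-gencode;arch=compute_35,code=compute_35;-gencode;arch=compute_53,code=compute_53;"
    "-gencode;arch=compute_80,code=compute_80"
)

# Assumption: the poster's "compute capability 3" means 3.0, written here as 30.
DEVICE_CC = 30


def compiled_targets(flags: str) -> List[int]:
    """Return the sorted, de-duplicated targets (e.g. 35 for sm_35) named in code=... entries."""
    return sorted({int(cc) for cc in re.findall(r"code=(?:sm|compute)_(\d+)", flags)})


def may_run_on(device_cc: int, targets: List[int]) -> bool:
    """Necessary condition only: some compiled target must not exceed the device's capability."""
    return bool(targets) and min(targets) <= device_cc


targets = compiled_targets(CUDA_FLAGS)
print("lowest compiled target:", targets[0])
print("device may be usable:", may_run_on(DEVICE_CC, targets))
```

If the check reports that no compiled target is at or below the device's capability, the binary carries no code the card could execute, which would be one plausible reason for a "non-functional" status independent of the driver.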
Author:
lyj714
Time:
2022-3-19 21:45
Update the graphics driver.
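The suggestion is consistent with one detail in the posted log: the reported CUDA driver version (11.40) is lower than the CUDA runtime version (11.60), and a driver older than the runtime can prevent the runtime from initializing a device. A minimal sketch of that comparison follows. It is not from the thread, `cuda_versions` is a hypothetical helper, and the excerpt contains only the two relevant log lines.

```python
import re
from typing import Dict, Tuple

# The two version lines copied from the mdrun log in the first post.
LOG_EXCERPT = """\
CUDA driver: 11.40
CUDA runtime: 11.60
"""


def cuda_versions(log_text: str) -> Dict[str, Tuple[int, int]]:
    """Map 'driver' and 'runtime' to (major, minor) tuples parsed from the log text."""
    found = {}
    pattern = r"CUDA (driver|runtime):\s*(\d+)\.(\d+)"
    for kind, major, minor in re.findall(pattern, log_text):
        found[kind] = (int(major), int(minor))
    return found


versions = cuda_versions(LOG_EXCERPT)
# Tuples compare element by element, so (11, 40) < (11, 60) holds.
driver_older = versions["driver"] < versions["runtime"]
print("driver:", versions["driver"], "runtime:", versions["runtime"])
print("driver older than runtime:", driver_older)
```

A driver update addresses only this mismatch. Whether the card is then usable also depends on the architectures the build was compiled for.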