Ci-dessous, les différences entre deux révisions de la page.
Les deux révisions précédentes Révision précédente Prochaine révision | Révision précédente Dernière révision Les deux révisions suivantes | ||
developpement:activites:integration:cuda4jessie [2015/11/17 15:42] equemene |
developpement:activites:integration:cuda4jessie [2016/04/21 18:37] equemene [Installation des composants Nvidia] |
||
---|---|---|---|
Ligne 81: | Ligne 81: | ||
Les paquets créés sont les suivants : | Les paquets créés sont les suivants : | ||
<code> | <code> | ||
- | libcuda1_352.55-3_amd64.deb | + | libcuda1_355.11-2_amd64.deb |
- | libegl1-nvidia_352.55-3_amd64.deb | + | libegl1-nvidia_355.11-2_amd64.deb |
- | libgl1-nvidia-glx_352.55-3_amd64.deb | + | libegl-nvidia0_355.11-2_amd64.deb |
- | libgles1-nvidia_352.55-3_amd64.deb | + | libgl1-nvidia-glx_355.11-2_amd64.deb |
- | libgles2-nvidia_352.55-3_amd64.deb | + | libgles1-nvidia_355.11-2_amd64.deb |
- | libnvcuvid1_352.55-3_amd64.deb | + | libgles2-nvidia_355.11-2_amd64.deb |
- | libnvidia-compiler_352.55-3_amd64.deb | + | libglvnd-nvidia_355.11-2_amd64.deb |
- | libnvidia-eglcore_352.55-3_amd64.deb | + | libnvcuvid1_355.11-2_amd64.deb |
- | libnvidia-encode1_352.55-3_amd64.deb | + | libnvidia-compiler_355.11-2_amd64.deb |
- | libnvidia-fbc1_352.55-3_amd64.deb | + | libnvidia-eglcore_355.11-2_amd64.deb |
- | libnvidia-ifr1_352.55-3_amd64.deb | + | libnvidia-encode1_355.11-2_amd64.deb |
- | libnvidia-ml1_352.55-3_amd64.deb | + | libnvidia-fbc1_355.11-2_amd64.deb |
- | nvidia-alternative_352.55-3_amd64.deb | + | libnvidia-ifr1_355.11-2_amd64.deb |
- | nvidia-cuda-mps_352.55-3_amd64.deb | + | libnvidia-ml1_355.11-2_amd64.deb |
- | nvidia-detect_352.55-3_amd64.deb | + | nvidia-alternative_355.11-2_amd64.deb |
- | nvidia-driver_352.55-3_amd64.deb | + | nvidia-cuda-mps_355.11-2_amd64.deb |
- | nvidia-driver-bin_352.55-3_amd64.deb | + | nvidia-detect_355.11-2_amd64.deb |
- | nvidia-kernel-dkms_352.55-3_amd64.deb | + | nvidia-driver_355.11-2_amd64.deb |
- | nvidia-kernel-source_352.55-3_amd64.deb | + | nvidia-driver-bin_355.11-2_amd64.deb |
- | nvidia-kernel-support_352.55-3_amd64.deb | + | nvidia-kernel-dkms_355.11-2_amd64.deb |
- | nvidia-libopencl1_352.55-3_amd64.deb | + | nvidia-kernel-source_355.11-2_amd64.deb |
- | nvidia-opencl-common_352.55-3_amd64.deb | + | nvidia-kernel-support_355.11-2_amd64.deb |
- | nvidia-opencl-icd_352.55-3_amd64.deb | + | nvidia-legacy-check_355.11-2_amd64.deb |
- | nvidia-smi_352.55-3_amd64.deb | + | nvidia-libopencl1_355.11-2_amd64.deb |
- | nvidia-vdpau-driver_352.55-3_amd64.deb | + | nvidia-opencl-common_355.11-2_amd64.deb |
- | xserver-xorg-video-nvidia_352.55-3_amd64.deb | + | nvidia-opencl-icd_355.11-2_amd64.deb |
+ | nvidia-smi_355.11-2_amd64.deb | ||
+ | nvidia-vdpau-driver_355.11-2_amd64.deb | ||
+ | xserver-xorg-video-nvidia_355.11-2_amd64.deb | ||
</code> | </code> | ||
Ligne 119: | Ligne 122: | ||
Les paquets créés sont les suivants : | Les paquets créés sont les suivants : | ||
<code> | <code> | ||
- | nvidia-modprobe_358.09-1_amd64.deb | + | nvidia-modprobe_361.28-1_amd64.deb |
</code> | </code> | ||
Ligne 162: | Ligne 165: | ||
Les paquets créés sont les suivants : | Les paquets créés sont les suivants : | ||
<code> | <code> | ||
- | nvidia-installer-cleanup_20151021+1_amd64.deb | + | nvidia-installer-cleanup_20151021+4_amd64.deb |
- | nvidia-kernel-common_20151021+1_amd64.deb | + | nvidia-kernel-common_20151021+4_amd64.deb |
- | nvidia-support_20151021+1_amd64.deb | + | nvidia-support_20151021+4_amd64.deb |
</code> | </code> | ||
Ligne 183: | Ligne 186: | ||
dpkg -i nvidia-modprobe_*_amd64.deb | dpkg -i nvidia-modprobe_*_amd64.deb | ||
+ | dpkg -i nvidia-legacy-check-*_amd64.deb | ||
dpkg -i nvidia-alternative_*_amd64.deb | dpkg -i nvidia-alternative_*_amd64.deb | ||
Ligne 195: | Ligne 198: | ||
dpkg -i nvidia-kernel-dkms_*_amd64.deb | dpkg -i nvidia-kernel-dkms_*_amd64.deb | ||
- | ls -1 lib*352.55*deb | xargs -I '{}' dpkg -i '{}' | + | ls -1 lib*$(dpkg -l | grep nvidia-kernel-dkms | awk '{ print $3 }')*deb | xargs -I '{}' dpkg -i '{}' |
apt-get -f install | apt-get -f install | ||
Ligne 203: | Ligne 206: | ||
dpkg -i nvidia-vdpau-driver_*_amd64.deb nvidia-driver_*_amd64.deb nvidia-driver-bin_*_amd64.deb | dpkg -i nvidia-vdpau-driver_*_amd64.deb nvidia-driver_*_amd64.deb nvidia-driver-bin_*_amd64.deb | ||
- | ls *352.55*deb | grep -v ^lib | grep -v nvidia-kernel | grep -v libopencl | xargs -I '{}' dpkg -i '{}' | + | ls *$(dpkg -l | grep nvidia-kernel-dkms | awk '{ print $3 }')*deb | grep -v ^lib | grep -v nvidia-kernel | grep -v libopencl | xargs -I '{}' dpkg -i '{}' |
dpkg -i nvidia-driver-bin_*deb nvidia-driver_*deb nvidia-xconfig* nvidia-settings* libxnvctrl* nvidia-smi_* | dpkg -i nvidia-driver-bin_*deb nvidia-driver_*deb nvidia-xconfig* nvidia-settings* libxnvctrl* nvidia-smi_* | ||
Ligne 301: | Ligne 304: | ||
wget http://snapshot.debian.org/archive/debian/20150710T034220Z/pool/contrib/p/pycuda/pycuda_2015.1.2-1.dsc | wget http://snapshot.debian.org/archive/debian/20150710T034220Z/pool/contrib/p/pycuda/pycuda_2015.1.2-1.dsc | ||
wget http://snapshot.debian.org/archive/debian/20150710T034220Z/pool/contrib/p/pycuda/pycuda_2015.1.2.orig.tar.xz | wget http://snapshot.debian.org/archive/debian/20150710T034220Z/pool/contrib/p/pycuda/pycuda_2015.1.2.orig.tar.xz | ||
- | tar Jxf ../pycuda_2015.1.2-1.debian.tar.xz | + | tar Jxf pycuda_2015.1.2.orig.tar.xz |
cd pycuda-*/ | cd pycuda-*/ | ||
tar Jxf ../pycuda_2015.1.2-1.debian.tar.xz | tar Jxf ../pycuda_2015.1.2-1.debian.tar.xz | ||
Ligne 337: | Ligne 340: | ||
==== Exécution des exemples ==== | ==== Exécution des exemples ==== | ||
- | <code> | + | Sur une machine équipée de 3 cartes vidéo et 3 implémentations d'OpenCL pour CPU (AMD, Intel, PortableCL) |
+ | |||
+ | <code> | ||
python /usr/share/doc/python-pyopencl-doc/examples/benchmark.py | python /usr/share/doc/python-pyopencl-doc/examples/benchmark.py | ||
</code> | </code> | ||
+ | <code> | ||
+ | =============================================================== | ||
+ | Platform name: AMD Accelerated Parallel Processing | ||
+ | Platform profile: FULL_PROFILE | ||
+ | Platform vendor: Advanced Micro Devices, Inc. | ||
+ | Platform version: OpenCL 2.0 AMD-APP (1800.11) | ||
+ | --------------------------------------------------------------- | ||
+ | Device name: Fiji | ||
+ | Device type: GPU | ||
+ | Device memory: 4045 MB | ||
+ | Device max clock speed: 1000 MHz | ||
+ | Device compute units: 64 | ||
+ | Device max work group size: 256 | ||
+ | Device max work item sizes: [256, 256, 256] | ||
+ | Data points: 8388608 | ||
+ | Workers: 256 | ||
+ | Preferred work group size multiple: 64 | ||
+ | Execution time of test: 0.00037168 s | ||
+ | Results OK | ||
+ | =============================================================== | ||
+ | Platform name: AMD Accelerated Parallel Processing | ||
+ | Platform profile: FULL_PROFILE | ||
+ | Platform vendor: Advanced Micro Devices, Inc. | ||
+ | Platform version: OpenCL 2.0 AMD-APP (1800.11) | ||
+ | --------------------------------------------------------------- | ||
+ | Device name: Intel(R) Xeon(R) CPU E5-2665 0 @ 2.40GHz | ||
+ | Device type: CPU | ||
+ | Device memory: 128966 MB | ||
+ | Device max clock speed: 2309 MHz | ||
+ | Device compute units: 32 | ||
+ | Device max work group size: 1024 | ||
+ | Device max work item sizes: [1024, 1024, 1024] | ||
+ | Data points: 8388608 | ||
+ | Workers: 256 | ||
+ | Preferred work group size multiple: 1 | ||
+ | Execution time of test: 0.0192504 s | ||
+ | Results OK | ||
+ | =============================================================== | ||
+ | Platform name: Intel(R) OpenCL | ||
+ | Platform profile: FULL_PROFILE | ||
+ | Platform vendor: Intel(R) Corporation | ||
+ | Platform version: OpenCL 1.2 LINUX | ||
+ | --------------------------------------------------------------- | ||
+ | Device name: Intel(R) Xeon(R) CPU E5-2665 0 @ 2.40GHz | ||
+ | Device type: CPU | ||
+ | Device memory: 128966 MB | ||
+ | Device max clock speed: 2400 MHz | ||
+ | Device compute units: 32 | ||
+ | Device max work group size: 8192 | ||
+ | Device max work item sizes: [8192, 8192, 8192] | ||
+ | /usr/lib/python2.7/dist-packages/pyopencl/__init__.py:63: CompilerWarning: Non-empty compiler output encountered. Set the environment variable PYOPENCL_COMPILER_OUTPUT=1 to see more. | ||
+ | "to see more.", CompilerWarning) | ||
+ | Data points: 8388608 | ||
+ | Workers: 256 | ||
+ | Preferred work group size multiple: 128 | ||
+ | Execution time of test: 0.00310517 s | ||
+ | Results OK | ||
+ | =============================================================== | ||
+ | Platform name: Portable Computing Language | ||
+ | Platform profile: FULL_PROFILE | ||
+ | Platform vendor: The pocl project | ||
+ | Platform version: OpenCL 1.2 pocl 0.10 | ||
+ | --------------------------------------------------------------- | ||
+ | Device name: pthread-Intel(R) Xeon(R) CPU E5-2665 0 @ 2.40GHz | ||
+ | Device type: CPU | ||
+ | Device memory: 128966 MB | ||
+ | Device max clock speed: 3100 MHz | ||
+ | Device compute units: 32 | ||
+ | Device max work group size: 1024 | ||
+ | Device max work item sizes: [1024, 1024, 1024] | ||
+ | Data points: 8388608 | ||
+ | Workers: 256 | ||
+ | Preferred work group size multiple: 8 | ||
+ | Execution time of test: 0.007638 s | ||
+ | Results OK | ||
+ | =============================================================== | ||
+ | Platform name: NVIDIA CUDA | ||
+ | Platform profile: FULL_PROFILE | ||
+ | Platform vendor: NVIDIA Corporation | ||
+ | Platform version: OpenCL 1.2 CUDA 7.5.20 | ||
+ | --------------------------------------------------------------- | ||
+ | Device name: GeForce GTX 980 Ti | ||
+ | Device type: GPU | ||
+ | Device memory: 6143 MB | ||
+ | Device max clock speed: 1190 MHz | ||
+ | Device compute units: 22 | ||
+ | Device max work group size: 1024 | ||
+ | Device max work item sizes: [1024, 1024, 64] | ||
+ | Data points: 8388608 | ||
+ | Workers: 256 | ||
+ | Preferred work group size multiple: 32 | ||
+ | Execution time of test: 0.000522592 s | ||
+ | Results OK | ||
+ | =============================================================== | ||
+ | Platform name: NVIDIA CUDA | ||
+ | Platform profile: FULL_PROFILE | ||
+ | Platform vendor: NVIDIA Corporation | ||
+ | Platform version: OpenCL 1.2 CUDA 7.5.20 | ||
+ | --------------------------------------------------------------- | ||
+ | Device name: Quadro 600 | ||
+ | Device type: GPU | ||
+ | Device memory: 1023 MB | ||
+ | Device max clock speed: 1280 MHz | ||
+ | Device compute units: 2 | ||
+ | Device max work group size: 1024 | ||
+ | Device max work item sizes: [1024, 1024, 64] | ||
+ | Data points: 8388608 | ||
+ | Workers: 256 | ||
+ | Preferred work group size multiple: 32 | ||
+ | Execution time of test: 0.00468445 s | ||
+ | Results OK | ||
+ | </code> | ||
- | Vous aurez noté que l'installation préalable d'un SDK AMD a permis de disposer d'un support OpenCL sur le processeur. | ||