mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2011-12-30, 23:11   #661
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts
Default

Quote:
Originally Posted by flashjh View Post
4.11 build is slower than 1.2b for me. I had to install CUDA 4.0 to get the lastest .dll files.

Anyone know why the newer one is slower or something I can do to make it faster? Anyone know how to get the ETA back in 4.11?
Quote:
Originally Posted by Brain View Post
As CUDALucas does auto-resume from checkpoint files we should recommend not using "-c" any more, do we? I will have to update the GPU guide in the new year...

By the way, the iteration times are so low as I didn't do complete 10000 runs. Kind of a bug.

Last but not least, utilisation for state-of-the-art expos (40M range) is 97% as before. Low utilisation is understandable for small FFT sizes...
I have not tried 1.3, but 1.4.1 is about 2x slower for me than 1.2b. I dropped the -c and -t and only use -D01 for GPU 2. I'm still learning... Is it better to use 1.4.1 to get the non-power-of-2-fft-sizes or use 1.2b to optimuze speed? Thanks.
flashjh is offline   Reply With Quote
Old 2011-12-31, 09:27   #662
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

331 Posts
Default

Quote:
Originally Posted by Brain View Post
As CUDALucas does auto-resume from checkpoint files we should recommend not using "-c" any more, do we? I will have to update the GPU guide in the new year...
Stupid me: If we don't use -c there are no checkpoints written. So we have to use it. I will have time to test more in the new year but currently I recommend using 1.2(b)!

DLL files download

Last fiddled with by Brain on 2011-12-31 at 09:30 Reason: DLL files download
Brain is offline   Reply With Quote
Old 2011-12-31, 12:44   #663
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

331 Posts
Smile From "does hardly use any CPU resources" to "takes almost a full CPU core"

Hi,
although nobody has yet confirmed that CL >= 1.4 uses more CPU resources I wrote msft this question. He answered that he is investigating and will try to fix it.
I CUDALucas (and all GPU based primality tests)
Thanks to msft!

Last fiddled with by Brain on 2011-12-31 at 12:47
Brain is offline   Reply With Quote
Old 2011-12-31, 14:19   #664
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Hi ,
Quote:
Originally Posted by Brain View Post
although nobody has yet confirmed that CL >= 1.4 uses more CPU resources I wrote msft this question.
Fixed CPU issue.
Thank you,
Attached Files
File Type: bz2 CUDALucas.1.4.2.tar.bz2 (27.2 KB, 166 views)
msft is offline   Reply With Quote
Old 2011-12-31, 14:35   #665
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

exec file.
Attached Files
File Type: bz2 CUDALucas.1.4.2.cuda4.0.Linux64.tar.bz2 (32.9 KB, 185 views)
msft is offline   Reply With Quote
Old 2011-12-31, 14:36   #666
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

331 Posts
Default

Quote:
Originally Posted by msft View Post
Hi ,

Fixed CPU issue.
Thank you,
Thanks a lot. I noticed you didn't change the version string. For Win64 compilation I will change it to:
Code:
const char program_revision[] = "$Revision: 1.4.2 $";
My last test with 1.41 didn't show the high CPU utlilisation any more, see attached. I am a bit confused. Maybe I made a mistake with the "-c" param so that CL wrote it every iteration. Just guessing.

Will now compile again.
Attached Thumbnails
Click image for larger version

Name:	CPU-GPU-Utilisation.jpg
Views:	149
Size:	120.5 KB
ID:	7479  
Brain is offline   Reply With Quote
Old 2011-12-31, 14:48   #667
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

331 Posts
Default 1.4.2 looks good

Maybe even a bit faster. We'll see.
Attached Thumbnails
Click image for larger version

Name:	CPU-GPU-Utilisation.1.4.2.jpg
Views:	142
Size:	128.0 KB
ID:	7480  
Brain is offline   Reply With Quote
Old 2011-12-31, 14:50   #668
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

5138 Posts
Default Shader Model 1.3

CUDALucas 1.4.2 for Win64 / CUDA 4.0 / Compute Capability 1.3
Attached Files
File Type: exe CUDALucas.cuda4.0.sm_13.WIN64.exe (181.5 KB, 148 views)
Brain is offline   Reply With Quote
Old 2011-12-31, 14:53   #669
Brain
 
Brain's Avatar
 
Dec 2009
Peine, Germany

14B16 Posts
Default Shader Model 2.1

CUDALucas 1.4.2 for Win64 / CUDA 4.0 / Compute Capability 2.1
Attached Files
File Type: exe CUDALucas.cuda4.0.sm_21.WIN64.exe (179.0 KB, 383 views)
Brain is offline   Reply With Quote
Old 2011-12-31, 14:54   #670
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

CUFFT benchmark with cuda4.0:
Code:
CUFFT_D2Z size=512 k time=1.269744 msec
CUFFT_D2Z size=1024 k time=2.609184 msec
CUFFT_D2Z size=1536 k time=4.359898 msec
CUFFT_D2Z size=2048 k time=5.615232 msec
CUFFT_D2Z size=2560 k time=7.277350 msec
CUFFT_D2Z size=3072 k time=8.969321 msec
CUFFT_D2Z size=3584 k time=10.251376 msec
CUFFT_D2Z size=4096 k time=11.749495 msec
CUFFT_D2Z size=4608 k time=13.065844 msec
CUFFT_D2Z size=5120 k time=14.971667 msec
CUFFT_D2Z size=5632 k time=148.589874 msec
CUFFT_D2Z size=6144 k time=19.145430 msec
CUFFT_D2Z size=6656 k time=217.340973 msec
CUFFT_D2Z size=7168 k time=21.095901 msec
CUFFT_D2Z size=7680 k time=24.699974 msec
CUFFT_D2Z size=8192 k time=24.172211 msec
Some fft length was very slow.
Ver 1.42 not avoid this length.
msft is offline   Reply With Quote
Old 2011-12-31, 15:27   #671
f11ksx
 
Dec 2011

13 Posts
Talking v1.14 VS v1.42

V1.41 is running pretty well just like v1.42

9.3 ms/iter for 54M exponent on GTX-580 card with v1.41
8.6 ms/iter with v1.42

Domo
f11ksx is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Don't DC/LL them with CudaLucas LaurV Data 131 2017-05-02 18:41
CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8 Brain GPU Computing 13 2016-02-19 15:53
CUDALucas: which binary to use? Karl M Johnson GPU Computing 15 2015-10-13 04:44
settings for cudaLucas fairsky GPU Computing 11 2013-11-03 02:08
Trying to run CUDALucas on Windows 8 CP Rodrigo GPU Computing 12 2012-03-07 23:20

All times are UTC. The time now is 14:46.


Fri Jul 7 14:46:09 UTC 2023 up 323 days, 12:14, 0 users, load averages: 1.22, 1.23, 1.12

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔