![]() |
![]() |
#1 |
Dec 2009
Peine, Germany
33110 Posts |
![]()
Hi folks,
I am running CUDALucas compiled against CUDA 6.5. (download from here: https://sourceforge.net/projects/cudalucas/files/ ). I use 2 Titans (Kepler, GK110, CC 3.5). Question is: Will there be a speedup when recompiling against CUDA 7 / 7.5 / 8 ? Has anyone tried it? (I have compiled CUDALucas years ago when CUDA was at 4.2 or so...) I read that airsquirrels is working on this here: http://www.mersenneforum.org/showpos...postcount=2431 Could anybody provide a newer compile for evaluation? Maybe Win64, CUDA 7.5, CC 3.5? (Nvidia claims cuFFT improvements in CUDA 7: https://devblogs.nvidia.com/parallel...ture-overview/) Futher questions:
Bye, Sebastian |
![]() |
![]() |
![]() |
#2 |
"David"
Jul 2015
Ohio
10000001012 Posts |
![]()
I tried CUDALucas against both 6.5 and 7.5 and did see a small improvement using 7.5, enough that I installed both 6.5 (for mfaktc which hits a bug in 7.5) and 7.5 on most of my systems. I don't remember the exact benchmark numbers off hand other than it being worth it.
|
![]() |
![]() |
![]() |
#3 | |
Dec 2009
Peine, Germany
331 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#4 |
"David"
Jul 2015
Ohio
11·47 Posts |
![]() |
![]() |
![]() |
![]() |
#5 |
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
![]() |
![]() |
![]() |
![]() |
#6 | |
Einyen
Dec 2003
Denmark
2×17×101 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#7 |
"David"
Jul 2015
Ohio
11·47 Posts |
![]() |
![]() |
![]() |
![]() |
#8 | |
Dec 2009
Peine, Germany
331 Posts |
![]()
Old Version from 2012 (out-dated):
Quote:
Code:
WINDOWS MAKE: This can make non-debug versions from CUDA 4.0 and up. To compile CUDA 4.0 thru 6.5 you need to have the applicable CUDA toolkit installed and MSVS 2012. CUDA 6.5 requires the toolkit and MSVS 2012. ** You must have the correct MSVS, the applicable CUDA Toolkit and current drivers installed first ** 1. Obtain make.exe from here: http://www.equation.com/servlet/equation.cmd?fa=make 2. Place make.exe into the folder with the sourcefiles and makefile.win 3. Use MUST use the 'command shortcut' included with the appropriate version of MSVS If you want x86, use the x86 and if you want x64, use x64. Use the shortcut from MSVS 2010 for CUDA 4.0 to CUDA 5.0 and MSVS 2012 for CUDA 5.5/6.5 4. Open makefile.win and set your desired bit level, cuda and version and location of MSVS then save 5. Type: make -f makefile.win 6. When complete type make -f makefile.win clean 7. The executable is placed one directory up from your source files |
|
![]() |
![]() |
![]() |
#9 |
Dec 2009
Peine, Germany
331 Posts |
![]()
I have compiled "cudalucas-code-99-trunk" with CUDA 7.5 for Win64.
HowTo compile CUDALucas for Win64, additional info to README:
Tests with 5184K FFT length @Titan @836 MHz / 2600 MHz: CUDA 6.5: ~4.185ms / It @ ~ERR = 0.25 CUDA 7.5: ~4.22 ms / It @ ~ERR = 0.18 cufftbench still recommends 5184K as optimal FFT length in my exponent case. --> CUDA 7.5 seems to be at this run ~8% SLOWER than CUDA 6.5 but has a ~30% smaller error!?? Find attached my modified makefile.win for CUDA 7.5 support. I'd be very interested in independent compiles and checks. Maybe I did something wrong but I don't think so. At the weekend I will probably run a 7.5 double check but current performance is a disappointment. Bye, Sebastian |
![]() |
![]() |
![]() |
#10 |
"David"
Jul 2015
Ohio
10000001012 Posts |
![]()
The change in error makes me suspect a different FFT size was used.
Did you configure the FFT size yourself, or let the automatic error detection select it? |
![]() |
![]() |
![]() |
#11 | |
Dec 2009
Peine, Germany
331 Posts |
![]() Quote:
Code:
Device GeForce GTX TITAN Compatibility 3.5 clockRate (MHz) 875 memClockRate (MHz) 2600 fft max exp ms/iter 4096 75846319 3.0423 4320 79902611 3.9058 4374 80879779 4.0908 4536 83809729 4.1059 4608 85111207 4.1071 5184 95507747 4.1222 5488 100984691 4.7986 5600 103000823 5.0615 5832 107174381 5.4100 6048 111056879 5.4940 6144 112781477 5.5065 6250 114685037 5.7932 6400 117377567 5.8289 6480 118813021 5.8704 6561 120266023 6.1004 6912 126558077 6.2039 7776 142017539 6.3066 8192 149447533 6.4384 Last fiddled with by Brain on 2016-02-19 at 05:54 Reason: cufftbench |
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
CUDALucas (a.k.a. MaclucasFFTW/CUDA 2.3/CUFFTW) | msft | GPU Computing | 2832 | 2020-11-01 14:16 |
Don't DC/LL them with CudaLucas | LaurV | Data | 131 | 2017-05-02 18:41 |
CUDALucas gives all-zero residues | fivemack | GPU Computing | 4 | 2016-07-21 15:49 |
Performance of cuda-ecm on newer hardware? | fivemack | GMP-ECM | 14 | 2015-02-12 20:10 |
cuFFT on multiple GPUs | HHfromG | GPU Computing | 2 | 2014-05-11 19:57 |