mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   CUDALucas (a.k.a. MaclucasFFTW/CUDA 2.3/CUFFTW) (https://www.mersenneforum.org/showthread.php?t=12576)

owftheevil 2015-02-07 02:57

Yes, it should be as you state. He's using the syntax for 2.03 which used the third parameter for the jump between fft values. I'll pm you in the morning about my suggestions on the README.

James Heinrich 2015-02-07 13:46

[QUOTE=flashjh;394771][url]http://www.mersenne.ca/cudalucas.php[/url]
CUDALucas -info -cufftbench 1024 8192 1024 >> benchmark.txt
Now it should be this, right?:
CUDALucas -cufftbench 1024 8192 5 >> benchmark.txt[/QUOTE]I have changed 1024 -> 5 per your suggestion.
I'm not familiar with CUDALucas (and the only computer I have access to at the moment is powered by i945GME), but I think the -info added some details about the GPU, has that been deprecated?

owftheevil 2015-02-07 14:15

Some card info is posted in the fft.txt file. The beginning of a typical such file looks like this:

[CODE]
Device GeForce GTX 570
Compatibility 2.0
clockRate (MHz) 1464
memClockRate (MHz) 1900

fft max exp ms/iter
1024 19535569 1.7232
1080 20580341 2.0214
1120 21325891 2.0459
1152 21921901 2.0549
1176 22368691 2.2586
[/CODE]The clockRate and memClockRate numbers are deceptive since they are pre-load numbers.

I would be glad to put in more info if it would be useful.

ET_ 2015-02-07 15:46

Is there a version 2.05 for Linux working with the GTX980?
Or should I wait for a 2.06 with more info as owftheevil just hinted?

Luigi

flashjh 2015-02-07 16:06

2.05 will work with Windows and Linux on the 980, should be out of beta today.

owftheevil 2015-02-07 16:43

@ET To compile with cc5.2, cuda7.0. is required. I can upload the binary to Sourceforge if you want it. Not having a cc5.2 card myself, it hasn't been tested.

flashjh 2015-02-07 16:51

Does the CC 5.0 CUDA 6.5 version not work on the 980?

I am able to compile 5.2 with CUDA 6.5 on Windows, does Linux not?

owftheevil 2015-02-07 17:09

Here's what I get when I try with 6.5:
[CODE]
[filbert@archfilbert trunk]$ make
/usr/local/cuda-6.5/bin/nvcc -O3 --generate-code arch=compute_52,code=sm_52 --compiler-options=-Wall -I/usr/local/cuda-6.5/include -c CUDALucas.cu
nvcc fatal : Unsupported gpu architecture 'compute_52'
Makefile:26: recipe for target 'CUDALucas.o' failed
make: *** [CUDALucas.o] Error 1
[/CODE]

flashjh 2015-02-07 17:30

[QUOTE=owftheevil;394819]Here's what I get when I try with 6.5:
[CODE]
[filbert@archfilbert trunk]$ make
/usr/local/cuda-6.5/bin/nvcc -O3 --generate-code arch=compute_52,code=sm_52 --compiler-options=-Wall -I/usr/local/cuda-6.5/include -c CUDALucas.cu
nvcc fatal : Unsupported gpu architecture 'compute_52'
Makefile:26: recipe for target 'CUDALucas.o' failed
make: *** [CUDALucas.o] Error 1
[/CODE][/QUOTE]

Interesting... I just tried again and it does work for Windows. I stand corrected, sorry. For Linux you'll need CUDA 7.0 but either way 2.05 will work with Windows and Linux.

owftheevil 2015-02-07 17:54

flashjh, you were also correct in that the 980 should work with the cc5.0 version. It just might not be optimal.

owftheevil 2015-02-07 18:09

Version built for cuda 4.2 - 7.0 and some quick benchmarks for comparisons of the different cuda versions on Linux:[FONT=sans-serif][COLOR=#000000]
[/COLOR][/FONT][COLOR=#000000][FONT=sans-serif][CODE]
[filbert@archfilbert trunk]$ ./CUDALucas-4.2 57885161
Using threads: square 256, splice 512.
Starting M57885161 fft length = 3136K
| Date Time | Test Num Iter Residue | FFT Error ms/It Time | ETA Done |
| Feb 07 09:41:08 | M57885161 10000 0x76c27556683cd84d | 3136K 0.21764 2.2289 22.28s | 1:11:50:02 0.01% |
| Feb 07 09:41:30 | M57885161 20000 0xfd8e311d20ffe6ab | 3136K 0.22456 2.2291 22.29s | 1:11:49:46 0.03% |

[/CODE]
[CODE]
[filbert@archfilbert trunk]$ ./CUDALucas-5.0 57885161

Using threads: square 256, splice 512.
Starting M57885161 fft length = 3136K
| Date Time | Test Num Iter Residue | FFT Error ms/It Time | ETA Done |
| Feb 07 09:42:51 | M57885161 10000 0x76c27556683cd84d | 3136K 0.25391 2.0388 20.38s | 1:08:46:35 0.01% |
| Feb 07 09:43:12 | M57885161 20000 0xfd8e311d20ffe6ab | 3136K 0.27148 2.0388 20.38s | 1:08:46:16 0.03% |

[/CODE]
[CODE]
[filbert@archfilbert trunk]$ ./CUDALucas-5.5 57885161

Using threads: square 256, splice 512.
Starting M57885161 fft length = 3136K
| Date Time | Test Num Iter Residue | FFT Error ms/It Time | ETA Done |
| Feb 07 09:44:07 | M57885161 10000 0x76c27556683cd84d | 3136K 0.23438 2.0287 20.28s | 1:08:36:55 0.01% |
| Feb 07 09:44:27 | M57885161 20000 0xfd8e311d20ffe6ab | 3136K 0.23535 2.0309 20.30s | 1:08:37:38 0.03% |

[/CODE]
[CODE]
[filbert@archfilbert trunk]$ ./CUDALucas-6.0 57885161

Using threads: square 256, splice 512.
Starting M57885161 fft length = 3136K
| Date Time | Test Num Iter Residue | FFT Error ms/It Time | ETA Done |
| Feb 07 09:45:23 | M57885161 10000 0x76c27556683cd84d | 3136K 0.26562 2.0886 20.88s | 1:09:34:40 0.01% |
| Feb 07 09:45:44 | M57885161 20000 0xfd8e311d20ffe6ab | 3136K 0.23828 2.0881 20.88s | 1:09:34:05 0.03% |

[/CODE]
[CODE]
[filbert@archfilbert trunk]$ ./CUDALucas-6.5 57885161

Using threads: square 256, splice 512.
Starting M57885161 fft length = 3136K
| Date Time | Test Num Iter Residue | FFT Error ms/It Time | ETA Done |
| Feb 07 09:46:35 | M57885161 10000 0x76c27556683cd84d | 3136K 0.23828 2.0607 20.60s | 1:09:07:44 0.01% |
| Feb 07 09:46:56 | M57885161 20000 0xfd8e311d20ffe6ab | 3136K 0.24805 2.0604 20.60s | 1:09:07:17 0.03% |

[/CODE]
[CODE]
[filbert@archfilbert trunk]$ ./CUDALucas-7.0 57885161

Using threads: square 256, splice 512.
Starting M57885161 fft length = 3136K
| Date Time | Test Num Iter Residue | FFT Error ms/It Time | ETA Done |
| Feb 07 09:47:44 | M57885161 10000 0x76c27556683cd84d | 3136K 0.25000 2.0633 20.63s | 1:09:10:14 0.01% |
| Feb 07 09:48:05 | M57885161 20000 0xfd8e311d20ffe6ab | 3136K 0.25000 2.0631 20.63s | 1:09:09:47 0.03% |

[/CODE]
[/FONT][/COLOR]


All times are UTC. The time now is 23:05.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.