mersenneforum.org  

2010-05-10, 09:12   #166
msft

Jul 2009
Tokyo

262₁₆ Posts

Very sorry, wavelet3000.

Please change MaclucasFFTW.cu around line 890 as follows:
Quote:
//64bitOS bigA=6755399441055744.0;
//32bitOS bigA=(((6.0)*0x2000000L)*0x2000000L)*0x800;
bigA=6755399441055744.0;
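A note on the constant: 6755399441055744.0 equals 3·2^51 (that is, 1.5·2^52). Adding and then subtracting a value of that size is a common way to round a double to the nearest integer, because doubles in [2^52, 2^53) are spaced exactly 1 apart. The commented-out 32-bit expression evaluates to 3·2^62, which would play the same role for a 64-bit significand such as x87 extended precision. That reading of why the two values differ, and of how bigA is used, is an assumption and is not stated in the post. A minimal C sketch of the trick follows; the name `round_via_big_a` is hypothetical and not taken from MaclucasFFTW.cu.

```c
#include <assert.h>

/* 1.5 * 2^52 = 3 * 2^51: the value assigned to bigA in the quoted lines. */
#define BIG_A 6755399441055744.0

/* 2^51: the trick is only valid for inputs smaller than this in magnitude. */
#define BIG_A_LIMIT 2251799813685248.0

/* Round x to the nearest integer (ties go to even) without calling a
 * rounding function. Assumes IEEE 754 double arithmetic in
 * round-to-nearest mode, e.g. SSE2 on x86-64 or a GPU, and no
 * -ffast-math style reassociation. */
double round_via_big_a(double x)
{
    assert(x < BIG_A_LIMIT && x > -BIG_A_LIMIT);
    /* volatile stops the compiler from simplifying (x + A) - A to x
     * and forces the intermediate sum to be stored as a double. */
    volatile double t = x + BIG_A;
    return t - BIG_A;
}
```

For example, `round_via_big_a(2.3)` gives 2.0 and `round_via_big_a(-7.8)` gives -8.0. If the intermediate sum is held in a wider register, the 3·2^51 constant no longer forces rounding at the units place, which is presumably why the 32-bit and 64-bit builds used different values.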
2010-05-10, 09:40   #167
msft

Jul 2009
Tokyo

1001100010₂ Posts

Quote:
Originally Posted by frmky
The 2048K FFT runs at 5.47 ms/iteration,
Very Good!!!
2010-05-14, 23:32   #168
msft

Jul 2009
Tokyo

1001100010₂ Posts

Hi,
Version "Q" runs at 0.0106 sec/iter for the 2048K FFT, 0.0214 sec/iter for the 4096K FFT, 0.0432 sec/iter for the 8192K FFT and 0.0895 sec/iter for the 16384K FFT on a GTX260.
Attached Files
File Type: gz MacLucasFFTW.Q.tar.gz (40.9 KB, 89 views)
2010-05-15, 20:08   #169
wavelet3000

May 2010

7 Posts

With version Q, 64-bit works fine, no problem with 44497, 110203 or other numbers.

Thanks very much.
2010-05-15, 23:15   #170
frmky

Jul 2003
So Cal

2³×3²×29 Posts

As expected,
M( 42643801 )P, n = 4194304, MacLucasFFTW v8.1 Ballester
in just over 5 days. Although unintended, this also tested the restart code. I had the program running in a terminal (and not using screen) on a Windows machine. Windows update decided to reboot the computer, closing the terminal and stopping the program. The restart worked just as it should.
2010-05-16, 01:12   #171
msft

Jul 2009
Tokyo

1142₈ Posts

Quote:
Originally Posted by frmky
As expected,
M( 42643801 )P, n = 4194304, MacLucasFFTW v8.1 Ballester
in just over 5 days.
Your GTX480 is the fastest computer in the Mersenne community today!
2010-05-16, 12:56   #172
henryzz
Just call me Henry

"David"
Sep 2007
Cambridge (GMT/BST)

5,869 Posts

Quote:
Originally Posted by msft
Your GTX480 is the fastest computer in the Mersenne community today!
Yes.
It could be good for proving a new Mersenne prime quickly.
2010-05-16, 15:14   #173
TheJudger

"Oliver"
Mar 2005
Germany

11×101 Posts

Quote:
Originally Posted by msft
Your GTX480 is the fastest computer in the Mersenne community today!
At least close to.
Glucas 2.9.2-20080916 + dual-socket Xeon X5680 (3.33GHz hexacore):
2048k FFT: 4.7ms per iteration

With a Tesla-branded Fermi (all DP units enabled) you should beat this easily.

It looks like Glucas doesn't scale as well as your code with increasing FFT sizes (at least on this system), so perhaps you're already faster for bigger FFTs. On the other hand, Glucas supports many more FFT sizes.

Good job msft!
2010-05-18, 10:38   #174
frmky

Jul 2003
So Cal

2³·3²·29 Posts

Quote:
Originally Posted by TheJudger
At least close to.
CUDA 3.1-beta is now out. Among the highlights is this little gem:
* Significant improvements in double-precision FFT performance on Fermi-architecture GPUs for 2^n transform sizes

Sure enough, the GTX 480 now runs at 4.66 ms/iter for the 2048K FFT and 9.37 ms/iter for the 4096K FFT.
2010-06-01, 12:33   #175
wavelet3000

May 2010

7 Posts

M68808029
(4M fft)
Took 16 days on a GTX280 and yielded 194 GHz-days. I am not sure if I will have the patience to double-check it soon. In the meantime, I am off to testing an 88M-range exponent (with an 8M FFT) and shopping for a Fermi.
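The run times quoted in this thread can be cross-checked with simple arithmetic, since a Lucas-Lehmer test of M(p) takes p - 2 squaring iterations. The sketch below converts between per-iteration time and total run time. The function names are hypothetical, and it assumes a constant iteration time with no checkpoint or restart overhead.

```c
#include <assert.h>
#include <stdint.h>

/* Estimated wall-clock days for a Lucas-Lehmer test of M(p),
 * given the time per iteration in milliseconds.
 * The test performs p - 2 squarings. */
double ll_test_days(uint32_t p, double ms_per_iter)
{
    assert(p > 2);
    double seconds = (double)(p - 2) * ms_per_iter / 1000.0;
    return seconds / 86400.0;   /* 86400 seconds per day */
}

/* Inverse: the average milliseconds per iteration implied by a
 * completed test of M(p) that took the given number of days. */
double ms_per_iter_from_days(uint32_t p, double days)
{
    assert(p > 2);
    return days * 86400.0 * 1000.0 / (double)(p - 2);
}
```

With the 9.37 ms/iter reported above for the 4096K FFT, `ll_test_days(42643801, 9.37)` comes to roughly 4.6 days. Going the other way, 16 days for M68808029 implies about 20 ms/iter, close to the 0.0214 sec/iter msft measured for the 4096K FFT on a GTX260.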
2010-06-02, 04:15   #176
frmky

Jul 2003
So Cal

2³×3²×29 Posts

I've been busy testing the 2M FFT on Fermi:
Code:
M( 30000037 )C, 0x307be1a2dc2bca38, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 31000003 )C, 0x9bed7651387bd02a, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 32000057 )C, 0x60bbddb7958f85e3, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33000001 )C, 0xe54b0c721739183f, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 34000081 )C, 0x64415a7a626f0e34, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 35000443 )C, 0xbf2fb6ccbc3f8780, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 36000143 )C, 0xb0be92372eeab565, n = 2097152, MacLucasFFTW v8.1  Ballester
37000133 is running now.
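For anyone collecting result lines like these, a small parser can pull out the fields. In the lines shown in this thread, the letter after the closing parenthesis is P for the known prime M42643801 and C for the others, and the C lines carry a 64-bit hex value, presumably the low bits of the final residue. The sketch below relies only on the layout visible above; the struct and function names are hypothetical, and the real program may print other variants.

```c
#include <assert.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

typedef struct {
    uint32_t exponent;  /* p in "M( p )" */
    char     status;    /* 'P' or 'C' as printed */
    uint64_t residue;   /* hex value on C lines; 0 when absent */
    uint32_t fft_len;   /* value after "n =" */
} ll_result;

/* Parse one result line. Returns 1 on success, 0 if the line does
 * not match either layout seen in the thread. */
int parse_result_line(const char *line, ll_result *out)
{
    assert(line != NULL && out != NULL);
    out->residue = 0;

    /* Layout with residue: M( p )C, 0x<hex>, n = <len>, ... */
    if (sscanf(line, "M( %" SCNu32 " )%c, 0x%" SCNx64 ", n = %" SCNu32,
               &out->exponent, &out->status,
               &out->residue, &out->fft_len) == 4)
        return 1;

    /* Layout without residue: M( p )P, n = <len>, ... */
    if (sscanf(line, "M( %" SCNu32 " )%c, n = %" SCNu32,
               &out->exponent, &out->status, &out->fft_len) == 3)
        return 1;

    return 0;
}
```

Keeping the residue as a `uint64_t` makes it easy to compare two runs of the same exponent, which is the usual way a double check is confirmed.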
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.