mersenneforum.org  

Old 2009-11-24, 15:34   #100
Uncwilly
6809 > 6502
 
 
"""""""""""""""""""
Aug 2003
101×103 Posts

254B₁₆ Posts

Quote:
Originally Posted by lycorn View Post
Yes. M216091 and M24036583 have been successfully verified.
It would probably be wise to add a GPU to the standard suite of verifiers, for any new candidate.
Old 2009-11-24, 17:11   #101
ET_
Banned
 
 
"Luigi"
Aug 2002
Team Italia

4,813 Posts

Quote:
Originally Posted by Uncwilly View Post
It would probably be wise to add a GPU to the standard suite of verifiers, for any new candidate.
Maybe after GX300 comes out...

Luigi
Old 2009-11-24, 22:03   #102
em99010pepe
 
 
Sep 2004

2·5·283 Posts

Quote:
Originally Posted by msft View Post
Hi, ET_

I am now testing M23102129 on a GTX260 (GPU/CUDA) and M22094129 on a Q8400 (Prime95). Both are in the same machine ;-p
Can you test 4 numbers in parallel on your Q8400 and the exact 4 in series on your GTX260?

Last fiddled with by em99010pepe on 2009-11-24 at 22:06
Old 2009-11-25, 04:23   #103
msft
 
 
Jul 2009
Tokyo

1142₈ Posts

Hi, Uncwilly
Quote:
Originally Posted by Uncwilly View Post
It would probably be wise to add a GPU to the standard suite of verifiers, for any new candidate.
First we need a new candidate.

Hi, ET_
Quote:
Originally Posted by ET_ View Post
Maybe after GX300 comes out...
http://www.nvidia.com/object/io_1258360868914.html
>Editors’ note: As previously announced, the first Fermi-based consumer (GeForce) products are expected to be available first quarter 2010.

Hi, em99010pepe
Quote:
Originally Posted by em99010pepe View Post
Can you test 4 numbers in parallel on your Q8400 and the exact 4 in series on your GTX260?
I will try it.

New GTX260 result.
M22728263

Thank you,
Old 2009-11-26, 04:11   #104
frmky
 
 
Jul 2003
So Cal

2³×3²×29 Posts

Three more double checks have finished. These are just under the 2048K/4096K boundary. Only one of three matched the previous result. Time will tell if the other two are correct. I also have a fourth running, 36500117, but after about a million iterations, the roundoff error grew above the limit and it switched over to a 4096K FFT. Therefore, it is only about halfway through the double-check.

36500089
36500111
36500119
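
For readers unfamiliar with the roundoff check mentioned above, here is a minimal Python sketch. It is not code from MacLucasFFTW. It squares an integer with a floating-point FFT and reports how far the raw results were from whole numbers. The names `fft` and `square_with_roundoff`, the word sizes, and the pure-Python FFT are all assumptions made for illustration.

```python
import cmath

def fft(a, invert=False):
    """Recursive radix-2 FFT (unscaled); len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    even = fft(a[0::2], invert)
    odd = fft(a[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        w = cmath.exp(sign * 2j * cmath.pi * k / n)
        t = w * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def square_with_roundoff(x, bits_per_word, n_words):
    """Square x via a floating-point FFT.

    Returns (x*x, max_err), where max_err is the largest distance between
    a raw convolution output and the nearest integer. n_words must be a
    power of two and x must fit in n_words * bits_per_word bits.
    """
    mask = (1 << bits_per_word) - 1
    digits = [(x >> (bits_per_word * i)) & mask for i in range(n_words)]
    # Zero-pad so the cyclic convolution equals the ordinary product.
    padded = [complex(d) for d in digits] + [0j] * n_words
    size = len(padded)
    spectrum = fft(padded)
    conv = fft([v * v for v in spectrum], invert=True)
    result = 0
    max_err = 0.0
    for i, v in enumerate(conv):
        real = v.real / size            # the inverse transform is unscaled
        nearest = round(real)
        max_err = max(max_err, abs(real - nearest))
        result += nearest << (bits_per_word * i)   # carries resolve here
    return result, max_err

# Same number, two word sizes: larger words mean fewer FFT points but
# larger intermediate values, and so less headroom before rounding fails.
x = (1 << 127) - 12345
for bits in (8, 16):
    product, err = square_with_roundoff(x, bits, 128 // bits)
    print(bits, product == x * x, err)
```

A program can watch a value like `max_err` on every iteration and move to a longer FFT, with fewer bits per word, once it gets too close to 0.5. That corresponds to the 2048K to 4096K switch described above, though the actual threshold and logic in MacLucasFFTW are not shown in this thread.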
Old 2009-11-26, 09:05   #105
frmky
 
 
Jul 2003
So Cal

2³·3²·29 Posts

On the Tesla C1060, the TESRA version C is much slower than version y, but the non-TESRA version is the fastest yet with the 4096K FFT timing at 0.025 sec/iteration. If the improvements cannot also be used to optimize the TESRA version, then the TESRA version can now be dropped.
Old 2009-11-26, 09:37   #106
bayanne
 
 
"Tony Gott"
Aug 2002
Yell, Shetland, UK

3×107 Posts

Quote:
Originally Posted by msft View Post
Hi, bayanne

you can download MacLucasFFTW.cuda.C.tar.gz,
and compare the performance of the unchanged version against a build with -DTESRA deleted from the Makefile.
Thank you,
I had hoped to get this running as I had advised. However, the wrong cable was in the box, and I've had to get a special cable shipped from the US.

Secondly, I did have problems compiling this version so wondered if you could refresh me on the steps I need to take....

Cheers
Old 2009-11-26, 09:51   #107
msft
 
 
Jul 2009
Tokyo

2×5×61 Posts

Hi, TheJudger
I wish you success with your programming.

Quote:
Originally Posted by frmky View Post
Only one of three matched the previous result.
Fascinating. Testing is very important. Could overclockers be contaminating the results?

Quote:
Originally Posted by frmky View Post
On the Tesla C1060, the TESRA version C is much slower than version y, but the non-TESRA version is the fastest yet with the 4096K FFT timing at 0.025 sec/iteration. If the improvements cannot also be used to optimize the TESRA version, then the TESRA version can now be dropped.
I understand. I will delete that code in the next version.

Thank you,
Old 2009-11-26, 09:53   #108
msft
 
 
Jul 2009
Tokyo

2×5×61 Posts

Hi, bayanne

Quote:
Originally Posted by bayanne View Post
Secondly, I did have problems compiling this version so wondered if you could refresh me on the steps I need to take....
Please post the error message.

Thank you,
Old 2009-11-26, 16:41   #109
msft
 
 
Jul 2009
Tokyo

2·5·61 Posts

Quote:
Originally Posted by frmky View Post
Added the Prime95 result.
Old 2009-11-26, 17:14   #110
TheJudger
 
 
"Oliver"
Mar 2005
Germany

11·101 Posts

Hi msft/frmky,

just some ideas off the top of my head:

- perhaps you should choose exponents which have already been verified? In that case you know immediately whether your results are OK or not.

- choose some exponents which are not so close to the FFT limit. I didn't dive into the CUFFTW docs; perhaps its rounding is less accurate than in the CPU versions of MacLucasFFTW and you need to lower the FFT boundaries?

TheJudger
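
To illustrate the first idea, checking against exponents whose status is already known, here is a minimal sketch of the Lucas-Lehmer test in Python. It uses Python's built-in big integers rather than the FFT multiplication that MacLucasFFTW uses, and the function names are hypothetical. It only shows the arithmetic being double-checked.

```python
def lucas_lehmer_residue(p):
    """Return s_(p-2) mod (2^p - 1), starting from s_0 = 4, for an odd prime p."""
    if p < 3:
        raise ValueError("this sketch expects an odd prime exponent")
    m = (1 << p) - 1            # the Mersenne number M_p
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s

def is_mersenne_prime(p):
    """M_p is prime exactly when the final residue is zero."""
    return lucas_lehmer_residue(p) == 0

# Small exponents with long-established results make a quick self-test.
known = {7: True, 11: False, 13: True, 23: False, 31: True}
for p, expected in known.items():
    assert is_mersenne_prime(p) == expected
```

For a composite Mersenne number the final residue is nonzero, so two independent runs of the same exponent should produce the same residue. A mismatch, as with the two double checks reported earlier in the thread, means at least one of the runs went wrong somewhere.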

P.S. 22 million checks per second for TF :)

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.