mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-01-05, 02:23   #111
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Hi ,
Quote:
Originally Posted by rroonnaalldd View Post
Do you know what causes the underlined differences in err-rates between the 32bit- and 64bit-app?
cudatoolkit_4.0.17_linux_32_ubuntu10.10.run
gpucomputingsdk_4.0.17_linux.run
devdriver_4.1_linux_32_285.05.23.run
with GTX-550Ti
Code:
genefercuda.1.051$ ./GeneferCUDA.cuda4.0.Linux32 -b
GeneferCUDA 2.2.1 (CUDA) based on Genefer v2.2.1
Copyright (C) 2001-2003, Yves Gallot (v1.3)
Copyright (C) 2009, 2011 Mark Rodenkirch, David Underbakke (v2.2.1)
Copyright (C) 2010, 2011, Shoichiro Yamada (CUDA)
A program for finding large probable generalized Fermat primes.

Generalized Fermat Number Bench
2009574^8192+1  Time: 378 us/mul.       Err: 3.82e-01   51636 digits
1632282^16384+1 Time: 384 us/mul.       Err: 2.53e-01   101791 digits
1325824^32768+1 Time: 437 us/mul.       Err: 2.03e-01   200622 digits
1076904^65536+1 Time: 587 us/mul.       Err: 1.88e-01   395325 digits
874718^131072+1 Time: 817 us/mul.       Err: 3.47e-01   778813 digits
710492^262144+1 Time: 1.33 ms/mul.      Err: 4.21e-01   1533952 digits
577098^524288+1 Time: 2.58 ms/mul.      Err: 2.01e-01   3020555 digits
468750^1048576+1        Time: 5.25 ms/mul.      Err: 1.64e-01   5946413 digits
380742^2097152+1        Time: 10.6 ms/mul.      Err: 3.63e-01   11703432 digits
309258^4194304+1        Time: 22.7 ms/mul.      Err: 1.56e-01   23028076 digits
251196^8388608+1        Time: 46.5 ms/mul.      Err: 1.56e-01   45298590 digits
msft is offline   Reply With Quote
Old 2012-01-05, 03:37   #112
rroonnaalldd
 
rroonnaalldd's Avatar
 
Dec 2011

2·7 Posts
Default

Hi ,
Quote:
Originally Posted by msft View Post
cudatoolkit_4.0.17_linux_32_ubuntu10.10.run
gpucomputingsdk_4.0.17_linux.run
devdriver_4.1_linux_32_285.05.23.run
with GTX-550Ti
I have installed the same Cuda41rc2-driver and the older Cuda40-sdk. Only difference is my smaller GTS450...


[add]
All your error-rates seems to be the same except:
Quote:
309258^4194304+1 Time: 22.7 ms/mul. Err: 1.56e-01 23028076 digits
251196^8388608+1 Time: 46.5 ms/mul. Err: 1.56e-01 45298590 digits
I have:
Quote:
309258^4194304+1 Time: 35.8 ms/mul. Err: 4.07e-01 23028076 digits
251196^8388608+1 Time: 73.2 ms/mul. Err: 4.33e-01 45298590 digits

Last fiddled with by rroonnaalldd on 2012-01-05 at 03:56
rroonnaalldd is offline   Reply With Quote
Old 2012-01-05, 04:29   #113
axn
 
axn's Avatar
 
Jun 2003

10011110111012 Posts
Default

Quote:
Originally Posted by AG5BPilot View Post
Along the way, I found that at N=4194304 Genefer takes a very long time to start up. Looking at the code, I discovered that the really simple code at the beginning that computes the actual value of b^N, by repeatedly squaring b, takes a very long time when N=44194304. About 2 hours on my Core2Quad.
Quote:
Originally Posted by msft View Post
Or something like this(GMP ?).
Use the GMP, Luke

Code:
? N=222222^4194304;
? ##
  ***   last result computed in 2,749 ms.
That's 2.7 seconds on a 32-bit build of PARI (w/ GMP) running on Core 2 duo 2Ghz. A 64-build version will be much faster.

It can probably made even faster, by removing the 2's from the (even) base, and just doing the relevant number of squarings in the PRP loop.
axn is offline   Reply With Quote
Old 2012-01-05, 07:04   #114
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default

Quote:
Originally Posted by axn View Post
Use the GMP, Luke
I like this joke.
Do not have licence issue?
msft is offline   Reply With Quote
Old 2012-01-06, 11:34   #115
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Ver 1.06
Implement GMP.
Attached Files
File Type: bz2 GeneferCUDA.1.06.tar.bz2 (11.0 KB, 80 views)
msft is offline   Reply With Quote
Old 2012-01-06, 11:57   #116
rroonnaalldd
 
rroonnaalldd's Avatar
 
Dec 2011

E16 Posts
Default

gmp.h missing...

Solved by using apt-cache search gmp...

Last fiddled with by rroonnaalldd on 2012-01-06 at 12:20
rroonnaalldd is offline   Reply With Quote
Old 2012-01-06, 12:05   #117
debrouxl
 
debrouxl's Avatar
 
Sep 2009

977 Posts
Default

You have to install the external GMP software yourself
debrouxl is offline   Reply With Quote
Old 2012-01-06, 16:31   #118
rroonnaalldd
 
rroonnaalldd's Avatar
 
Dec 2011

168 Posts
Default

Got it working and did a comparison between v1.051 and 1.060:
Quote:
boinc@vmware2k-3:~/Cuda/test$ time ./GeneferCUDA_cudart_cuda40_64bit -b

2009574^8192+1 Time: 696 us/mul. Err: 3.82e-01 51636 digits
1632282^16384+1 Time: 713 us/mul. Err: 2.53e-01 101791 digits
1325824^32768+1 Time: 777 us/mul. Err: 2.03e-01 200622 digits
1076904^65536+1 Time: 1.01 ms/mul. Err: 1.88e-01 395325 digits
874718^131072+1 Time: 1.37 ms/mul. Err: 3.47e-01 778813 digits
710492^262144+1 Time: 2.11 ms/mul. Err: 4.21e-01 1533952 digits
577098^524288+1 Time: 4.09 ms/mul. Err: 2.01e-01 3020555 digits
468750^1048576+1 Time: 8.21 ms/mul. Err: 1.72e-01 5946413 digits
380742^2097152+1 Time: 16.7 ms/mul. Err: 3.63e-01 11703432 digits
309258^4194304+1 Time: 36.8 ms/mul. Err: 1.56e-01 23028076 digits
251196^8388608+1 Time: 74.7 ms/mul. Err: 1.56e-01 45298590 digits

real 34m31.539s
user 14m8.597s
sys 20m12.152s
Quote:
boinc@vmware2k-3:~/Cuda/test$ time ./GeneferCUDA_gmp_cuda40_64bit -b

2009574^8192+1 Time: 696 us/mul. Err: 3.82e-01 51636 digits
1632282^16384+1 Time: 714 us/mul. Err: 2.53e-01 101791 digits
1325824^32768+1 Time: 777 us/mul. Err: 2.03e-01 200622 digits
1076904^65536+1 Time: 1.01 ms/mul. Err: 1.88e-01 395325 digits
874718^131072+1 Time: 1.37 ms/mul. Err: 3.47e-01 778813 digits
710492^262144+1 Time: 2.11 ms/mul. Err: 4.21e-01 1533952 digits
577098^524288+1 Time: 4.09 ms/mul. Err: 2.01e-01 3020555 digits
468750^1048576+1 Time: 8.21 ms/mul. Err: 1.72e-01 5946413 digits
380742^2097152+1 Time: 16.7 ms/mul. Err: 3.63e-01 11703432 digits
309258^4194304+1 Time: 36.8 ms/mul. Err: 1.56e-01 23028076 digits
251196^8388608+1 Time: 74.7 ms/mul. Err: 1.56e-01 45298590 digits

real 34m31.597s
user 14m10.249s
sys 20m10.672s
rroonnaalldd is offline   Reply With Quote
Old 2012-01-07, 04:54   #119
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default

Quote:
Originally Posted by rroonnaalldd View Post
Got it working and did a comparison between v1.051 and 1.060:
"-b" option not use GMP.
Please try normal test.
By the way.
http://www.primegrid.com/forum_threa...rap=true#46251
AG5BPilot make non GMP version.
We can wait his work.
msft is offline   Reply With Quote
Old 2012-01-07, 05:06   #120
AG5BPilot
 
AG5BPilot's Avatar
 
Dec 2011
New York, U.S.A.

97 Posts
Default

Quote:
Originally Posted by msft View Post
"-b" option not use GMP.
Please try normal test.
By the way.
http://www.primegrid.com/forum_threa...rap=true#46251
AG5BPilot make non GMP version.
We can wait his work.
Hopefully it will work. :)

BTW, you can add this line into the residual check:

Code:
    check(  1248,4194304, "8f985a974820a6d3");
AG5BPilot is offline   Reply With Quote
Old 2012-01-08, 02:55   #121
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Ver 1.061
Add:
Code:
    check(  1248,4194304, "8f985a974820a6d3");
Attached Files
File Type: bz2 GeneferCUDA.1.061.tar.bz2 (11.1 KB, 89 views)
msft is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Genefer's FFT applied to Mersenne squaring preda Software 0 2017-09-06 02:54
CUDA 5.5 ET_ GPU Computing 2 2013-06-13 15:50
AVX CPU LL vs CUDA LL nucleon GPU Computing 11 2012-01-04 17:52
Best CUDA GPU for the $$ Christenson GPU Computing 24 2011-05-01 00:06
CUDA? Xentar Conjectures 'R Us 6 2010-03-31 07:43

All times are UTC. The time now is 05:55.


Fri Aug 6 05:55:29 UTC 2021 up 14 days, 24 mins, 1 user, load averages: 3.37, 3.48, 3.21

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.