mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-07-14, 15:51   #1816
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11·101 Posts
Default

Some data from tf_barrett96.cu: mod_simple_96():
Code:
 qi = 0
 q =   00000007 3C3F1F20 C454D397
 nn =  00000000 00000000 00000000
 res = 00000007 3C3F1F1F C454D397
res = q - nn;

So for now it looks like CUDA 5.0.7 fails when somebody uses sub with carry when the subtrahend is 0. So for now it looks like a bug in CUDA 5.0.7.

Oliver
TheJudger is offline   Reply With Quote
Old 2012-07-15, 05:26   #1817
axn
 
axn's Avatar
 
Jun 2003

117358 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Some data from tf_barrett96.cu: mod_simple_96():
Code:
 qi = 0
 q =   00000007 3C3F1F20 C454D397
 nn =  00000000 00000000 00000000
 res = 00000007 3C3F1F1F C454D397
res = q - nn;

So for now it looks like CUDA 5.0.7 fails when somebody uses sub with carry when the subtrahend is 0. So for now it looks like a bug in CUDA 5.0.7.

Oliver
Have you tried adding the volatile keyword to your asm statements?
axn is offline   Reply With Quote
Old 2012-07-17, 10:27   #1818
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11×101 Posts
Default

Nvidia confirmed the bug so I would say: not my fault/problem!

Oliver
TheJudger is offline   Reply With Quote
Old 2012-07-17, 15:53   #1819
Xyzzy
 
Xyzzy's Avatar
 
"Mike"
Aug 2002

5·17·97 Posts
Default

We submitted all of our completed work today in one file and were awarded some "extra credit". (See attached image.)

Code:
P-1 found a factor in stage #2, B1=565000, B2=12147500, E=6.
M56350163 has a factor: 24948611431313562132407
P-1 found a factor in stage #2, B1=540000, B2=11475000, E=6.
M54203297 has a factor: 43709161575143787520913
Attached Thumbnails
Click image for larger version

Name:	credit.png
Views:	155
Size:	15.4 KB
ID:	8205  
Xyzzy is offline   Reply With Quote
Old 2012-07-17, 17:02   #1820
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23×149 Posts
Default

Quote:
Originally Posted by Xyzzy View Post
We submitted all of our completed work today in one file and were awarded some "extra credit"
I'm not even sure how to come up with those numbers... that's approx 150%-200% what credit you should get for those factors even as credited for TF.
James Heinrich is offline   Reply With Quote
Old 2012-07-17, 18:06   #1821
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

2·32·419 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
I'm not even sure how to come up with those numbers... that's approx 150%-200% what credit you should get for those factors even as credited for TF.
James, did you change the manual web forms along the lines we were discussing? If so, did the B1/B2 bounds get recorded correctly? Maybe, the underlying PHP guessed the wrong FFT size or we passed in a bogus FFT size?
Prime95 is online now   Reply With Quote
Old 2012-07-17, 18:09   #1822
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23·149 Posts
Default

Quote:
Originally Posted by Prime95 View Post
James, did you change the manual web forms along the lines we were discussing? If so, did the B1/B2 bounds get recorded correctly? Maybe, the underlying PHP guessed the wrong FFT size or we passed in a bogus FFT size?
No, I hadn't got to that yet (I was going to... the day PrimeNet was down for a few hours), the manual form is as yet unchanged.
James Heinrich is offline   Reply With Quote
Old 2012-07-17, 18:27   #1823
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

11101011101102 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
No, I hadn't got to that yet...
Weird. All the more reason to make those changes!
Prime95 is online now   Reply With Quote
Old 2012-07-31, 23:28   #1824
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

2·32·419 Posts
Default 20+% improvement

Oliver,

I propose creating a barrett77_mul32. This is the same as barrett79_mul32 but with the mod_simple_96 moved out of the loop. As long as f does not exceed 77 bits, a will not exceed 80 bits (above 80 bits and square_96_160 will fail).

I tested this out and it passes the self tests up through 77 bits. Raw speed went from 205M/sec to 250M/sec.

Crude source is attached.
Attached Files
File Type: txt tf_barrett96.txt (43.1 KB, 216 views)
Prime95 is online now   Reply With Quote
Old 2012-07-31, 23:53   #1825
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

236568 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Oliver,
.......
I tested this out and it passes the self tests up through 77 bits. Raw speed went from 205M/sec to 250M/sec.....
Wow.
kladner is offline   Reply With Quote
Old 2012-08-01, 00:34   #1826
Xyzzy
 
Xyzzy's Avatar
 
"Mike"
Aug 2002

5×17×97 Posts
Default

More "extra credit":

Code:
Processing result: M56505451 has a factor: 86553876518403762963169
CPU credit is 323.9309 GHz-days.
Processing result: M56488651 has a factor: 35566445275259107720993
CPU credit is 129.5622 GHz-days.
Processing result: M56491177 has a factor: 23502006329787341695151
CPU credit is 89.0731 GHz-days.
Xyzzy is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
The P-1 factoring CUDA program firejuggler GPU Computing 753 2020-12-12 18:07
gr-mfaktc: a CUDA program for generalized repunits prefactoring MrRepunit GPU Computing 32 2020-11-11 19:56
mfaktc 0.21 - CUDA runtime wrong keisentraut Software 2 2020-08-18 07:03
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51

All times are UTC. The time now is 01:04.


Fri Aug 6 01:04:19 UTC 2021 up 13 days, 19:33, 1 user, load averages: 2.15, 2.42, 2.34

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.