mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2014-10-22, 20:19   #2410
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23×149 Posts
Default

Does this mean GPU sieving will be available below 2^64 for large exponents, such as my own pet project of 1000M-4296M? I only need support down to 2^52 since everything has been done below that, but if GPU sieving could be enabled... that would be of tremendous benefit
James Heinrich is offline   Reply With Quote
Old 2014-10-22, 20:55   #2411
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

3·199 Posts
Default

mfakto's _gs kernels start at 260, also for your pet project, James.

So far, I was not approached to lower that limit, and I don't have that on my plan (yet).

Code:
got assignment: exp=4201971233 bit_min=66 bit_max=67 (0.00 GHz-days)
Starting trial factoring M4201971233 from 2^66 to 2^67 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 22:42 | 4611 100.0% |  0.002   0m00s |    160.06    80181    0.00%
no factor for M4201971233 from 2^66 to 2^67 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  2.751s (111.71 GHz-days / day)
Maybe you have a factor (> 260) for me that I could verify to find?

edit2: Using less-classes, that same thing is even more fun:
Code:
got assignment: exp=4201971233 bit_min=66 bit_max=67 (0.00 GHz-days)
Starting trial factoring M4201971233 from 2^66 to 2^67 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:02 |  412 100.0% |  0.009   0m00s |    355.68    80181    0.00%
no factor for M4201971233 from 2^66 to 2^67 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.995s (308.85 GHz-days / day)

Last fiddled with by Bdot on 2014-10-22 at 21:04 Reason: James, less-classes
Bdot is offline   Reply With Quote
Old 2014-10-22, 21:06   #2412
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

342710 Posts
Default

Quote:
Originally Posted by Bdot View Post
Maybe you have a factor (> 260) for me that I could verify to find?
The list of exponents-with-many-factors has many entries you can pick from. I can give you a pile of more specific exponents if you're interested. Here's a tiny sample with factors slightly larger than 60 bits:
Code:
Factor=2001862367,60,61
Factor=2000098873,60,61
Factor=2004561407,60,61
Factor=2005844293,60,61
Factor=2009094883,60,61
Factor=2003270579,60,61
Factor=2006109223,60,61
Factor=2008886611,60,61
Factor=2004315961,60,61
Factor=2001388097,60,61
James Heinrich is offline   Reply With Quote
Old 2014-10-22, 21:11   #2413
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

11258 Posts
Default

Code:
got assignment: exp=2001862367 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2001862367 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  417 100.0% |  0.001   0m00s |    104.99    80181    0.00%
M2001862367 has a factor: 1153068867805081159

found 1 factor for M2001862367 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.167s (60.35 GHz-days / day)

got assignment: exp=2000098873 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2000098873 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |   75  18.8% |  0.001   0m00s |    105.08    80181    0.00%
M2000098873 has a factor: 1153427718610610551
Oct 22 23:08 |  416 100.0% |  0.001   0m00s |    105.08    80181    0.00%
found 1 factor for M2000098873 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.149s (67.70 GHz-days / day)

got assignment: exp=2004561407 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2004561407 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  232  56.3% |  0.001   0m00s |    104.85    80181    0.00%
M2004561407 has a factor: 1153386835577909609
Oct 22 23:08 |  417 100.0% |  0.001   0m00s |    104.85    80181    0.00%
found 1 factor for M2004561407 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.147s (68.47 GHz-days / day)

got assignment: exp=2005844293 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2005844293 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  116  29.2% |  0.001   0m00s |    104.78    80181    0.00%
M2005844293 has a factor: 1153405062405321977
Oct 22 23:08 |  416 100.0% |  0.001   0m00s |    104.78    80181    0.00%
found 1 factor for M2005844293 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.145s (69.37 GHz-days / day)

got assignment: exp=2009094883 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2009094883 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |   53  14.6% |  0.001   0m00s |    104.61    80181    0.00%
M2009094883 has a factor: 1153540764802817159
Oct 22 23:08 |  417 100.0% |  0.001   0m00s |    104.61    80181    0.00%
found 1 factor for M2009094883 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.146s (68.78 GHz-days / day)

got assignment: exp=2003270579 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2003270579 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  300  70.8% |  0.001   0m00s |    104.91    80181    0.00%
M2003270579 has a factor: 1153716299952772441
Oct 22 23:08 |  417 100.0% |  0.001   0m00s |    104.91    80181    0.00%
found 1 factor for M2003270579 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.146s (68.98 GHz-days / day)

got assignment: exp=2006109223 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2006109223 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  152  35.4% |  0.001   0m00s |    104.77    80181    0.00%
M2006109223 has a factor: 1153764818690030153
Oct 22 23:08 |  416 100.0% |  0.001   0m00s |    104.77    80181    0.00%
found 1 factor for M2006109223 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.145s (69.36 GHz-days / day)

got assignment: exp=2008886611 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2008886611 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  189  45.8% |  0.001   0m00s |    104.62    80181    0.00%
M2008886611 has a factor: 1153863845653352359
Oct 22 23:08 |  416 100.0% |  0.001   0m00s |    104.62    80181    0.00%
found 1 factor for M2008886611 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.148s (67.86 GHz-days / day)

got assignment: exp=2004315961 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2004315961 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  291  70.8% |  0.001   0m00s |    104.86    80181    0.00%
M2004315961 has a factor: 1154230767950158183
Oct 22 23:08 |  416 100.0% |  0.001   0m00s |    104.86    80181    0.00%
found 1 factor for M2004315961 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.146s (68.95 GHz-days / day)

got assignment: exp=2001388097 bit_min=60 bit_max=61 (0.00 GHz-days)
Starting trial factoring M2001388097 from 2^60 to 2^61 (0.00GHz-days)
Using GPU kernel "cl_barrett15_69_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Oct 22 23:08 |  163  39.6% |  0.001   0m00s |    105.01    80181    0.00%
M2001388097 has a factor: 1154536360470905183
Oct 22 23:08 |  415 100.0% |  0.001   0m00s |    105.01    80181    0.00%
found 1 factor for M2001388097 from 2^60 to 2^61 [mfakto 0.15pre4-Win cl_barrett15_69_gs_2]
tf(): total time spent:  0.146s (69.05 GHz-days / day)
Bdot is offline   Reply With Quote
Old 2014-10-22, 21:55   #2414
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11×101 Posts
Default

Hi James,

Quote:
Originally Posted by James Heinrich View Post
Does this mean GPU sieving will be available below 2^64 for large exponents, such as my own pet project of 1000M-4296M? I only need support down to 2^52 since everything has been done below that, but if GPU sieving could be enabled... that would be of tremendous benefit
Yes, will use the "slow" 75bit schoolbook division kernel in 0.21 below 264 but anyway, GPU sieving is possible and feasible.
Slowish GT 630 (GK208) @900MHz:
Code:
M2001862367 has a factor: 1153068867805081159
found 1 factor for M2001862367 from 2^40 to 2^64 [mfaktc 0.21-pre7 75bit_mul32_gs]
tf(): total time spent:  8.702s

no factor for M2001862367 from 2^64 to 2^66 [mfaktc 0.21-pre7 barrett76_mul32_gs]
tf(): total time spent: 11.213s
Oliver
TheJudger is offline   Reply With Quote
Old 2014-10-22, 22:12   #2415
Bdot
 
Bdot's Avatar
 
Nov 2010
Germany

3×199 Posts
Default

But back to Oliver's question ...
Quote:
Originally Posted by TheJudger View Post
Comments?
I'd vote for 3.

In combination with your other plan to drop CC 1.x support, there should be sufficient shared memory to at least run, maybe not at best performance. Insufficient shared memory can also be detected, and lowering GPUSieveProcessSize can counter that.

But if you go for 4C (or B), the user also has to lower GPUSievePrimes, and may run into the same trouble.
Bdot is offline   Reply With Quote
Old 2014-10-24, 15:44   #2416
Ralf Recker
 
Ralf Recker's Avatar
 
Oct 2010

191 Posts
Default

Quote:
Originally Posted by Ralf Recker View Post
I hope next time NVIDIA tests their cards and drivers on boxes with 32 GB RAM installed.

I had to use msconfig to tell windows to use only 30 GB on boot to avoid a bunch of crashes.
Seems that the 344.48 driver fixed the issue.
Ralf Recker is offline   Reply With Quote
Old 2014-10-28, 07:24   #2417
Karl M Johnson
 
Karl M Johnson's Avatar
 
Mar 2010

3·137 Posts
Default

Quote:
Originally Posted by Ralf Recker View Post
Seems that the 344.48 driver fixed the issue.
Another confirmation from me.
Karl M Johnson is offline   Reply With Quote
Old 2014-11-05, 05:26   #2418
wombatman
I moo ablest echo power!
 
wombatman's Avatar
 
May 2013

13×137 Posts
Default

How do I enable less classes? I can compile for Windows 7 64-bit with no issue. Just wasn't clear what I needed to enable/disable to get that and try it out. Thanks!
wombatman is offline   Reply With Quote
Old 2014-11-05, 05:39   #2419
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2·5·293 Posts
Default

Quote:
Originally Posted by wombatman View Post
How do I enable less classes? I can compile for Windows 7 64-bit with no issue. Just wasn't clear what I needed to enable/disable to get that and try it out. Thanks!
Remove
Code:
#define MORE_CLASSES
in params.h and recompile.
Mark Rose is offline   Reply With Quote
Old 2014-11-05, 13:18   #2420
wombatman
I moo ablest echo power!
 
wombatman's Avatar
 
May 2013

6F516 Posts
Default

Much appreciated!
wombatman is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
The P-1 factoring CUDA program firejuggler GPU Computing 753 2020-12-12 18:07
gr-mfaktc: a CUDA program for generalized repunits prefactoring MrRepunit GPU Computing 32 2020-11-11 19:56
mfaktc 0.21 - CUDA runtime wrong keisentraut Software 2 2020-08-18 07:03
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51

All times are UTC. The time now is 04:41.


Fri Aug 6 04:41:51 UTC 2021 up 13 days, 23:10, 1 user, load averages: 1.99, 2.51, 3.50

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.