mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-05-02, 11:59   #144
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

D5D16 Posts
Default

Quote:
Originally Posted by frmky View Post
Here's a Windows version to try
Included is a worktodo.txt that should find a factor.
please try that first, make sure the factor is found, and please let me know if it worked!
Starting a run on my GTX 570. One thing that might be a concern (especially in light of Stef42's comment about stage2 memory) is the references to "zu" as a quantity of graphics memory:
Quote:
Selected B1=605000, B2=16637500, 4.1% chance of finding a factor
CUDA reports zuM of zuM GPU memory free.
Using e=6, d=2310, nrp=16
Using approximately zuM GPU memory.
Starting stage 1 P-1, M61262347, B1 = 605000, B2 = 16637500, e = 6, fft length =
3360K
Doing 873133 iterations
Running at 7.0ms/it, stage1 should be done in 1h40m and I'll report back with what happens when stage2 starts.
James Heinrich is offline   Reply With Quote
Old 2013-05-02, 12:21   #145
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

The zu is not a big deal, simply a size specifier specific to gcc. For windows we need Iu instead.
owftheevil is offline   Reply With Quote
Old 2013-05-02, 12:22   #146
Stef42
 
Feb 2012
the Netherlands

2·29 Posts
Default

Right after stage 1 finished and stage 2 was initiated, I got a popup saying that CUDAPm1 crashed.
The Windows error log showed an APPCRASH, which is not very useful I think.

When followed was that after the gpu load dropped from 99% to 0%,
the memory remained at 1134MB usage until, I guess because of a time-out, was flushed.

Maybe I'll try a smaller P-1 exponent with a factor found to check.

Last fiddled with by Stef42 on 2013-05-02 at 12:28
Stef42 is offline   Reply With Quote
Old 2013-05-02, 13:14   #147
Karl M Johnson
 
Karl M Johnson's Avatar
 
Mar 2010

3·137 Posts
Default

Feedback so far:
1. Does not create checkpoints.
2. Beats CUDALucas in memory stability stress testing(60M exponents were free from errors on CL, found errors at 50K iterations on CP+1)
3. Fails at the beginning of stage 2 with out of memory error, should not be the case (6GB of vRAM, 16GB of RAM).
Karl M Johnson is offline   Reply With Quote
Old 2013-05-02, 13:56   #148
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

65358 Posts
Default

Quote:
Originally Posted by owftheevil View Post
The zu is not a big deal, simply a size specifier specific to gcc. For windows we need Iu instead.
Would it be a big deal that we're seeing "zu" instead of "Iu" on Windows?
James Heinrich is offline   Reply With Quote
Old 2013-05-02, 14:06   #149
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

I think I found one problem. chalsall has his SPEs. Well, this was an ISPE, I for Ineffably. Its an easy fix, but will have to wait until I get home from work. In the meantime, running with b2 = even multiple of 2310 should bypass the error.

@Karl M Johnson:

1. Checkpoints are coming soon, maybe this weekend.
2. CPm1 during stage 1 does do more global memory reads than CuLu, so maybe thats why.
3. Unexpected. What is the error message?

Thank you all for your input.
owftheevil is offline   Reply With Quote
Old 2013-05-02, 14:09   #150
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
Would it be a big deal that we're seeing "zu" instead of "Iu" on Windows?
%zu in printf prints size_t variable values, you need %Iu in windows to do the same thing.
owftheevil is offline   Reply With Quote
Old 2013-05-02, 14:10   #151
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

11·311 Posts
Default

Ah, now I understand what you mean.
James Heinrich is offline   Reply With Quote
Old 2013-05-02, 14:26   #152
Stef42
 
Feb 2012
the Netherlands

2·29 Posts
Default

I restarted CUDAPm1 using a b2 value, of a known P-1 with a factor found by me earlier.

This was the command-line:
Code:
cudapm1.exe -b2 550000
Output:
Code:
Iteration 164000 M9090017, 0xd7661b0c859fa9e5, n = 512K, CUDAPm1 v0.00 err = 0.0
2734 (0:01 real, 0.7921 ms/iter, ETA 0:01)
Iteration 165000 M9090017, 0x7d3f99a08f445b8b, n = 512K, CUDAPm1 v0.00 err = 0.0
2734 (0:01 real, 0.7878 ms/iter, ETA 0:00)
M9090017, 0x1d50507696eeef9f, offset = 0, n = 512K, CUDAPm1 v0.00
Stage 1 complete, estimated total time = 2:14
Starting stage 1 gcd.
M9090017 Stage 1 found no factor (P-1, B1=115000, B2=1495000, e=6, n=512K CUDAPm
1 v0.00)
Starting stage 2.
Zeros: 59077, Ones: 84923, Pairs: 18379
itime: 14.921770, transforms: 1, average: 14921.770000
ptime: 35.394836, transforms: 88612, average: 0.399436
ETA: 0:50
itime: 17.911887, transforms: 1, average: 17911.887000
ptime: 35.547328, transforms: 88434, average: 0.401964
ETA: 0:00
Stage 2 complete, estimated total time = 1:43
Accumulated Product: M9090017, 0x1a6840caa5d05db3, n = 512K, CUDAPm1 v0.00
Starting stage 2 gcd.
M9090017 has a factor: 516770062491225473521 (P-1, B1=115000, B2=1495000, e=6, n
=512K CUDAPm1 v0.00)
As you can see, there is a different B2 value. Still, it finished well.
Earlier on, the program would crash when starting stage 2. Any thoughts? I must have done something wrong

Bit more surprising: according to mersenne.ca,
in the past the factor was found in stage 1 using prime95, but CudaPm1 reports stage 2 in the output... ?
Exponent 9090017
Stef42 is offline   Reply With Quote
Old 2013-05-02, 14:45   #153
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

65358 Posts
Default

Quote:
Originally Posted by Stef42 View Post
M9090017 Stage 1 found no factor (P-1, B1=115000, B2=1495000, e=6, n=512K CUDAPm
1 v0.00)

Bit more surprising: according to mersenne.ca,
in the past the factor was found in stage 1 using prime95, but CudaPm1 reports stage 2 in the output... ?
That is a bit disturbing.

M9090017 has factor 516770062491225473521, with a k of 28425142796280
k-factored = 23 × 3 × 5 × 61 × 97 × 389 × 102913
minimal bounds to find this factor in stage2 would be B1=389,B2=102913
minimal bounds to find this factor in stage1 would be B1=102913

You ran this with B1=115000 so it should have found the factor, at least according to my understand of P-1
James Heinrich is offline   Reply With Quote
Old 2013-05-02, 14:48   #154
Stef42
 
Feb 2012
the Netherlands

2×29 Posts
Default

It can still find it, although I wonder, as you mentioned, in stage 2 rather than stage 1....
I will do some further testing on a different exponent.

Did the same test again on prime95 to verify:
Code:
[May 2 16:52] Worker starting
[May 2 16:52] Setting affinity to run worker on any logical CPU.
[May 2 16:52] P-1 on M9090017 with B1=110000
[May 2 16:54] M9090017 stage 1 complete. 317502 transforms. Time: 128.915 sec.
[May 2 16:54] Stage 1 GCD complete. Time: 6.593 sec.
[May 2 16:54] P-1 found a factor in stage #1, B1=110000.
[May 2 16:54] M9090017 has a factor: 516770062491225473521
[May 2 16:54] No work to do at the present time.  Waiting.

Last fiddled with by Stef42 on 2013-05-02 at 14:56
Stef42 is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
mfaktc: a CUDA program for Mersenne prefactoring TheJudger GPU Computing 3497 2021-06-05 12:27
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51
World's dumbest CUDA program? xilman Programming 1 2009-11-16 10:26
Factoring program need help Citrix Lone Mersenne Hunters 8 2005-09-16 02:31
Factoring program ET_ Programming 3 2003-11-25 02:57

All times are UTC. The time now is 07:25.


Mon Aug 2 07:25:37 UTC 2021 up 10 days, 1:54, 0 users, load averages: 0.86, 1.10, 1.38

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.