mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   The P-1 factoring CUDA program (https://www.mersenneforum.org/showthread.php?t=17835)

James Heinrich 2013-05-02 11:59

[QUOTE=frmky;338990]Here's a Windows version to try
Included is a worktodo.txt that should find a factor.
please try that first, make sure the factor is found, and please let me know if it worked![/QUOTE]Starting a run on my GTX 570. One thing that might be a concern (especially in light of [i]Stef42[/i]'s comment about stage2 memory) is the references to "zu" as a quantity of graphics memory:[quote]Selected B1=605000, B2=16637500, 4.1% chance of finding a factor
CUDA reports [color=red]zu[/color]M of [color=red]zu[/color]M GPU memory free.
Using e=6, d=2310, nrp=16
Using approximately [color=red]zu[/color]M GPU memory.
Starting stage 1 P-1, M61262347, B1 = 605000, B2 = 16637500, e = 6, fft length =
3360K
Doing 873133 iterations[/quote]Running at 7.0ms/it, stage1 should be done in 1h40m and I'll report back with what happens when stage2 starts.

owftheevil 2013-05-02 12:21

The zu is not a big deal, simply a size specifier specific to gcc. For windows we need Iu instead.

Stef42 2013-05-02 12:22

Right after stage 1 finished and stage 2 was initiated, I got a popup saying that CUDAPm1 crashed.
The Windows error log showed an APPCRASH, which is not very useful I think.

When followed was that after the gpu load dropped from 99% to 0%,
the memory remained at 1134MB usage until, I guess because of a time-out, was flushed.

Maybe I'll try a smaller P-1 exponent with a factor found to check.

Karl M Johnson 2013-05-02 13:14

Feedback so far:
1. Does not create checkpoints.
2. Beats CUDALucas in memory stability stress testing(60M exponents were free from errors on CL, found errors at 50K iterations on CP+1)
3. Fails at the beginning of stage 2 with out of memory error, should not be the case (6GB of vRAM, 16GB of RAM).

James Heinrich 2013-05-02 13:56

[QUOTE=owftheevil;339001]The zu is not a big deal, simply a size specifier specific to gcc. For windows we need Iu instead.[/QUOTE]Would it be a big deal that we're seeing "zu" instead of "Iu" on Windows?

owftheevil 2013-05-02 14:06

I think I found one problem. chalsall has his SPEs. Well, this was an ISPE, I for Ineffably. Its an easy fix, but will have to wait until I get home from work. In the meantime, running with b2 = even multiple of 2310 should bypass the error.

@Karl M Johnson:

1. Checkpoints are coming soon, maybe this weekend.
2. CPm1 during stage 1 does do more global memory reads than CuLu, so maybe thats why.
3. Unexpected. What is the error message?

Thank you all for your input.

owftheevil 2013-05-02 14:09

[QUOTE=James Heinrich;339008]Would it be a big deal that we're seeing "zu" instead of "Iu" on Windows?[/QUOTE]

%zu in printf prints size_t variable values, you need %Iu in windows to do the same thing.

James Heinrich 2013-05-02 14:10

Ah, now I understand what you mean.

Stef42 2013-05-02 14:26

I restarted CUDAPm1 using a b2 value, of a known P-1 with a factor found by me earlier.

This was the command-line:
[CODE]cudapm1.exe -b2 550000[/CODE]

Output:
[CODE]Iteration 164000 M9090017, 0xd7661b0c859fa9e5, n = 512K, CUDAPm1 v0.00 err = 0.0
2734 (0:01 real, 0.7921 ms/iter, ETA 0:01)
Iteration 165000 M9090017, 0x7d3f99a08f445b8b, n = 512K, CUDAPm1 v0.00 err = 0.0
2734 (0:01 real, 0.7878 ms/iter, ETA 0:00)
M9090017, 0x1d50507696eeef9f, offset = 0, n = 512K, CUDAPm1 v0.00
Stage 1 complete, estimated total time = 2:14
Starting stage 1 gcd.
M9090017 Stage 1 found no factor (P-1, B1=115000, B2=[COLOR="Red"]1495000[/COLOR], e=6, n=512K CUDAPm
1 v0.00)
Starting stage 2.
Zeros: 59077, Ones: 84923, Pairs: 18379
itime: 14.921770, transforms: 1, average: 14921.770000
ptime: 35.394836, transforms: 88612, average: 0.399436
ETA: 0:50
itime: 17.911887, transforms: 1, average: 17911.887000
ptime: 35.547328, transforms: 88434, average: 0.401964
ETA: 0:00
Stage 2 complete, estimated total time = 1:43
Accumulated Product: M9090017, 0x1a6840caa5d05db3, n = 512K, CUDAPm1 v0.00
Starting stage 2 gcd.
M9090017 has a factor: 516770062491225473521 (P-1, B1=115000, B2=1495000, e=6, n
=512K CUDAPm1 v0.00)[/CODE]

As you can see, there is a different B2 value. Still, it finished well.
Earlier on, the program would crash when starting stage 2. Any thoughts? I must have done something wrong :smile:

Bit more surprising: according to mersenne.ca,
in the past the factor was found in stage 1 using prime95, but CudaPm1 reports stage 2 in the output... ?
Exponent [URL="http://www.mersenne.ca/exponent/9090017#"]9090017[/URL]

James Heinrich 2013-05-02 14:45

[QUOTE=Stef42;339012][i]M9090017 Stage 1 found no factor (P-1, [b]B1=115000, B2=1495000[/b], e=6, n=512K CUDAPm
1 v0.00)[/i]
Bit more surprising: according to mersenne.ca,
in the past the factor was found in stage 1 using prime95, but CudaPm1 reports stage 2 in the output... ?[/QUOTE]That is a bit disturbing.

[URL=http://www.mersenne.ca/exponent/9090017]M9090017[/url] has factor [url=http://www.mersenne.ca/factor/516770062491225473521]516770062491225473521[/url], with a k of [url=http://www.mersenne.ca/k/28425142796280]28425142796280[/url]
k-factored = 2[sup]3[/sup] × 3 × 5 × 61 × 97 × 389 × 102913
minimal bounds to find this factor in stage2 would be B1=389,B2=102913
minimal bounds to find this factor in stage1 would be B1=102913

You ran this with B1=115000 so it [i]should[/i] have found the factor, at least according to my understand of P-1 :unsure:

Stef42 2013-05-02 14:48

It can still find it, although I wonder, as you mentioned, in stage 2 rather than stage 1....
I will do some further testing on a different exponent.

Did the same test again on prime95 to verify:
[CODE][May 2 16:52] Worker starting
[May 2 16:52] Setting affinity to run worker on any logical CPU.
[May 2 16:52] P-1 on M9090017 with B1=110000
[May 2 16:54] M9090017 stage 1 complete. 317502 transforms. Time: 128.915 sec.
[May 2 16:54] Stage 1 GCD complete. Time: 6.593 sec.
[May 2 16:54] P-1 found a factor in stage #1, B1=110000.
[May 2 16:54] M9090017 has a factor: 516770062491225473521
[May 2 16:54] No work to do at the present time. Waiting.
[/CODE]


All times are UTC. The time now is 23:19.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.