mersenneforum.org  

2013-05-07, 14:35   #232
owftheevil
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
100111011₂ Posts

Quote:
Originally Posted by Stef42
I've run this exponent up to iteration 50,000 for you.
https://dl.dropboxusercontent.com/u/...m1%2050000.txt

I'm sorry, but my memory was wrong. It was iteration 450000 where the residues began to diverge. Yours match both of ours up to 50000. The reason that's important is that I'm at work now and have no access to my results, but I do have access to his. Thanks for the input.
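
For anyone who would rather not compare the logs by eye, here is a minimal Python sketch that finds the first iteration at which two runs disagree. It assumes both logs contain progress lines in the format CUDAPm1 v0.10 prints elsewhere in this thread (`Iteration N M<exponent>, 0x<residue>, ...`); the function names are made up for illustration, and the comparison only makes sense when both runs used the same bounds.

```python
import re

# Matches progress lines such as:
#   Iteration 764000 M61394569, 0xa524c6ae8ad4a231, n = 3360K, ...
LINE_RE = re.compile(r"Iteration (\d+) M\d+, 0x([0-9a-fA-F]{16})")


def parse_residues(log_text):
    """Map iteration number -> 64-bit residue as a lowercase hex string."""
    return {int(m.group(1)): m.group(2).lower()
            for m in LINE_RE.finditer(log_text)}


def first_divergence(log_a, log_b):
    """Return the first iteration present in both logs whose residues differ.

    Returns None if every shared iteration has matching residues.
    """
    a, b = parse_residues(log_a), parse_residues(log_b)
    for iteration in sorted(a.keys() & b.keys()):
        if a[iteration] != b[iteration]:
            return iteration
    return None
```

Feed it the full text of two log files; a result of None means the runs agree at every iteration they have in common.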

Last fiddled with by owftheevil on 2013-05-07 at 14:37
2013-05-07, 14:55   #233
Stef42
Feb 2012
the Netherlands
72₈ Posts

Quote:
Originally Posted by owftheevil
I'm sorry, but my memory was wrong. It was iteration 450000 where the residues began to diverge. Yours match both of ours up to 50000. The reason that's important is that I'm at work now and have no access to my results, but I do have access to his. Thanks for the input.

Then I'll see if I can get it up to 450000 in a few hours...
2013-05-07, 21:02   #234
Aramis Wyler
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA
2³×53 Posts

Quote:
Originally Posted by owftheevil
As to the cudaDeviceSynchronize errors people are seeing, I'm almost convinced it is an Nvidia driver bug. On Linux, I'm getting something similar, only it's a timeout error (error 6) instead of an unidentified error.

I thought the fact that it only happened (for me at least) when the cpu was occupied was important, considering chalsall's problems cropping up only when the cpu was running as well. Also, if the purpose of the call relates directly to the cpu (I believe it was described as keeping the cpu from running a loop while waiting?), the inverse situation, where the cpu is already too busy to run a wait loop anyway, might be relevant. The 580 is my main display card, though. I'll try running the program on the 480 and see what it does. It has 1.5 GB of memory, which I hope is enough.

EDIT: The quote I was trying to remember:
Quote:
Originally Posted by owftheevil
The different kernels run synchronously; the cutilSafeThreadSync call is there so the cpu doesn't do busy waiting and eat up an entire cpu core.

I had the wrong function call in mind.

Last fiddled with by Aramis Wyler on 2013-05-07 at 21:07
2013-05-07, 21:59   #235
owftheevil
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
473₈ Posts

My run found the factor:

Code:
Iteration 764000 M61394569, 0xa524c6ae8ad4a231, n = 3360K, CUDAPm1 v0.10 err = 0.21777 (0:07 real, 6.9230 ms/iter, ETA 0:06)
M61394569, 0x30c664a860055a8f, n = 3360K, CUDAPm1 v0.10
Stage 1 complete, estimated total time = 1:28:32
.
.
.
Accumulated Product: M61394569, 0x80e4aa01c3bb4d17, n = 3360K, CUDAPm1 v0.10
Starting stage 2 gcd.
M61394569 has a factor: 189843460261039170580823 (P-1, B1=530000, B2=12985000, e=6, n=3360K CUDAPm1 v0.10)
@ Aramis Wyler: I think the error coming at high cpu load and not at low cpu load is coincidence, although I'm not ruling anything out yet. That's the reason I want to do some runs with the explicit and implicit host synchronizations removed.
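
A reported factor like the one above is cheap to double-check independently. Here is a minimal Python sketch (function names are my own, not part of CUDAPm1): f divides 2^p - 1 exactly when 2^p leaves remainder 1 modulo f, and for an odd prime exponent p every factor must also satisfy f ≡ 1 (mod 2p) and f ≡ ±1 (mod 8).

```python
def divides_mersenne(p, f):
    """True if f divides 2**p - 1, checked with modular exponentiation."""
    return pow(2, p, f) == 1


def looks_like_mersenne_factor(p, f):
    """Cheap necessary conditions for a factor of 2**p - 1 (p an odd prime):
    f = 1 (mod 2p) and f = +-1 (mod 8). Passing does not prove divisibility.
    """
    return f % (2 * p) == 1 and f % 8 in (1, 7)
```

For the result line above, the call to make would be `divides_mersenne(61394569, 189843460261039170580823)`; the three-argument `pow` keeps every intermediate value smaller than f, so it returns instantly.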

Last fiddled with by owftheevil on 2013-05-07 at 22:56 Reason: include the penultimate and final residues of stage 1
2013-05-07, 23:35   #236
frmky
Jul 2003
So Cal
2²×23² Posts

Here are a couple more that can be used for testing. I found these overnight.

Code:
M61747963 has a factor: 13383883517343994527281 (P-1, B1=610000, B2=610000, e=6, n=3584K CUDAPm1 v0.10)
M61829329 has a factor: 894781313041001886421561 (P-1, B1=615000, B2=16912500, e=6, n=3584K CUDAPm1 v0.10)
The first can be found in stage 1 with B1 = 3750. The second can be found with B1 = 750, B2 = 2750000.
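
The small bounds quoted above follow from how smooth k is, where f = 2kp + 1. Here is a rough Python sketch for working out the smallest bounds that would find a known factor. It assumes the usual P-1 model (stage 1 covers every prime power up to B1, stage 2 adds a single prime between B1 and B2) and uses plain trial division, so it is only practical when k is smooth; the function names are hypothetical.

```python
def factorize(n):
    """Trial division; returns {prime: exponent}. Slow unless n is smooth."""
    factors = {}
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1 if d == 2 else 2
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def minimal_bounds(p, f):
    """Return (B1 for stage 1 alone, B1 when stage 2 is used, B2).

    Assumes f = 2*k*p + 1 divides 2**p - 1 and that the factor 2*p is
    included for free, so only the prime powers of k matter.
    """
    if (f - 1) % (2 * p) != 0:
        raise ValueError("f is not of the form 2*k*p + 1")
    fac = factorize((f - 1) // (2 * p))
    if not fac:                       # k == 1, nothing else is needed
        return 1, 1, 1
    powers = {q: q ** e for q, e in fac.items()}
    stage1_b1 = max(powers.values())
    largest = max(fac)
    rest = [v for q, v in powers.items() if q != largest]
    b1 = max(rest) if rest else 1
    if fac[largest] > 1 or b1 >= largest:
        # Stage 2 cannot help: the largest prime is repeated or already
        # lies below the B1 the other prime powers require.
        return stage1_b1, stage1_b1, stage1_b1
    return stage1_b1, b1, largest
```

Calling `minimal_bounds(61747963, 13383883517343994527281)` and `minimal_bounds(61829329, 894781313041001886421561)` gives values to compare against the bounds quoted above; since those look rounded, the computed minimums would be expected to come out at or below them.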
2013-05-08, 00:06   #237
chalsall
If I May
"Chris Halsall"
Sep 2002
Barbados
9,767 Posts

Quote:
Originally Posted by owftheevil
@ Aramis Wyler: I think the error coming at high cpu load and not at low cpu load is coincidence, although I'm not ruling anything out yet. That's the reason I want to do some runs with the explicit and implicit host synchronizations removed.

Indeed. My card appears to be the worst possible case for debugging -- it seems to be just unstable.

I have spent days running tests in different situations -- heavy CPU load, no CPU load. Low ambient temperatures, high ambient temperatures. Low vRAM usage, high vRAM usage. And all the various combinations of the above.

I can find no correlation to the errors.

While it's still possible that there is a software bug somewhere in the stack, the evidence seems to suggest I have a bad card.

Last fiddled with by chalsall on 2013-05-08 at 00:07 Reason: s/spend/spent/
2013-05-08, 00:39   #238
c10ck3r
Aug 2010
Kansas
547 Posts

Quote:
Originally Posted by frmky
New versions ...
Win32:
https://www.dropbox.com/s/alz4xodjje...2_20130503.zip
As usual, please let me know of problems.

So, what exactly do I have to do after downloading? The .exe doesn't seem to work as-is for me...
2013-05-08, 11:52   #239
Aramis Wyler
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA
2³·53 Posts

I ran the same worktodo as before on my secondary card (a 480) to avoid the conditions mentioned earlier (used for display, etc). It didn't find a factor for M61262347, B1 = 605000, B2 = 16637500, e = 6, fft length = 3360K. It crashed trying to do M61394569, B1 = 605000, B2 = 16637500, e = 6, fft length = 3360K. Even more unfortunate for me, it used different bounds than my 580 did, so the residues are completely different. I'll have to run it again and specify the bounds. The error on crash:

Iteration 587000 M61394569, 0x689e7131d4d15b81, n = 3360K, CUDAPm1 v0.10 err = 0.20313 (0:07 real, 6.7823 ms/iter, ETA 32:20)
Iteration = 587400, err = 0.46094 >= 0.43, quitting.
Estimated time spent so far: 1:06:31
C:/Users/childers/Dropbox/NFS/cudapm1/build/cudapm1-code-21/cudapm1-code-21/trunk/CUDAPm1.cu(1362) : cudaSafeCall() Runtime API error 17: invalid device pointer.

Though it looks to me like it didn't really crash, it quit because of a rounding error... and possibly then crashed. Full output is here.
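
For readers wondering what that "err" figure measures: in FFT-based multiplication every output should be an integer, and the round-off error is how far the floating-point results stray from one. Here is a minimal Python sketch of that kind of check, using the 0.43 threshold from the log above. This illustrates the general idea only; it is not CUDAPm1's actual code, and the function names are invented.

```python
def max_roundoff(values):
    """Largest distance of any value from its nearest integer (at most 0.5)."""
    return max(abs(v - round(v)) for v in values)


def check_roundoff(values, threshold=0.43):
    """Return the values rounded to integers, or raise if rounding is unsafe.

    Once the error gets close to 0.5 there is no telling which neighbouring
    integer was the intended one, so the only safe reaction is to stop.
    """
    err = max_roundoff(values)
    if err >= threshold:
        raise ArithmeticError(f"err = {err:.5f} >= {threshold}, quitting.")
    return [round(v) for v in values]
```

An error of 0.20313, as in the last good iteration of the log, passes comfortably; 0.46094 does not.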
2013-05-08, 23:13   #240
owftheevil
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
3²·5·7 Posts

Yeah, if it quits during stage 1, I still have it trying to free stage 2 device memory, which hasn't been allocated yet. Fix coming soon.
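
The shape of that bug and of the obvious guard can be modelled in a few lines of Python. Everything here is a hypothetical stand-in (a fake allocator, made-up names), not the CUDAPm1 source or the actual fix: the point is only that cleanup should free a buffer if, and only if, it was allocated.

```python
class FakeDevice:
    """Stand-in for a GPU allocator: freeing an unknown pointer is an error."""

    def __init__(self):
        self._live = set()
        self._next = 1

    def malloc(self):
        ptr = self._next
        self._next += 1
        self._live.add(ptr)
        return ptr

    def free(self, ptr):
        if ptr not in self._live:
            raise RuntimeError("invalid device pointer")
        self._live.remove(ptr)


def run(device, fail_in_stage1):
    """Run both stages, releasing only the buffers that were allocated."""
    stage1_buf = device.malloc()
    stage2_buf = None                # not allocated until stage 2 starts
    try:
        if fail_in_stage1:
            return "quit in stage 1"
        stage2_buf = device.malloc()
        return "finished stage 2"
    finally:
        device.free(stage1_buf)
        if stage2_buf is not None:   # the guard: skip what never existed
            device.free(stage2_buf)
```

Without the `is not None` check, an early exit from stage 1 would hand the allocator a pointer it never issued, which is the kind of thing the "invalid device pointer" message above complains about.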
2013-05-10, 01:45   #241
c10ck3r
Aug 2010
Kansas
547 Posts

Quote:
Originally Posted by c10ck3r
So, what exactly do I have to do after downloading? The .exe doesn't seem to work as-is for me...

BUMP
U
M
P
!?!?!
Thanks!
2013-05-10, 04:28   #242
Karl M Johnson
Mar 2010
3·137 Posts

1. Go to this page: http://www.mersenne.org/manual_assignment/
2. Select P-1 factoring.
3. Put whatever the server gives you into the worktodo.txt file, which is located in CUDAPm1's folder.
4. Create a batch file, say, run.bat, right click on it and select edit.
5. Paste the following there:
Code:
CUDAPm1
pause
6. Save it, then open CUDAPm1.ini, tweak your settings, and save them.
7. Run the batch file. It will process one assignment at a time from the worktodo.txt file.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.