|
|
#265 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
100111011₂ Posts |
No, too big an FFT will cause errors too. I think it has to do with how far the carries get propagated.
|
|
|
|
|
|
#266 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
3²×5×7 Posts |
Stage 1 save files are now implemented. It's not very polite, in that it doesn't clean these up when it's done. Some of you will want to keep them for extending B1 later. I'm starting work on stage 2 save files and will figure out the cleanup when that's ready.
|
|
|
|
|
|
#267 |
|
Aug 2010
Kansas
547 Posts |
Do you have a compiled 32-bit Windows version of this available?
|
|
|
|
|
|
#268 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
13B₁₆ Posts |
Not yet. frmky has been doing the Windows builds. I don't know when he will have time to get to it.
|
|
|
|
|
|
#269 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
100111011₂ Posts |
Just wanted to mention that without frmky's help none of this would be available until later this summer or maybe even fall.
|
|
|
|
|
|
#270 |
|
Jul 2003
So Cal
4104₈ Posts |
Windows binaries with latest changes, untested as usual.
Win32: https://www.dropbox.com/s/ecwuwbezul...2_20130520.zip
x64: https://www.dropbox.com/s/ik1g9eza96...4_20130520.zip |
|
|
|
|
|
#271 |
|
Feb 2012
the Netherlands
58₁₀ Posts |
Thank you very much!
|
|
|
|
|
|
#272 |
|
Mar 2010
633₈ Posts |
The latest and greatest 64-bit binary works here.
Stopped and resumed a couple of times during stage 1; here are the end results on the new WHQL ForceWare: Code:
Accumulated product stage 1: M63137587, 0x1f2595c1236f31dc, n = 3456K, CUDAPm1 v0.10
Accumulated product stage 2: M63137587, 0x412ca727e7d21026, n = 3456K, CUDAPm1 v0.10 |
|
|
|
|
|
#273 |
|
Romulan Interpreter
Jun 2011
Thailand
22666₈ Posts |
I'm having trouble with CUDAPm1. When I use "-b1 3100000" on the command line it works, but it spends a long time in the CPU routine that computes the product. A PARI line, "n=3*10^6; lgn=log(n); z=prod(x=1,n,if(isprime(x),x^floor(lgn/log(x)),1)); ceil(log(z)/log(2))", returns in about the same time, against all logic and reason (PARI should be much slower!).

But this is not the main problem. All values between 3200000 and 20M are parsed wrong: it says "B1 need to be at least 1" and runs a test with B1=1 and B2=393xxx or so, which does find a factor, if one exists for these values. I am not sure whether smaller values starting with 1 (like -b1 150000) are parsed wrong too. When I use a -b1 value over 20M, it is parsed right (but it never returns from the CPU multiplication routine, not even after half an hour).

So, what are the restrictions for B1? Or are there any restrictions at all, and I am just doing something completely silly? (I would like to run "CUDAPm1 160403 -b1 12000000 -b2 12000000", for example... The maximum value I can use is around 3.1M, which is not enough, as the former one is 10M. And that is totally ignoring the fact that it wants B2 to be 13 times higher than B1, which is total nonsense for these numbers.)

Also, how can we "extend" a former B1? I tried the test cases:
CUDAPm1 58610467 -b1 70843 -b2 694201
and
CUDAPm1 58610467 -b1 694201 -b2 694201
They both find the factor [edit: the first one in stage 2, the second one in stage 1, as is normal] if started from scratch (deleting the checkpoint file in between). But now, assuming I have done a run with the first, I want the second run to continue from where B1 left off. This is not possible, as the former B1 is recorded in the file, and if I leave the file there, it totally ignores my command line: it says "found limits in the file" and only runs stage 2. If I delete the file, it obviously starts from scratch, duplicating most of the work. This is not what was intended when we talked about "extending B1". OTOH, resuming stage 1 works very nicely, and I believe it is only a matter of ignoring the former B1 stored in the file (I did not look into the sources, however, and for the record, I use the Win7 64-bit binaries).

Question: why are you doing that whole product at the beginning? You could do an exponentiation for every prime; this would make it easy to "extend" the B1 limit, and you would not need to stress "only" the CPU (the GPU is idle during this time, for minutes, depending on how big B1 is). Code:
>CUDAPm1 630893 -b1 3100000
mkdir: cannot create directory `savefiles': File exists
CUDA reports 1306M of 1535M GPU memory free.
Using e=6, d=2310, nrp=480
Using approximately 155M GPU memory.
B2 should be at least 390390, increasing it.
B2 should be at least 40300000, increasing it.
<<<< here it stays about 2 minutes, GPU is idle, CPU hard computing the product, then everything continues normally.
Starting stage 1 P-1, M630893, B1 = 3100000, B2 = 40300000, e = 6, fft length = 40K
Doing 4471985 iterations
Iteration 10000 M630893, 0x280b630169a8b5f7, n = 40K, CUDAPm1 v0.10 err = 0.00049 (0:17 real, 1.6675 ms/iter, ETA 2:04:00)
Iteration 20000 M630893, 0xfb3b1f4975308539, n = 40K, CUDAPm1 v0.10 err = 0.00046 (0:01 real, 0.1044 ms/iter, ETA 7:44)
Iteration 30000 M630893, 0xc90545f20507538b, n = 40K, CUDAPm1 v0.10 err = 0.00046 (0:01 real, 0.1039 ms/iter, ETA 7:41)
Iteration 40000 M630893, 0x3ff1f732d6ebab86, n = 40K, CUDAPm1 v0.10 err = 0.00046 (0:01 real, 0.1041 ms/iter, ETA 7:41)
Last fiddled with by LaurV on 2013-05-29 at 20:00 |
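For readers following the "whole product versus one exponentiation per prime" question, here is a minimal Python sketch of the two approaches on a small number. It is not CUDAPm1's code: every function name below is hypothetical, the base 3 is an assumption, and the extra 2·exponent factor that P-1 on Mersenne numbers normally adds to the stage 1 exponent is left out. It only illustrates why a residue saved at a smaller B1 can, in principle, be extended to a larger B1 without redoing the earlier work.

```python
from math import gcd, isqrt


def primes_up_to(n):
    """Sieve of Eratosthenes: all primes <= n."""
    if n < 2:
        return []
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(n) + 1):
        if sieve[i]:
            # Clear every multiple of i starting at i*i.
            sieve[i * i :: i] = bytes(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if sieve[i]]


def max_power(p, bound):
    """Largest power of p that does not exceed bound (assumes p <= bound)."""
    q = p
    while q * p <= bound:
        q *= p
    return q


def stage1_exponent(b1):
    """The 'whole product': product of the largest prime powers <= b1."""
    e = 1
    for p in primes_up_to(b1):
        e *= max_power(p, b1)
    return e


def stage1_from_scratch(n, b1, base=3):
    """Build the full exponent first, then do one big modular power."""
    return pow(base, stage1_exponent(b1), n)


def stage1_prime_by_prime(n, b1, base=3, residue=None, old_b1=1):
    """One modular power per prime.

    If `residue` is the stage 1 result for `old_b1`, only the missing
    part of each prime power is applied, which extends B1 in place.
    """
    x = base if residue is None else residue
    for p in primes_up_to(b1):
        new = max_power(p, b1)
        old = max_power(p, old_b1) if p <= old_b1 else 1
        x = pow(x, new // old, n)
    return x


def try_factor(n, residue):
    """Standard P-1 check: gcd(residue - 1, n)."""
    return gcd(residue - 1, n)
```

A usage sketch, assuming the small test number 2^67 - 1: `part = stage1_prime_by_prime(n, 100)` followed by `stage1_prime_by_prime(n, 3000, residue=part, old_b1=100)` gives the same residue as `stage1_from_scratch(n, 3000)`. Separately, `stage1_exponent(b1).bit_length()` is the integer-arithmetic counterpart of the bit count that the PARI line above computes.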
|
|
|
|
|
#274 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
3²×5×7 Posts |
LaurV, thanks for your input. I'll have time for a more complete response in about an hour, but for now I'll just say that most of what you are talking about hasn't been implemented yet, or hasn't been cleaned up yet. I was unaware of any problems parsing B1; I'll take a look as soon as I have time.
|
|
|
|
|
|
#275 | |||||
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
13B₁₆ Posts |
Quote:
Quote:
Quote:
Quote:
Quote:
|
|||||
|
|
|