mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-03-02, 17:06   #45
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

And I thank you for yours.
owftheevil is offline   Reply With Quote
Old 2013-03-02, 17:28   #46
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

254708 Posts
Default

Quote:
Originally Posted by owftheevil View Post
And I thank you for yours.
Thank you.
chalsall is offline   Reply With Quote
Old 2013-03-02, 17:47   #47
xilman
Bamboozled!
 
xilman's Avatar
 
"๐’‰บ๐’ŒŒ๐’‡ท๐’†ท๐’€ญ"
May 2003
Down not across

265778 Posts
Default

Quote:
Originally Posted by kracker View Post
Now give us your half-first born child. Naow!
Top half or bottom half?
xilman is offline   Reply With Quote
Old 2013-03-02, 18:49   #48
ET_
Banned
 
ET_'s Avatar
 
"Luigi"
Aug 2002
Team Italia

3·1,619 Posts
Default

Quote:
Originally Posted by xilman View Post
Top half or bottom half?
Axial symmetry?

Luigi
ET_ is offline   Reply With Quote
Old 2013-03-02, 21:56   #49
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

Here's the code if anyone wants to play with it. It builds without problems on Ubuntu 12.04 with either Cuda4.2 or Cuda5.0 and gmp5.1.0. Its esssentially a slightly modified CUDALucas, so if you can build CuLu on Windows, you have a good start on building this.


Edit to add: run it with e.g.

Code:
./CUDA-pm1 60593041, -b1 1000, [-f 3360k]
If you don't specify b1 it defaults to 1. Specifying the fft is only necessary if b1 is small (< ~ 690). In that case, the test will finish before it knows if the fft is big enough and will sometimes give invalid results.
Attached Files
File Type: zip cudapm1-0.00.zip (27.9 KB, 448 views)

Last fiddled with by owftheevil on 2013-03-02 at 22:04
owftheevil is offline   Reply With Quote
Old 2013-03-02, 22:15   #50
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

19×232 Posts
Default

Very impressive and very "evil"! "The lesser of two weevils".
Batalov is offline   Reply With Quote
Old 2013-03-02, 23:21   #51
ET_
Banned
 
ET_'s Avatar
 
"Luigi"
Aug 2002
Team Italia

3×1,619 Posts
Default

Quote:
Originally Posted by owftheevil View Post
Here's the code if anyone wants to play with it. It builds without problems on Ubuntu 12.04 with either Cuda4.2 or Cuda5.0 and gmp5.1.0. Its esssentially a slightly modified CUDALucas, so if you can build CuLu on Windows, you have a good start on building this.


Edit to add: run it with e.g.

Code:
./CUDA-pm1 60593041, -b1 1000, [-f 3360k]
If you don't specify b1 it defaults to 1. Specifying the fft is only necessary if b1 is small (< ~ 690). In that case, the test will finish before it knows if the fft is big enough and will sometimes give invalid results.
Version 0.00, I love it!

Thanks a lot!

Luigi

P.S. the auto-correction works like a charm...
Code:
luigi@luigi-ubuntu:~/luigi/CUDA/cudapm1-0.00$ ./CUDA-Pm1 60593041, -b1 1000

Starting Stage 1 P-1, M60593041, B1 = 1000, fft length = 3200K
Doing 1475 iterations
Running careful round off test for 1000 iterations. If average error >= 0.25, the test will restart with a longer FFT.
Iteration  100, average error = 0.14299, max error = 0.34317
Iteration  200, average error = 0.13821, max error = 0.32842
Iteration = 284 < 1000 && err = 0.35178 >= 0.35, increasing n from 3200K
Starting Stage 1 P-1, M60593041, B1 = 1000, fft length = 3360K
Doing 1475 iterations
Running careful round off test for 1000 iterations. If average error >= 0.25, the test will restart with a longer FFT.
Iteration  100, average error = 0.05615, max error = 0.12774
Iteration  200, average error = 0.05472, max error = 0.13322
Iteration  300, average error = 0.05601, max error = 0.13491
Iteration  400, average error = 0.05518, max error = 0.14575
Iteration  500, average error = 0.05690, max error = 0.12720
Iteration  600, average error = 0.05746, max error = 0.13921
Iteration  700, average error = 0.05881, max error = 0.13656
Iteration  800, average error = 0.05901, max error = 0.13856
Iteration  900, average error = 0.05955, max error = 0.14368
Iteration 1000, average error = 0.05916 < 0.25 (max error = 0.14575), continuing test.
M60593041, 0x962b95049cafb7d9, offset = 0, n = 3360K, CUDA-P-1 v0.00
Stage 1 complete, estimated total time = 0:58
M60593041 has a factor: 2105528336291622770155712978260232660484461209
P.S.2: M1257787 has a factor: 1

Last fiddled with by ET_ on 2013-03-02 at 23:32 Reason: After downloading, making and testing...
ET_ is offline   Reply With Quote
Old 2013-03-02, 23:35   #52
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

I'm not sure, but I think Dubslow is responsible for the roundoff test part. Its hard to tell who did what on CuLu.

Edit: I didn't see the PS. I have so far been too lazy to make a different message for no factor found. I was thinking of just adding "but you already knew that, didn't you."

Last fiddled with by owftheevil on 2013-03-02 at 23:37
owftheevil is offline   Reply With Quote
Old 2013-03-02, 23:39   #53
henryzz
Just call me Henry
 
henryzz's Avatar
 
"David"
Sep 2007
Liverpool (GMT/BST)

37×163 Posts
Default

I can see that cpus are going to become obsolete for P-1 stage 1 soon. This should help kill the P-1 deficit.
henryzz is offline   Reply With Quote
Old 2013-03-02, 23:43   #54
firejuggler
 
firejuggler's Avatar
 
"Vincent"
Apr 2010
Over the rainbow

1011010001002 Posts
Default

Thanks for your work owftheevil.
firejuggler is offline   Reply With Quote
Old 2013-03-03, 01:39   #55
frmky
 
frmky's Avatar
 
Jul 2003
So Cal

A2316 Posts
Default

Great! I very much look forward to this being polished. Stage-1 only is certainly much better than no P-1. Once this uses GIMPS input files and saves results in GIMPS output format, I'll switch my 4 C1060's from double-checks to P-1 Stage-1.
frmky is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfaktc: a CUDA program for Mersenne prefactoring TheJudger GPU Computing 3622 2023-01-25 16:41
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51
World's dumbest CUDA program? xilman Programming 1 2009-11-16 10:26
Factoring program need help Citrix Lone Mersenne Hunters 8 2005-09-16 02:31
Factoring program ET_ Programming 3 2003-11-25 02:57

All times are UTC. The time now is 05:26.


Sun Jan 29 05:26:53 UTC 2023 up 164 days, 2:55, 0 users, load averages: 0.65, 1.05, 1.18

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

โ‰  ยฑ โˆ“ รท ร— ยท โˆ’ โˆš โ€ฐ โŠ— โŠ• โŠ– โŠ˜ โŠ™ โ‰ค โ‰ฅ โ‰ฆ โ‰ง โ‰จ โ‰ฉ โ‰บ โ‰ป โ‰ผ โ‰ฝ โŠ โŠ โŠ‘ โŠ’ ยฒ ยณ ยฐ
โˆ  โˆŸ ยฐ โ‰… ~ โ€– โŸ‚ โซ›
โ‰ก โ‰œ โ‰ˆ โˆ โˆž โ‰ช โ‰ซ โŒŠโŒ‹ โŒˆโŒ‰ โˆ˜ โˆ โˆ โˆ‘ โˆง โˆจ โˆฉ โˆช โจ€ โŠ• โŠ— ๐–• ๐–– ๐–— โŠฒ โŠณ
โˆ… โˆ– โˆ โ†ฆ โ†ฃ โˆฉ โˆช โŠ† โŠ‚ โŠ„ โŠŠ โŠ‡ โŠƒ โŠ… โŠ‹ โŠ– โˆˆ โˆ‰ โˆ‹ โˆŒ โ„• โ„ค โ„š โ„ โ„‚ โ„ต โ„ถ โ„ท โ„ธ ๐“Ÿ
ยฌ โˆจ โˆง โŠ• โ†’ โ† โ‡’ โ‡ โ‡” โˆ€ โˆƒ โˆ„ โˆด โˆต โŠค โŠฅ โŠข โŠจ โซค โŠฃ โ€ฆ โ‹ฏ โ‹ฎ โ‹ฐ โ‹ฑ
โˆซ โˆฌ โˆญ โˆฎ โˆฏ โˆฐ โˆ‡ โˆ† ฮด โˆ‚ โ„ฑ โ„’ โ„“
๐›ข๐›ผ ๐›ฃ๐›ฝ ๐›ค๐›พ ๐›ฅ๐›ฟ ๐›ฆ๐œ€๐œ– ๐›ง๐œ ๐›จ๐œ‚ ๐›ฉ๐œƒ๐œ— ๐›ช๐œ„ ๐›ซ๐œ… ๐›ฌ๐œ† ๐›ญ๐œ‡ ๐›ฎ๐œˆ ๐›ฏ๐œ‰ ๐›ฐ๐œŠ ๐›ฑ๐œ‹ ๐›ฒ๐œŒ ๐›ด๐œŽ๐œ ๐›ต๐œ ๐›ถ๐œ ๐›ท๐œ™๐œ‘ ๐›ธ๐œ’ ๐›น๐œ“ ๐›บ๐œ”