![]() |
|
|
#34 |
|
A Sunny Moo
Aug 2007
USA
2·47·67 Posts |
|
|
|
|
|
|
#35 |
|
May 2010
499 Posts |
Would Ken or anyone else be interested in developing a CUDA sieve for a fixed n? If so, it could be used on the n=1000000 sieve for the Operation Megabit Twin subproject.
Last fiddled with by Oddball on 2010-09-19 at 05:28 |
|
|
|
|
|
#36 |
|
Jan 2005
Caught in a sieve
5×79 Posts |
Forget what I said here before; that was a fixed-K sieve.
The main obstacle to a fixed-N sieve is producing a list of primes to sieve against on the GPU. It might be doable. Last fiddled with by Ken_g6 on 2010-09-19 at 17:59 |
|
|
|
|
|
#37 | |
|
A Sunny Moo
Aug 2007
USA
142328 Posts |
Quote:
|
|
|
|
|
|
|
#38 |
|
Jan 2005
Caught in a sieve
6138 Posts |
Yeah, TPSieve-CUDA should work for a single N. It might even be faster than NewPGen, per CPU core run. And you might be able to run two or more processes at a time. But there would be a lot of bandwidth back and forth.
|
|
|
|
|
|
#39 | |
|
May 2010
499 Posts |
It doesn't work for large k ranges, though. From:
http://www.mersenneforum.org/showpos...6&postcount=86 Quote:
Last fiddled with by Oddball on 2010-09-19 at 20:47 |
|
|
|
|
|
|
#40 |
|
Jan 2005
Caught in a sieve
5·79 Posts |
That is an immense K range! The bitmap wouldn't fit in memory if you're getting that big. And besides, TPSieve wouldn't work until pmin > kmax.
How many candidates do you have in that range? Are you just sieving for twins, or for more specialized forms? |
|
|
|
|
|
#41 | |
|
May 2010
49910 Posts |
Quote:
http://www.mersenneforum.org/showthread.php?t=13439 The file has about 85M candidates. |
|
|
|
|
|
|
#42 | |
|
Mar 2006
Germany
57668 Posts |
Quote:
|
|
|
|
|
|
|
#43 |
|
Jan 2005
Caught in a sieve
5·79 Posts |
Hi, all,
I have a new version of TPSieve-CUDA, V0.2.0, out for testing. It's a lot faster than the current version, but I've added one algorithm I'm not completely sure about. I need people to test this new version against previous sieve files, where 22 <= nstep <= 31 and p > 9T. (Just make sure it says that nstep is changed to 22.) And let me know if it misses any factors. If this works, I have big plans for this app. One of those involves improving its speed a little more still. But if this app misses any factors, I need to know ASAP so I can re-arrange stuff. Thanks! |
|
|
|
|
|
#44 |
|
"Dave"
Sep 2005
UK
1010110110002 Posts |
I have re-run 510T-511T of 480000-485000 and the factors match. I used the default value of -m as the meaning has either changed or the new code puts a greater load on the GPU. My original best value of 24576 caused the code to grind to a halt. With the default value I got an improvement from 138M p/sec (CPU 0.41) to 195M p/sec (CPU 0.57).
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Fast Mersenne Testing on the GPU using CUDA | Andrew Thall | GPU Computing | 109 | 2014-07-28 22:14 |
| Inconsistent factors with TPSieve | Caldera | Twin Prime Search | 7 | 2013-01-05 18:32 |
| tpsieve-cuda slows down with increasing p | amphoria | Twin Prime Search | 0 | 2011-07-23 10:52 |
| Is TPSieve-0.2.1 faster than Newpgen? | cipher | Twin Prime Search | 4 | 2009-05-18 18:36 |
| Thread for non-PrimeNet LL testing | ThomRuley | Lone Mersenne Hunters | 6 | 2005-10-16 20:11 |