![]() |
|
|
#45 |
|
Jan 2005
Caught in a sieve
18B16 Posts |
TPSieve-CUDA v0.2.1 is out, in the usual download location. (See the first post in the thread.) It should be a little faster than 0.2.0 on the GPU, and a lot lighter on the CPU, particularly on Windows, and even more so on Linux 32-bit. Hopefully this will allow a return to the higher speeds of the past on this sieve.
P.S. No, the meaning of -m didn't change. |
|
|
|
|
|
#46 |
|
Jan 2005
Caught in a sieve
1100010112 Posts |
I've just re-thought the CPU multiplication, and I think I'm going to have to pull this build. I'll try again tomorrow.
|
|
|
|
|
|
#47 |
|
Jan 2005
Caught in a sieve
5·79 Posts |
Alright, v0.2.1a is uploaded. It may use 15-30% more CPU, but at least it won't print incorrect results.
(I hope. )
|
|
|
|
|
|
#48 |
|
Jan 2005
Caught in a sieve
5·79 Posts |
And now v0.2.2 is out. It ought to be faster than v0.2.1a, if only because it's more efficient with the CPU. But it's also more efficient with GPUs, particularly older ones.
Last fiddled with by Ken_g6 on 2010-10-14 at 05:08 |
|
|
|
|
|
#49 |
|
Jan 2005
Caught in a sieve
5×79 Posts |
And I've released v0.2.2b, which fixes some bugs that could cause factors to be missed. This is why I asked for testing!
|
|
|
|
|
|
#50 |
|
Jan 2005
Caught in a sieve
5×79 Posts |
And one more bugfix, 0.2.2c, to restore about 1/10000 missing factors.
|
|
|
|
|
|
#51 |
|
Feb 2007
21110 Posts |
Can anyone tell me what would be the optimal settings for GTX 465.
i keep getting 202.6M p/sec 0.69 CPU cores but my GPU is barely used at 15% how can i have it be utilized closed to 60%-70% or higher. Also there is no CUDA windows 64 bit? Thanks. |
|
|
|
|
|
#52 | |
|
Jan 2005
Caught in a sieve
5×79 Posts |
Quote:
-m 16384 -Q 10e6 Then try doubling -m until it either slows down or crashes! Then try running two processes at once. On GTX465 and higher, that should improve performance. You might have to lower -m to do this, though. And one more thing that's probably better than all these other suggestions combined: Try reserving the same range across a larger N range. The main reason your GPU isn't used more is that it finishes its N range, for all given P's, before the next set of P's can be computed! Reserving the entire 480000-500000 range, and using two processes without any new flags, should in theory use 100% of your GPU! No. There's no free 64-bit compiler from Microsoft. And on most other projects it doesn't matter. |
|
|
|
|
|
|
#53 |
|
Jun 2009
22·52·7 Posts |
I have a twin sieve file created with NewPGen. I'd like to continue sieving on my GPU, but it says "invalid header in input file". How can I convert one format to the other?
Thanks Peter |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Fast Mersenne Testing on the GPU using CUDA | Andrew Thall | GPU Computing | 109 | 2014-07-28 22:14 |
| Inconsistent factors with TPSieve | Caldera | Twin Prime Search | 7 | 2013-01-05 18:32 |
| tpsieve-cuda slows down with increasing p | amphoria | Twin Prime Search | 0 | 2011-07-23 10:52 |
| Is TPSieve-0.2.1 faster than Newpgen? | cipher | Twin Prime Search | 4 | 2009-05-18 18:36 |
| Thread for non-PrimeNet LL testing | ThomRuley | Lone Mersenne Hunters | 6 | 2005-10-16 20:11 |