![]() |
|
|
#1 | |
|
Jul 2009
Tokyo
2·5·61 Posts |
llrpisrc.zip convert to CUDA.
![]() support k*2^n+1 & prime only. Quote:
|
|
|
|
|
|
|
#2 |
|
Account Deleted
"Tim Sorbera"
Aug 2006
San Antonio, TX USA
10B716 Posts |
Looks like it's significantly slower than a CPU right now:
Code:
99*2^83863+1 is prime! Time : 7.407 sec. |
|
|
|
|
|
#3 |
|
Sep 2004
B0E16 Posts |
Corei5 750@3.6 GHz with llr3.8.4 version.
Code:
5*2^23473+1 is prime! Time : 332.759 ms. 11*2^18759+1 is prime! Time : 167.324 ms. 99*2^83863+1 is prime! Time : 4.739 sec. 21*2^94801+1 is prime! Time : 5.356 sec. 39*2^113549+1 is prime! Time : 7.782 sec. Last fiddled with by em99010pepe on 2011-01-08 at 18:27 |
|
|
|
|
|
#4 |
|
A Sunny Moo
Aug 2007
USA
189A16 Posts |
![]() Awesome! ![]() Gary and I are still working to get his GPU functioning again...we've been running into some strange issues with driver config on Ubuntu 10.04, but will hopefully be able to get it working soon. As soon as we do, I'm open to help with any testing that's needed. |
|
|
|
|
|
#5 |
|
Sep 2004
2·5·283 Posts |
msft,
Please make a test with bigger numbers: 5*2^1282755+1 5*2^1320487+1 |
|
|
|
|
|
#6 | |
|
Oct 2010
19110 Posts |
Quote:
First impression: ralf@quadriga ~/llrcuda.0.07 $ time ./llrCUDA -q"5*2^1282755+1" -d Starting Proth prime test of 5*2^1282755+1, FFTLEN = 131072 ; a = 3 5*2^1282755+1, bit: 20000 / 1282757 [1.55%]. Time per bit: 2.113 ms. Quick comparison: Time per bit on the CPU: ~0.812 ms. CPU Result (LLR 3.8.4): ralf@quadriga ~ $ time sllr -q"5*2^1282755+1" -d Resuming Proth prime test of 5*2^1282755+1 at bit 20876 [1.62%] 5*2^1282755+1 is prime! Time : 1041.208 sec. real 17m4.170s user 17m2.276s sys 0m1.640s I've accidently interrupted the CPU run. The first 1.62% took: real 0m17.266s user 0m17.113s sys 0m0.028s so you need to add ca. 17 seconds to the 1041 seconds above... Last fiddled with by Ralf Recker on 2011-01-08 at 22:52 |
|
|
|
|
|
|
#7 |
|
Oct 2010
191 Posts |
OK. Here is the result from the GPU (details above):
ralf@quadriga ~/llrcuda.0.07 $ time ./llrCUDA -q"5*2^1282755+1" -d Starting Proth prime test of 5*2^1282755+1, FFTLEN = 131072 ; a = 3 5*2^1282755+1 is prime! Time : 2708.763 sec. real 45m8.793s user 45m2.749s sys 0m5.644s Edit: A version compiled with --arch=sm_21 is slower (2.155 ms per bit), a version compiled with --arch=sm_20 is a tiny bit faster (2.085 ms per bit). Last fiddled with by Ralf Recker on 2011-01-08 at 23:32 |
|
|
|
|
|
#8 |
|
Sep 2004
2×5×283 Posts |
Ralf Recker,
First of all thank you. Second, can you post the specs of your machine (memory, harddrives, DVD-R, etc)? I want to make some calculations about energy efficiency so I need to know how many and type of components you have on it to make an energy consumption estimate. |
|
|
|
|
|
#9 |
|
Jul 2009
Tokyo
2×5×61 Posts |
GTX460:
5*2^1282755+1 is prime! Time : 4491.564 sec. 5*2^1320487+1 is prime! Time : 4447.951 sec. |
|
|
|
|
|
#10 | |
|
Jul 2009
Tokyo
61010 Posts |
Fix abort with non prime.
Quote:
|
|
|
|
|
|
|
#11 |
|
May 2004
FRANCE
24·3·13 Posts |
Hi,
First, Best wishes to you for an happy new year, and many congrats for this work! Indeed, I am very interested in your attempts, although I have presently neither hardware nor software to develop my code with CUDA... However, I am now working on a new version of llrpi, which is no more limited to IBDWT and small k's : it works with zero-padded FFT for k's from 22 to 45 bits large, and generic modular reduction for larger k's. Moreover, the portable "gwpnum" code is written as a library, like the George Woltman's "gwnum" one. It seems to work fine for k*2^n+1 and k*2^-1 numbers (and using generic reduction for more general ones), so, I shall release the new source shortly. Best Regards, Jean |
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| LLRcuda | shanecruise | Riesel Prime Search | 8 | 2014-09-16 02:09 |
| LLRCUDA - getting it to work | diep | GPU Computing | 1 | 2013-10-02 12:12 |