![]() |
|
|
#353 |
|
May 2004
FRANCE
24×3×13 Posts |
Hi,
I verified the primality of the last prime announced on the Prime Database Verification Status, using llrCUDA V3.8.1 : ./llrCUDA -d -a7 -oVerbose=1 -oDebug=0 -q"1880370^524288+1" [Sat Jan 20 18:55:48 2018] Base factorized as : 2*3^2*5*17*1229 Base prime factor(s) taken : 17, 1229 Starting N-1 prime test of 1880370^524288+1 Using complex rational base DWT and generic reduction, FFT length = 1507328, a = 3 [Mon Jan 22 06:51:05 2018] 3^((N-1)/1229)-1 is coprime to N! [Mon Jan 22 08:40:50 2018] 3^((N-1)/17)-1 is coprime to N! 1880370^524288+1 is prime! (3289511 decimal digits) Time : 123098.306 sec. The total time is pretty satisfying, but gcd's calculus by giants code took several hours... I have to write a CUDA gcd code! Regards, Jean |
|
|
|
|
|
#354 | |
|
Dec 2011
After 1.58M nines:)
1,699 Posts |
Quote:
34 hours is OK time :) |
|
|
|
|
|
|
#355 |
|
May 2004
FRANCE
24×3×13 Posts |
|
|
|
|
|
|
#356 |
|
Feb 2011
33 Posts |
Anybody got a Windows binary?
Willing to try RTX 2080 and how it is doing... |
|
|
|
|
|
#357 |
|
"Curtis"
Feb 2005
Riverside, CA
133368 Posts |
I have llrcuda running on ubuntu 16.04, SB-era Xeon with Quadro 5000 GPU.
When testing with k=443 and exponents around 3.3M, the FFT chosen is about 4 times larger than regular sllr64 (917nnn vs 224k). I tried another exponent just over 6M, and again FFT is roughly 4x larger (~1.5M vs 400k). llrcuda uses 100% load on one core, 90+% on GPU, but production is marginally worse than using that one CPU core for sllr64 alone. If FFT size choice is a bug and can be fixed, it appears my GPU could do work at the rate of 3-4 cores of this system, which would be quite nice! Previous versions have shown better speed with smaller k; I'll next try k=13 and report any improvement. |
|
|
|
|
|
#358 | |
|
"Robert Gerbicz"
Oct 2005
Hungary
3×547 Posts |
Quote:
gcd(c0,n)=1 && gcd(c1,n)=1 ... gcd(c_t,n)=1 is true iff gcd(r,n)=1, where r=(c0*c1*c2*...*c_t)%n so in every case where you'd use the Generalized Pocklington theorem it is enough to compute only one gcd. Last fiddled with by R. Gerbicz on 2018-11-29 at 19:04 |
|
|
|
|
|
|
#359 | |
|
Sep 2006
The Netherlands
32716 Posts |
VB Curtis i'm carefully watching your findings.
The Xeon i do not know how many gflops a single core is on paper double precision, yet if it's a Fermi Quadro 500 GPU it's having 359.04 gflops double precision and 120GB/s bandwidth to the GDDR5 with 152 watt TDP - but that last number is just a Coca Cola Toto number. What Ghz setting does the Xeon have? For now blindfolded guess the SB is single core factor 20 slower than the Quadro... Quote:
Last fiddled with by diep on 2018-11-29 at 19:59 |
|
|
|
|
|
|
#360 |
|
"Curtis"
Feb 2005
Riverside, CA
2×2,927 Posts |
|
|
|
|
|
|
#361 |
|
"Daniel Jackson"
May 2011
14285714285714285714
14018 Posts |
Could someone please tell me where to get a Windows 64-bit build?
|
|
|
|
|
|
#362 | |
|
Jan 2005
Caught in a sieve
5×79 Posts |
Quote:
It sure looks like llrCUDA hasn't been updated in awhile.
|
|
|
|
|
|
|
#363 |
|
"Daniel Jackson"
May 2011
14285714285714285714
769 Posts |
What about general numbers of the form k*b^n+/-1? Proth20 only does numbers of the form k*2^n+1, k < 10^8-1). I want to try and find Top 5000 primes with other bases (i.e. b=6, 10, 62, 94, and any other bases that I want to use). Basically, I'm looking for a CUDA equivalent of PFGW.
Last fiddled with by Stargate38 on 2021-01-04 at 01:17 Reason: more info |
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| LLRcuda | shanecruise | Riesel Prime Search | 8 | 2014-09-16 02:09 |
| LLRCUDA - getting it to work | diep | GPU Computing | 1 | 2013-10-02 12:12 |