mersenneforum.org  

Old 2018-01-22, 09:07   #353
Jean Penné
 
May 2004
FRANCE

1000101101₂ Posts
Verifying a Generalized Fermat prime

Hi,

I verified the primality of the most recent prime announced on the Prime Database Verification Status page, using llrCUDA V3.8.1:

./llrCUDA -d -a7 -oVerbose=1 -oDebug=0 -q"1880370^524288+1"
[Sat Jan 20 18:55:48 2018]
Base factorized as : 2*3^2*5*17*1229
Base prime factor(s) taken : 17, 1229
Starting N-1 prime test of 1880370^524288+1
Using complex rational base DWT and generic reduction, FFT length = 1507328, a = 3
[Mon Jan 22 06:51:05 2018]
3^((N-1)/1229)-1 is coprime to N!
[Mon Jan 22 08:40:50 2018]
3^((N-1)/17)-1 is coprime to N!
1880370^524288+1 is prime! (3289511 decimal digits) Time : 123098.306 sec.

The total time is quite satisfactory, but the gcd computations done by the giants code took several hours... I have to write CUDA gcd code!
Regards,
Jean
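
For readers following the log above, here is a minimal Python sketch of the N-1 (Pocklington-style) test it reports, at toy sizes. The function name `n_minus_1_test` and its parameters are hypothetical, and this is not llrCUDA's code: the real program does the modular arithmetic with DWT/FFT multiplication on the GPU. The sketch only mirrors the logic visible in the log: pick some prime factors q of N-1 (there, 17 and 1229, which suffice because (17*1229)^524288 exceeds sqrt(N)), check a^(N-1) = 1 mod N, then check that a^((N-1)/q)-1 is coprime to N for each chosen q.

```python
from math import gcd

def n_minus_1_test(N, taken_primes, a):
    """Pocklington-style N-1 test for a single base a (illustrative only).

    Returns True only if base a proves N prime using the given prime
    factors of N-1. False means "not proven with this base".
    """
    # Build F: the part of N-1 made of the chosen primes, full multiplicity.
    F, rest = 1, N - 1
    for q in taken_primes:
        if rest % q != 0 and (N - 1) % q != 0:
            raise ValueError(f"{q} does not divide N-1")
        while rest % q == 0:
            rest //= q
            F *= q
    # The theorem needs the factored part to exceed sqrt(N).
    if F * F <= N:
        return False
    # Fermat condition: a^(N-1) must be 1 mod N.
    if pow(a, N - 1, N) != 1:
        return False
    # One coprimality check per chosen prime q, matching the log lines
    # "3^((N-1)/q)-1 is coprime to N!".
    for q in taken_primes:
        if gcd(pow(a, (N - 1) // q, N) - 1, N) != 1:
            return False
    return True

# Toy usage: 2^16+1 = 65537, where N-1 = 2^16, with base a = 3.
print(n_minus_1_test(65537, [2], 3))
```

A False result from one base is not a proof of compositeness; a real implementation would try another base or fall back to a different test.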
Old 2018-01-22, 09:51   #354
pepi37
 
Dec 2011
After milion nines:)

1285₁₀ Posts

Which GPU was used in this case?
34 hours is a good time :)
Old 2018-01-22, 12:45   #355
Jean Penné
 
May 2004
FRANCE

557 Posts

Quote:
Originally Posted by pepi37 View Post
Which GPU was used in this case?
34 hours is a good time :)
As I wrote in a previous post:

The GPU is an EVGA GeForce GTX 1080 FTW HYBRID GAMING 8 GB.
The CUDA version is 8.0.44.

The CPU is an Intel Core i7-5930K (3.5 GHz, 6 cores).

The operating system is Ubuntu Linux x86_64.

Regards,
Jean
Old 2018-09-24, 09:03   #356
Honza
 
Feb 2011

2²×5 Posts

Does anybody have a Windows binary?
I'd like to try it on an RTX 2080 and see how it performs...
Old 2018-11-29, 18:19   #357
VBCurtis
 
"Curtis"
Feb 2005
Riverside, CA

11×383 Posts

I have llrCUDA running on Ubuntu 16.04, on an SB-era Xeon with a Quadro 5000 GPU.
When testing with k=443 and exponents around 3.3M, the FFT chosen is about 4 times larger than with regular sllr64 (917nnn vs 224k).
I tried another exponent just over 6M, and again the FFT is roughly 4x larger (~1.5M vs 400k).
llrCUDA puts 100% load on one CPU core and 90+% on the GPU, but throughput is marginally worse than using that one CPU core for sllr64 alone.

If the FFT size choice is a bug and can be fixed, it appears my GPU could do work at the rate of 3-4 cores of this system, which would be quite nice!

Previous versions have shown better speed with smaller k; I'll try k=13 next and report any improvement.
Old 2018-11-29, 19:03   #358
R. Gerbicz
 
"Robert Gerbicz"
Oct 2005
Hungary

29×47 Posts

Quote:
Originally Posted by Jean Penné View Post
[Mon Jan 22 06:51:05 2018]
3^((N-1)/1229)-1 is coprime to N!
[Mon Jan 22 08:40:50 2018]
3^((N-1)/17)-1 is coprime to N!
1880370^524288+1 is prime! (3289511 decimal digits) Time : 123098.306 sec.

The total time is quite satisfactory, but the gcd computations done by the giants code took several hours... I have to write CUDA gcd code!
Regards,
Jean
You can get a trivial speedup here:
gcd(c_0,n)=1 && gcd(c_1,n)=1 && ... && gcd(c_t,n)=1 is true iff gcd(r,n)=1, where
r=(c_0*c_1*c_2*...*c_t)%n
so whenever you use the generalized Pocklington theorem, it is enough to compute only one gcd.
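
The single-gcd trick can be sketched in a few lines of Python. The function names here are hypothetical and the values are toy-sized; in the N-1 test the c_i would be the residues a^((N-1)/q_i)-1 mod N. It works because a prime divisor of n divides the product exactly when it divides at least one factor, and reducing mod n does not change the gcd with n.

```python
from math import gcd

def all_coprime_one_gcd(cs, n):
    """Check gcd(c, n) == 1 for every c in cs using a single gcd."""
    r = 1
    for c in cs:
        r = (r * c) % n  # running product, reduced mod n at each step
    return gcd(r, n) == 1

def all_coprime_naive(cs, n):
    """Reference version: one gcd per value."""
    return all(gcd(c, n) == 1 for c in cs)

# Toy usage with n = 35 = 5 * 7.
print(all_coprime_one_gcd([2, 3, 4], 35), all_coprime_naive([2, 3, 4], 35))
print(all_coprime_one_gcd([2, 5, 3], 35), all_coprime_naive([2, 5, 3], 35))
```

The cost is t extra modular multiplications in exchange for t fewer gcds, which pays off when a gcd on multi-million-digit numbers is far slower than one multiplication.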

Last fiddled with by R. Gerbicz on 2018-11-29 at 19:04
Old 2018-11-29, 19:57   #359
diep
 
Sep 2006
The Netherlands

1010100010₂ Posts

VBCurtis, I'm watching your findings carefully.

I do not know how many double-precision GFLOPS a single core of the Xeon delivers on paper, but if it's a Fermi Quadro 5000 GPU, it has 359.04 GFLOPS double precision and 120 GB/s of bandwidth to the GDDR5, with a 152 W TDP - but that last number is just a Coca Cola Toto number.

What GHz setting does the Xeon have?

For now, my blind guess is that a single SB core is a factor of 20 slower than the Quadro...

Last fiddled with by diep on 2018-11-29 at 19:59
Old 2018-11-29, 20:20   #360
VBCurtis
 
"Curtis"
Feb 2005
Riverside, CA

11·383 Posts

Quote:
Originally Posted by diep View Post
What GHz setting does the Xeon have?

For now, my blind guess is that a single SB core is a factor of 20 slower than the Quadro...
Xeon X5650, 6 x 2.66 GHz. There are two CPUs in the box, not that it matters for this llrCUDA attempt.
All times are UTC.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.