mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2011-01-08, 18:02   #1
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default llrCUDA

llrpisrc.zip convert to CUDA.
support k*2^n+1 & prime only.
Quote:
5*2^23473+1 is prime! Time : 7.933 sec.
11*2^18759+1 is prime! Time : 6.200 sec.
99*2^83863+1 is prime! Time : 32.564 sec.
21*2^94801+1 is prime! Time : 37.656 sec.
39*2^113549+1 is prime! Time : 62.529 sec.
Attached Files
File Type: gz llrCUDA.0.07.tar.gz (93.0 KB, 295 views)
msft is offline   Reply With Quote
Old 2011-01-08, 18:22   #2
Mini-Geek
Account Deleted
 
Mini-Geek's Avatar
 
"Tim Sorbera"
Aug 2006
San Antonio, TX USA

17·251 Posts
Default

Quote:
Originally Posted by msft View Post
llrpisrc.zip convert to CUDA.
support k*2^n+1 & prime only.
Looks like it's significantly slower than a CPU right now:
Code:
99*2^83863+1 is prime!  Time : 7.407 sec.
If its speed can be improved and at least the prime-only limitation removed, this could be a huge thing for projects like NPLB!
Mini-Geek is offline   Reply With Quote
Old 2011-01-08, 18:26   #3
em99010pepe
 
em99010pepe's Avatar
 
Sep 2004

2·5·283 Posts
Default

Corei5 750@3.6 GHz with llr3.8.4 version.

Code:
5*2^23473+1 is prime!  Time : 332.759 ms.
11*2^18759+1 is prime!  Time : 167.324 ms.
99*2^83863+1 is prime!  Time : 4.739 sec.
21*2^94801+1 is prime!  Time : 5.356 sec.
39*2^113549+1 is prime!  Time : 7.782 sec.

Last fiddled with by em99010pepe on 2011-01-08 at 18:27
em99010pepe is offline   Reply With Quote
Old 2011-01-08, 19:11   #4
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

3×2,083 Posts
Default



Awesome!

Gary and I are still working to get his GPU functioning again...we've been running into some strange issues with driver config on Ubuntu 10.04, but will hopefully be able to get it working soon. As soon as we do, I'm open to help with any testing that's needed.
mdettweiler is offline   Reply With Quote
Old 2011-01-08, 19:32   #5
em99010pepe
 
em99010pepe's Avatar
 
Sep 2004

283010 Posts
Default

msft,

Please make a test with bigger numbers:

5*2^1282755+1
5*2^1320487+1
em99010pepe is offline   Reply With Quote
Old 2011-01-08, 22:23   #6
Ralf Recker
 
Ralf Recker's Avatar
 
Oct 2010

191 Posts
Default

Quote:
Originally Posted by em99010pepe View Post
msft,

Please make a test with bigger numbers:

5*2^1282755+1
5*2^1320487+1
Just started the first one on my lowly GTX 460 (MSI factory overclocked @725 MHz, 64 Bit Linux, compiled with the CUDA 3.1 toolkit, Driver version 256.53) and on a Core of a Q9550 @ 3.6 GHz... results follow...

First impression:

ralf@quadriga ~/llrcuda.0.07 $ time ./llrCUDA -q"5*2^1282755+1" -d
Starting Proth prime test of 5*2^1282755+1, FFTLEN = 131072 ; a = 3
5*2^1282755+1, bit: 20000 / 1282757 [1.55%]. Time per bit: 2.113 ms.

Quick comparison: Time per bit on the CPU: ~0.812 ms.

CPU Result (LLR 3.8.4):

ralf@quadriga ~ $ time sllr -q"5*2^1282755+1" -d
Resuming Proth prime test of 5*2^1282755+1 at bit 20876 [1.62%]


5*2^1282755+1 is prime! Time : 1041.208 sec.

real 17m4.170s
user 17m2.276s
sys 0m1.640s

I've accidently interrupted the CPU run. The first 1.62% took:

real 0m17.266s
user 0m17.113s
sys 0m0.028s

so you need to add ca. 17 seconds to the 1041 seconds above...

Last fiddled with by Ralf Recker on 2011-01-08 at 22:52
Ralf Recker is offline   Reply With Quote
Old 2011-01-08, 23:06   #7
Ralf Recker
 
Ralf Recker's Avatar
 
Oct 2010

19110 Posts
Default

OK. Here is the result from the GPU (details above):

ralf@quadriga ~/llrcuda.0.07 $ time ./llrCUDA -q"5*2^1282755+1" -d
Starting Proth prime test of 5*2^1282755+1, FFTLEN = 131072 ; a = 3
5*2^1282755+1 is prime! Time : 2708.763 sec.

real 45m8.793s
user 45m2.749s
sys 0m5.644s

Edit: A version compiled with --arch=sm_21 is slower (2.155 ms per bit), a version compiled with --arch=sm_20 is a tiny bit faster (2.085 ms per bit).

Last fiddled with by Ralf Recker on 2011-01-08 at 23:32
Ralf Recker is offline   Reply With Quote
Old 2011-01-08, 23:40   #8
em99010pepe
 
em99010pepe's Avatar
 
Sep 2004

54168 Posts
Default

Ralf Recker,

First of all thank you. Second, can you post the specs of your machine (memory, harddrives, DVD-R, etc)? I want to make some calculations about energy efficiency so I need to know how many and type of components you have on it to make an energy consumption estimate.
em99010pepe is offline   Reply With Quote
Old 2011-01-09, 00:34   #9
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

GTX460:
5*2^1282755+1 is prime! Time : 4491.564 sec.
5*2^1320487+1 is prime! Time : 4447.951 sec.
msft is offline   Reply With Quote
Old 2011-01-09, 02:17   #10
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Fix abort with non prime.
Quote:
5*2^23471+1 is not prime. Proth RES64: FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFE Time : 8.686 sec.
5*2^23473+1 is prime! Time : 8.206 sec.
RE64 value is llrpisrc.zip original bug(with 64linux).somebody can fix ?
Attached Files
File Type: gz llrCUDA.0.08.tar.gz (92.8 KB, 142 views)
msft is offline   Reply With Quote
Old 2011-01-09, 09:49   #11
Jean Penné
 
Jean Penné's Avatar
 
May 2004
FRANCE

23·3·23 Posts
Default Very interesting work!

Hi,

First, Best wishes to you for an happy new year, and many congrats for
this work! Indeed, I am very interested in your attempts, although I have
presently neither hardware nor software to develop my code with CUDA...

However, I am now working on a new version of llrpi, which is no more
limited to IBDWT and small k's : it works with zero-padded FFT for k's
from 22 to 45 bits large, and generic modular reduction for larger k's.

Moreover, the portable "gwpnum" code is written as a library, like the
George Woltman's "gwnum" one.

It seems to work fine for k*2^n+1 and k*2^-1 numbers (and using generic
reduction for more general ones), so, I shall release the new source shortly.

Best Regards,
Jean
Jean Penné is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
LLRcuda shanecruise Riesel Prime Search 8 2014-09-16 02:09
LLRCUDA - getting it to work diep GPU Computing 1 2013-10-02 12:12

All times are UTC. The time now is 09:20.

Fri May 29 09:20:14 UTC 2020 up 65 days, 6:53, 0 users, load averages: 1.85, 1.42, 1.32

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.