![]() |
![]() |
#1 |
Mar 2012
Hyderabad, India
3×17 Posts |
![]()
Is anyone working with llrcuda? or any idea about gpu enabled prime search.
I need help (OpEd: Moved from the "Post lots of primes" thread) Last fiddled with by Kosmaj on 2012-08-15 at 06:22 |
![]() |
![]() |
![]() |
#2 | |
"Carlos Pinho"
Oct 2011
Milton Keynes, UK
23×641 Posts |
![]() Quote:
Pages that might me of interest: http://www.bc-team.org/downloads.php?cat=7 http://primegrid.pytalhost.net/Mirror.htm Edit: Sorry for the offtopic. Last fiddled with by pinhodecarlos on 2012-08-10 at 06:12 |
|
![]() |
![]() |
![]() |
#3 | |
"Curtis"
Feb 2005
Riverside, CA
5,623 Posts |
![]() Quote:
Indeed, I am. See the k=5 thread for my results with LLRcuda. Create a thread, or PM me, if you have any questions. Threads are wise for things other might want answers to! In short, it's great for specific n-ranges below FFT jumps; but there are few FFT sizes, so wide ranges of n are inefficient. Also, the bigger the n the more efficient CUDA is compared to CPU testing. -Curtis |
|
![]() |
![]() |
![]() |
#4 | |
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
3×29×83 Posts |
![]() Quote:
I can't help with actually doing it though, you'd have to ask msft for more details. |
|
![]() |
![]() |
![]() |
#5 |
"Carlos Pinho"
Oct 2011
Milton Keynes, UK
23·641 Posts |
![]()
Is there any OpenCL LLR version?
|
![]() |
![]() |
![]() |
#6 |
"Mr. Meeseeks"
Jan 2012
California, USA
87F16 Posts |
![]() |
![]() |
![]() |
![]() |
#7 |
Romulan Interpreter
"name field"
Jun 2011
Thailand
10,273 Posts |
![]()
Let's say not that it is inefficient, but it only plays well with FFT's which are powers of two. So, if accidentally you have to do a LL for an exponent in (say) 37M, where P95 and cudaLucas would also use a 2^x FFT, then clLucas is same efficient as the other two. And so on for other ranges where the 2^x is theoretically optimal. But as long as you are LL-ing exponents for which P95 or cudaLucas would select a better FFT, non-power of two, then clLucas would either be very slow with than non-power of two FFT, or you can force it to use the next higher 2^x FFT, for which the speed can be a little faster than with the smaller non-2^x, but still far away from what P95 or cudaLucas could do, due to the fact that now the FFT is higher.
Last fiddled with by LaurV on 2014-09-15 at 03:28 |
![]() |
![]() |
![]() |
#8 |
"Carlos Pinho"
Oct 2011
Milton Keynes, UK
23·641 Posts |
![]()
I am not worried about the inefficient because I will use the company electricity. Can you guys point me to the link of the client? I pretend to PRP riesel base 2 for k=5 at n>5M.
|
![]() |
![]() |
![]() |
#9 | |
"Mr. Meeseeks"
Jan 2012
California, USA
3×52×29 Posts |
![]() Quote:
![]() The main reason why: Not worth it... Besides, CPU's usually beat or are very close to GPU's with less power nowadays with FMA3, and that gap is especially very noticeable in lower FFT's where there is no memory bottleneck(CPU)... |
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
llrCUDA | msft | GPU Computing | 381 | 2022-05-25 13:31 |
LLRCUDA - getting it to work | diep | GPU Computing | 1 | 2013-10-02 12:12 |