|
|
#1 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
6419₁₀ Posts |
A bargainhardware.com dual E5-2650v1 machine costs £300, uses 220 watts, and does 7.2ms per iteration on four 44M exponents in parallel (about 1.8ms per iteration in aggregate).
A GTX1080 costs £665, uses about 150 watts, and does 2.4ms per iteration on a single 45M exponent. So the machine designed five years ago for no-holds-barred double-precision wins quite handily even without the benefits of AVX2, and I should probably stop running CUDALucas. Running the equivalent comparison for ECM now.
Last fiddled with by fivemack on 2017-01-07 at 12:06 |
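The comparison above can be reduced to a few ratios. The following is only a rough sketch using the figures quoted in the post; it assumes the 7.2ms figure applies to each of the four concurrent tests, ignores the small difference between 44M and 45M exponents, and all names in it are illustrative.

```python
# Figures as quoted in the post; "parallel_tests" encodes the assumption that
# the Xeon box advances four independent tests at 7.2 ms per iteration each.
machines = {
    "dual E5-2650v1": {"cost_gbp": 300, "watts": 220, "ms_per_iter": 7.2, "parallel_tests": 4},
    "GTX 1080": {"cost_gbp": 665, "watts": 150, "ms_per_iter": 2.4, "parallel_tests": 1},
}


def summarize(spec):
    """Return aggregate throughput, throughput per pound, and energy per iteration."""
    iters_per_sec = spec["parallel_tests"] * 1000.0 / spec["ms_per_iter"]
    return {
        "iters_per_sec": iters_per_sec,
        "iters_per_sec_per_gbp": iters_per_sec / spec["cost_gbp"],
        "joules_per_iter": spec["watts"] / iters_per_sec,
    }


results = {name: summarize(spec) for name, spec in machines.items()}

for name, r in results.items():
    print(
        f"{name}: {r['iters_per_sec']:.1f} iter/s, "
        f"{r['iters_per_sec_per_gbp']:.2f} iter/s per GBP, "
        f"{r['joules_per_iter']:.2f} J/iter"
    )
```

On these numbers the Xeon box manages roughly 556 iterations per second against roughly 417 for the GPU, which is close to three times the throughput per pound, while energy per iteration is similar (about 0.40 J against 0.36 J).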
|
|
|
|
|
#2 | |
|
Nov 2008
7658 Posts |
Quote:
|
|
|
|
|
|
|
#3 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
7²·131 Posts |
I get the impression there is more than enough GPU TF effort already in place. Mostly I got the GPU for factorisation, running ECM and polynomial selection, which it does pretty well.
|
|
|
|
|
|
#4 |
|
Einyen
Dec 2003
Denmark
3⁵×13 Posts |
The GTX 1080 is one of those newer cards with DP at 1/32 of SP performance (257 GFLOPS vs 8228 GFLOPS), so it is not the best choice for LL tests. There is really a need for a new consumer card with better DP performance.
|
|
|
|
|
|
#5 |
|
"/X\(‘-‘)/X\"
Jan 2013
2²×733 Posts |
Apparently this year's Vega from AMD will have 1/16th DP.
That dual processor machine looks like a good deal for LL. |
|
|
|
|
|
#6 |
|
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
3⁶×13 Posts |
Amen to that!
In contrast to CUDALucas (which makes calls to some vanilla FFT and hits all the artificial NVIDIA-imposed bottlenecks), one should really run some DWT or NTT algorithm (like GeNeFer or Cyclo) to make GPUs really shine. |
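For readers unfamiliar with the term, here is a toy illustration of why an NTT sidesteps the double-precision bottleneck: the transform works entirely in integer arithmetic modulo a prime, so the convolution is exact and needs no floating point at all. This is only a sketch, not how GeNeFer or any production code does it; the prime 998244353 with primitive root 3 is a common textbook choice, the transform is the naive O(n²) form for clarity, and the function names are made up for this example.

```python
P = 998244353  # prime of the form 119 * 2**23 + 1, so power-of-two lengths up to 2**23 work
G = 3          # a primitive root modulo P


def ntt(a, invert=False):
    """Naive O(n^2) number-theoretic transform of a list whose length is a power of two."""
    n = len(a)
    root = pow(G, (P - 1) // n, P)   # a primitive n-th root of unity mod P
    if invert:
        root = pow(root, P - 2, P)   # modular inverse via Fermat's little theorem
    out = [sum(a[j] * pow(root, i * j, P) for j in range(n)) % P for i in range(n)]
    if invert:
        n_inv = pow(n, P - 2, P)
        out = [x * n_inv % P for x in out]
    return out


def multiply(x, y):
    """Multiply two non-negative integers via an NTT convolution of their decimal digits."""
    a = [int(d) for d in reversed(str(x))]
    b = [int(d) for d in reversed(str(y))]
    n = 1
    while n < len(a) + len(b):       # pad so the cyclic convolution does not wrap around
        n *= 2
    a += [0] * (n - len(a))
    b += [0] * (n - len(b))
    fa, fb = ntt(a), ntt(b)
    conv = ntt([u * v % P for u, v in zip(fa, fb)], invert=True)
    # Summing with powers of ten performs the carry propagation implicitly.
    return sum(c * 10**i for i, c in enumerate(conv))


print(multiply(12345, 6789) == 12345 * 6789)
```

A real implementation would use a fast O(n log n) transform, much larger digits, and a modulus suited to the hardware; the point here is only that every intermediate value is an exact integer.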
|
|
|
|
|
#7 | |
|
P90 years forever!
Aug 2002
Yeehaw, FL
1D66₁₆ Posts |
Quote:
I briefly explored an all-integer solution (not NTT though). My conclusion was I was unlikely to significantly beat the current CUDALucas timings. IIRC, it would be roughly +/- 20%. I think this was on a 6xx GPU. |
|
|
|
|
|
|
#8 | |
|
Feb 2016
UK
660₈ Posts |
Quote:
It would also be fairer to compare against a used previous-generation card. Taking a 980Ti as an example, it has approximately 2/3 of the 1080's rated boost SP FLOPS at a target cost under half that of a 1080, although its TDP is higher at 250W.
If you really want DP, what about the R9 280X? It was possibly the last fast consumer card before they started to cripple DP. A quick look on eBay shows them under £150, and that gets you in the ballpark of 1 DP TFLOP. Still 250W TDP though. If anyone can give me idiot-proof instructions on how to bench it, I can do it on mine. I've lowered the voltage with a BIOS mod, so in practice it only takes around 200W now.
Unfortunately, a new consumer card with better DP is probably not going to happen unless you can find a compelling consumer DP requirement. If anything, the trend seems to be going the other way, with ever more FLOPS at lower precision. |
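To put the DP figures mentioned in this thread side by side, here is a rough sketch that works out peak DP GFLOPS per pound and per watt. It uses only numbers quoted in the posts (257 DP GFLOPS, £665 and about 150W for the GTX 1080; roughly 1 DP TFLOP, under £150 and 250W TDP or about 200W undervolted for the R9 280X). These are rated peaks and rounded prices, not measured LL throughput, and the names in the code are illustrative.

```python
# Peak figures as quoted in the thread; the R9 280X price is treated as an upper
# bound of 150 GBP and its DP rate as a round 1000 GFLOPS (both approximations).
cards = {
    "GTX 1080": {"dp_gflops": 257, "cost_gbp": 665, "watts": 150},
    "R9 280X (stock TDP)": {"dp_gflops": 1000, "cost_gbp": 150, "watts": 250},
    "R9 280X (undervolted)": {"dp_gflops": 1000, "cost_gbp": 150, "watts": 200},
}


def dp_value(card):
    """Return peak DP GFLOPS per pound and per watt for one card."""
    return {
        "gflops_per_gbp": card["dp_gflops"] / card["cost_gbp"],
        "gflops_per_watt": card["dp_gflops"] / card["watts"],
    }


value = {name: dp_value(card) for name, card in cards.items()}

for name, v in value.items():
    print(f"{name}: {v['gflops_per_gbp']:.2f} GFLOPS/GBP, {v['gflops_per_watt']:.2f} GFLOPS/W")
```

On paper the used R9 280X comes out far ahead on both ratios, but only a real benchmark of the LL code in question would show how much of that peak is usable.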
|
|
|
|
|
|
#9 |
|
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
3⁶·13 Posts |
Right! (I must have been thinking about llrCUDA -- which simply calls FFTw)
NTT gave GeNeFer a new life (b ranges were extended, and now that it is implemented in OCL it is free of NVIDIA shackles). |
|
|
|
|
|
#10 |
|
"/X\(‘-‘)/X\"
Jan 2013
2²×733 Posts |
|
|
|
|
|
|
#11 |
|
Dec 2011
After milion nines:)
5·17² Posts |
Can llrCUDA be rewritten as llrocl?
|
|
|
|