![]() |
|
|
#12 |
|
1976 Toyota Corona years forever!
"Wayne"
Nov 2006
Saskatchewan, Canada
22·7·167 Posts |
On the less effective end:
Code:
Q9550 20,000,001 (Time... 25 iterations) Cores Per Iteration %Improvement 1 14.51 2 9.46 +53% 3 8.32 +21% 4 6.96 +34% Over all effectiveness 4 cores just slightly more than twice as fast as 1. Last fiddled with by petrw1 on 2012-04-11 at 18:56 |
|
|
|
|
|
#13 |
|
1976 Toyota Corona years forever!
"Wayne"
Nov 2006
Saskatchewan, Canada
22·7·167 Posts |
On the more effective end
Code:
i5-750 OC to 3.2 20,000,001 (Time... 25 iterations) Cores Per Iteration %Improvement 1 13.05 2 6.99 87% 3 4.83 84% 4 4.04 53% Last fiddled with by petrw1 on 2012-04-11 at 18:57 |
|
|
|
|
|
#14 |
|
Feb 2012
the Netherlands
2·29 Posts |
The Q9550 is not really a quad-core CPU, just like every other S775 4core CPU.
Actually, 2 dual-cores on one die, so I suppose this architecture has something to do with the test results. Last fiddled with by Stef42 on 2012-04-12 at 14:36 |
|
|
|
|
|
#15 | |
|
Oct 2011
7·97 Posts |
Quote:
Code:
Single thread total throughput on 27.5M exps: Workers %/day 1 11.636 2 (C1&C2) 17.95 2 (C1&C3) 21.30 3 22.89 4 24.17 3+mfak 5kSP - 22.1 3+mfak 200Ksp - 20.3 Double thread total throughput on 27.5M exps: Workers %/day 1 18.48 2 25.13 2 instances of P95, 1 single threads, 1 double threads, total throughput on 27.5M exps: Workers %/day 1 & 1 26.18 2 & 1 25.11 1 & 1 + mfak 5k - 23.80 1 & 1 + mfak 200K - 21.12 2 & 1 + mfak 5k - 24.01 |
|
|
|
|
|
|
#16 |
|
Apr 2012
2×5 Posts |
So a multi-computer test isn't ruled out as potentially faster than a single processor. But a number of known things limit it, as well as probably loads of unknowns which may rule it out completely. For a start the ram of the controller computer has to be fast enough to feed all processors (as I think the fft and inverse can't be split, and it's the multiplication stage which allows multi-core. So the start and end of an iteration would have to be on a single core?). If ram is already the limiting factor then that's that.
Thank you for the replies. |
|
|
|
|
|
#17 | |
|
Nov 2003
22·5·373 Posts |
Quote:
tiring. And the answer remains the same. The "goal" you suggest is distinctly sub-optimal. It is much more efficient, given the availability of multiple cores, to simply run multiple (separate) LL tests on different exponents. Throughput would be substantially higher. |
|
|
|
|
|
|
#18 | |
|
Oct 2011
7×97 Posts |
Quote:
The Lucas-Lehmer primality test is remarkably simple. It states that for P > 2, 2P-1 is prime if and only if Sp-2 is zero in this sequence: S0 = 4, SN = (SN-12 - 2) mod (2P-1). For example, to prove 27 - 1 is prime: S0 = 4 S1 = (4 * 4 - 2) mod 127 = 14 S2 = (14 * 14 - 2) mod 127 = 67 S3 = (67 * 67 - 2) mod 127 = 42 S4 = (42 * 42 - 2) mod 127 = 111 S5 = (111 * 111 - 2) mod 127 = 0 S2 cannot be calculated before S1 is known, and likely the time to send the multiplication to multiple computers to calculate and receive it back would probably take longer than for a single computer to complete it. This is true, to an extent. The properties of certain processors make them more efficient under certain circumstances. From the testing I was doing, it shows a Core 2 Quad with 1 double thread and 1 single thread exponent running can outperform 4 single thread exponents. With the slowdown George has come across with the optimization of AVX with 4 cores running, the above example may also prove more efficient. Last fiddled with by bcp19 on 2012-04-12 at 18:33 |
|
|
|
|
|
|
#19 |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
1C3516 Posts |
Well, okay, in the "typical" use case, RDS is right, where typical excludes such things as a duo-dual-core and AVX. Even with AVX, it's still more efficient to run one per core.
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| does half-precision have any use for GIMPS? | ixfd64 | GPU Computing | 9 | 2017-08-05 22:12 |
| Single vs Dual channel memory | TObject | Hardware | 5 | 2014-12-24 05:58 |
| How to have all 4 cores working on a single number? | tech96 | Information & Answers | 5 | 2014-07-04 09:53 |
| Why factoring is single-core designed? | otutusaus | Software | 33 | 2010-11-20 21:05 |
| 4 checkins in a single calendar month from a single computer | Gary Edstrom | Lounge | 7 | 2003-01-13 22:35 |