![]() |
|
|
#1 |
|
Oct 2005
Italy
33910 Posts |
Will be possible , in future, that a single exponent is checked by different machines by software parallelization ?
|
|
|
|
|
|
#2 |
|
Aug 2002
Termonfeckin, IE
22·691 Posts |
Never say never but it is highly unlikely as the Lucas-Lehmer test is sequential where each step depends on the result of the previous one.
Last fiddled with by garo on 2005-10-27 at 16:25 |
|
|
|
|
|
#3 | |
|
Bamboozled!
"𒉺𒌌𒇷𒆷ð’€"
May 2003
Down not across
250428 Posts |
Quote:
Paul |
|
|
|
|
|
|
#4 | |
|
∂2ω=0
Sep 2002
República de California
19·613 Posts |
Quote:
|
|
|
|
|
|
|
#5 |
|
Feb 2004
France
16248 Posts |
GLucas does LLT by parallelizing the FFT work with several threads.
With specific very fast connectivity (Quadrics, as an example) available between several big machines with many CPUs, one may imagine replacing the threads by MPI, once it has been made clear which parts of the FFT memory is used by each thread on each machine. I guess the bigger the exponent to be checked is, the more efficient such an architecture is. But moving mprime or GLucas or MLucas to MPI is very complicated and very few computing centers having clusters connected by MPI/Quadrics would want to search a new Mersenne prime. So, such an idea does not seem useful. BUT, it may be interesting to some team to try discovering if F33 (the 33th Fermat number) is prime or not. With only one very fast CPU (3.6 GHz), it should take about 4000 years of computation ... But, if one could use 16 clusters of 128 CPUs, it should take only 1 to 2 years .... Since new big machines (IBM/PPC, Sun/SPARC, Bull/Itanium2, ...) are now delivering machines with 32 or 64 very powerful processors with HT/SMT, and since there is now a race to have several cores (2 to 8, like in Cell) in one chip, it should be possible to check the primality of F33 in less than 1 year of computation, on 4 machines with 64x8 cores ; in 5 years from now, I guess. Since LLT can be used to prove a Fermat number is prime or composite, mprime could be used. But, is it worth to parallelize mprime and to use 4 very big machines in cluster during 1 year for only 1 bit result (F33 prime or not) ? Fame ? Tony Last fiddled with by T.Rex on 2005-10-27 at 19:31 |
|
|
|
|
|
#6 | |
|
Bamboozled!
"𒉺𒌌𒇷𒆷ð’€"
May 2003
Down not across
2A2216 Posts |
Quote:
However, I was discussing possibility rather than cost-effectiveness. It's possible to parallelize within an exponentiation step --- everyone who's thought about the situation agrees on that. Most people who have thought about the situation have failed to come up with a scheme to parallelize the exponentiation itself. Paul |
|
|
|
|
|
|
#7 | |
|
∂2ω=0
Sep 2002
República de California
19×613 Posts |
Quote:
|
|
|
|
|
|
|
#8 | |
|
Tribal Bullet
Oct 2004
DD716 Posts |
Quote:
You'd need hundreds of ordinary computers to manage that, and this isn't going to scale nearly as well as linpack. You can technically use a few dozen of Clearspeed's chips, but I doubt you'd be able to manage the off-chip memory bandwidth needed to keep them all busy. jasonp |
|
|
|
|
|
|
#9 | |
|
"Nancy"
Aug 2002
Alexandria
46438 Posts |
Quote:
Alex |
|
|
|
|
|
|
#10 | |
|
May 2003
111001112 Posts |
Quote:
The double-word limbs (complex floating point transform, not NTT) that I'm using heavily in PIES/LG presently unfortunately don't appear to help at all, as they're alas fractionally 'noisier' than a double-length transform would be. Phil |
|
|
|
|