![]() |
|
|
#12 |
|
"Lucan"
Dec 2006
England
2·3·13·83 Posts |
Thankyou. That was more or less what I was trying to say.
Does the incorporation of a FPU within the main processor have advantages which militate against the development of a separate dedicated chip? Last fiddled with by davieddy on 2007-10-13 at 08:27 |
|
|
|
|
|
#13 |
|
"Lucan"
Dec 2006
England
2×3×13×83 Posts |
PS I do appreciate that the inclusion of a DP
FPU with the world's computers is the "raison d'etre" for GIMPS. PPS And in 128-bit FP I suspect we would get more than double size in the mantissa compared with 64-bit DP. Last fiddled with by davieddy on 2007-10-13 at 10:54 |
|
|
|
|
|
#14 |
|
Tribal Bullet
Oct 2004
3·1,181 Posts |
The biggest advantage is that if someone else designs an FPU that has 106-bit mantissa precision and gets it to run at 2.X GHz, then you don't need to :) Otherwise, programmable logic is large enough nowadays that if you really wanted that kind of precision in dedicated hardware you could design it yourself. But then you are limited to a few hundred MHz at most, and you have to design a board to hold the logic chip. The chip costs as much as 2-10 complete PCs just by itself, and so to make a better business case than just buying 2-10 PCs you have to get 2-10x the throughput of a single PC. In this you are competing against CPUs that large companies have invested billions of dollars and man-centuries of design work to build. Ernst's customers can get that kind of speedup for their own custom problems, but the LL test maps too well to general-purpose hardware for a custom solution to compete with double precision on an Intel/AMD processor
|
|
|
|
|
|
#15 | |
|
Undefined
"The unspeakable one"
Jun 2006
My evil lair
24·389 Posts |
Quote:
|
|
|
|
|
|
|
#16 | |
|
Tribal Bullet
Oct 2004
3×1,181 Posts |
Quote:
|
|
|
|
|
|
|
#17 |
|
"Lucan"
Dec 2006
England
2×3×13×83 Posts |
Isn't this the real reason that pentiums seem so well suited
to the LLT? Last fiddled with by davieddy on 2007-10-15 at 07:48 |
|
|
|
|
|
#18 |
|
Aug 2002
North San Diego County
5×137 Posts |
That, and Intel's implementation of SIMD (SSE2) is better than AMD's at the moment. I don't know if that will change with AMD's Barcelona core and successors, but it could; AMD chips have been quicker than Intel in the past (circa P2, P3 and P4 prior to SSE2 optimization by George).
|
|
|
|
|
|
#19 | |
|
"Lucan"
Dec 2006
England
2×3×13×83 Posts |
Quote:
For the uninitiated, what (in a nutshell preferably) does SSE2 do? Last fiddled with by davieddy on 2007-10-15 at 09:43 Reason: trying to spell "uninitiated":) |
|
|
|
|
|
|
#20 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
3·2,141 Posts |
SSE2 is the instruction-set extension that lets you request two double-precision FP operations at a time.
On Pentium4 and Opteron, the two DP operations are performed consecutively in the pipeline, so each SSE2 instruction takes up two pipeline slots. On Core2 and Barcelona, they are performed simultaneously. |
|
|
|
|
|
#21 |
|
"Lucan"
Dec 2006
England
2×3×13×83 Posts |
My undesanding of FFT is that it replaces multiplication
operations with addition/subtractions. Specifically "butterfly" operations where from inputs a and b we seek (a+b) and (a-b). Can this not be accomplished in hardware (preferably in fixed-point)? |
|
|
|
|
|
#22 | |
|
Tribal Bullet
Oct 2004
3·1,181 Posts |
Quote:
Technically the FFT replaces a dense matrix multiplication (the discrete fourier transform) with a sparse matrix multiplication that happens to have a nice recursive structure. A good introduction to the FFT is in Numerical Recipes (the whole book is online at www.nr.com) and also the links page at www.fftw.org Number-theoretic FFTs use integers modulo a prime p for all these computations, with all results reduced modulo p. a,b, and w in this case are integers, or possibly (depending on p) complex integers with the real and imaginary parts separately reduced mod p. There are 'Fast Walsh Transforms' where w is always 1, but they cannot be used for arbitrary size convolutions. Last fiddled with by jasonp on 2007-10-16 at 03:38 |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| does half-precision have any use for GIMPS? | ixfd64 | GPU Computing | 9 | 2017-08-05 22:12 |
| translating double to single precision? | ixfd64 | Hardware | 5 | 2012-09-12 05:10 |
| Dual Core to process single work unit? | JimboPrimer | Homework Help | 18 | 2011-08-28 04:08 |
| exclude single core from quad core cpu for gimps | jippie | Information & Answers | 7 | 2009-12-14 22:04 |
| 4 checkins in a single calendar month from a single computer | Gary Edstrom | Lounge | 7 | 2003-01-13 22:35 |