![]() |
|
|
#34 |
|
Sep 2002
2·331 Posts |
Other processing possibilities.
AGEIA Technologies Inc PhysX chip, dedicated Physics Processing Unit (PPU) Expect to see PPU enabled systems and boards in time for the 2005 Christmas buying season. Native hardware support of NovedeX Physics engine. 2 Terabits/second of bandwidth is presently contemplated for internal memories facilitating data movement to/from the FPE. The internal memory structure has no "set associativity" limitations. PPU provides a library of common linear algebra and physics related algorithms implemented using the DME and FPE. However, application specific or custom algorithms may also be defined within PPU for execution by the DME and FPE. Xbox 2/Xbox Next/Xenon/Xbox 360 (? which name ?) "Xenon's CPU has three 3.0 GHz PowerPC cores. Each core is capable of two instructions per cycle and has an L1 cache with 32 KB for data and 32 KB for instructions. The three cores share 1 MB of L2 cache." ? Ship before end of 2005, two versions one with hard drive |
|
|
|
|
|
#35 |
|
May 2005
23·7·29 Posts |
Would it be possible / feasible to run LL tests on graphics hardware? The latest graphics chips from nV and ATi have a big processing power, the question is: would it be possible to utilize it in such a way?
Any thoughts on that
|
|
|
|
|
|
#36 | |
|
Sep 2003
Borg HQ, Delta Quadrant
2×33×13 Posts |
Quote:
|
|
|
|
|
|
|
#39 | ||
|
"Richard B. Woods"
Aug 2002
Wisconsin USA
22·3·641 Posts |
Quote:
Quote:
|
||
|
|
|
|
|
#40 | ||
|
Apr 2003
Berlin, Germany
192 Posts |
Quote:
Quote:
The double precision capabilities of Cell (more the SPE ones than the PPE's) have already been discussed in this thread. Just read above. |
||
|
|
|
|
|
#41 |
|
Apr 2003
Berlin, Germany
192 Posts |
While reading through an article in a Linux magazine, where they show the
possibilities of letting Linux run on Cell, I found an interesting document mentioned: Unleashing the power: A programming example of large FFTs on Cell The original source is here: http://www.power.org/news/events/barcelona/ And the document is here: http://www.power.org/news/events/barcelona/11_chow.pdf It speaks about single precision FFTs, but that doesn't matter, since it covers nearly all important factors, which might be interesting for implementing a LL test on Cell. They say, their FFT implementation would be already close to being computationally bound, so this would even be more the case when using double precision. |
|
|
|
|
|
#42 |
|
May 2005
Naperville, IL, USA
110001002 Posts |
Will the SSE3 instructions bail out Intel or does the AMD implementation of SSE3 keep AMD in the lead for serious number crunching?
|
|
|
|
|
|
#43 | |
|
Apr 2003
Berlin, Germany
192 Posts |
Quote:
However, here is the data I collected from the appropriate optimization manuals for the register-to-register variants of these instructions. Intel didn't give the numbers for the case, when memory operands are involved and delivered from shortest latency cache (as it is often the case in Prime95). For the K8 these instructions have a 1 (HADDPx/HSUBPx) or 2 cycles (ADDSUBPx, MOVxDUP) longer latency. Code:
Prescott/Nocona:
Instruction(s) Latency/ involved Units
Throughput
ADDSUBPD/ADDSUBPS 5 / 2 FP_ADD
HADDPD/HADDPS 13 / 4 FP_ADD,FP_MISC
HSUBPD/HSUBPS 13 / 4 FP_ADD,FP_MISC
MOVDDUP xmm1, xmm2 4 / 2 FP_MOVE
MOVSHDUP xmm1, xmm2 6 / 2 FP_MOVE
MOVSLDUP xmm1, xmm2 6 / 2 FP_MOVE
K8 Stepping E:
Instruction(s) Latency/ involved Units
Throughput
ADDSUBPD/ADDSUBPS 5 / 2 FADD
HADDPD/HADDPS 5 / 2 FADD
HSUBPD/HSUBPS 5 / 2 FMUL (maybe for parallel execution)
MOVDDUP xmm1, xmm2 2 / 2 FMUL
MOVSHDUP xmm1, xmm2 3 / 2 FMUL
MOVSLDUP xmm1, xmm2 3 / 2 FADD
As you can see, SSE3 wouldn't change the situation. |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Some transition probabilities | fivemack | Aliquot Sequences | 9 | 2012-03-16 08:49 |
| Caught in transition? | cheesehead | Forum Feedback | 1 | 2011-12-11 05:14 |
| Major overhaul of the DB | 10metreh | Aliquot Sequences | 5 | 2010-08-29 01:10 |
| How will the transition to v5 work? | ixfd64 | PrimeNet | 3 | 2008-10-01 01:42 |
| server transition news | ltd | Prime Sierpinski Project | 4 | 2006-04-19 20:25 |