Quote:
Originally Posted by Prime95
Just got numbers for the 2K FFT:
SSE2: 100 iters in 1.110 ms
AVX: 100 iters in 0.682 ms
I'm not sure if these numbers will extrapolate to large FFTs. These small FFTs operate completely in the L1 cache.
|
Nice. I use the smaller FFTs most of the time in LLR anyway so I am not too worried.