Can you test 4 numbers in parallel on your Q8400 and the same 4 in series on your GTX260?
Hi, Uncwilly
http://www.nvidia.com/object/io_1258360868914.html
> Editors' note: As previously announced, the first Fermi-based consumer (GeForce) products are expected to be available first quarter 2010.
New GTX260 result. M22728263 

Three more double checks have finished. These are just under the 2048K/4096K boundary. Only one of the three matched the previous result. Time will tell if the other two are correct. I also have a fourth running, 36500117, but after about a million iterations the roundoff error grew above the limit and it switched over to a 4096K FFT. Therefore, it is only about halfway through the double check.
36500089 36500111 36500119 
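For anyone following along, the switch to the larger FFT described above can be pictured roughly like this. It is only a sketch in Python, not the actual CUDA code: the error limit of 0.35 and the helper names are my assumptions, and only the 2048K and 4096K lengths come from the post.

```python
# Sketch only: ERROR_LIMIT and all function names are hypothetical,
# not taken from the actual program.
ERROR_LIMIT = 0.35                        # assumed threshold
FFT_LENGTHS = [2048 * 1024, 4096 * 1024]  # 2048K and 4096K

def max_roundoff(values):
    """Largest distance of any value from its nearest integer."""
    return max(abs(v - round(v)) for v in values)

def next_fft_length(current, lengths):
    """Return the next larger length from a sorted list, or None."""
    for n in lengths:
        if n > current:
            return n
    return None

def choose_fft_length(current, convolution_output):
    """Keep the current length unless the roundoff error is too large."""
    if max_roundoff(convolution_output) > ERROR_LIMIT:
        bigger = next_fft_length(current, FFT_LENGTHS)
        if bigger is not None:
            return bigger
    return current
```

The values coming out of the inverse FFT should be very close to integers; once one of them drifts too far from an integer, the rounded result can no longer be trusted, so the run continues with the next larger length.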
On the Tesla C1060, the TESRA version C is much slower than version y, but the nonTESRA version is the fastest yet with the 4096K FFT timing at 0.025 sec/iteration. If the improvements cannot also be used to optimize the TESRA version, then the TESRA version can now be dropped.

Secondly, I did have problems compiling this version, so I wondered if you could refresh me on the steps I need to take... Cheers

Hi, TheJudger
I wish you success with your programming. Fascinating. Testing is very important; could overclocking be contaminating the results?
Thank you, 

Hi msft/frmky,
Just some ideas off the top of my head:
- Perhaps you should choose exponents which have already been verified? In that case you can tell immediately whether your results are OK or not.
- Choose some exponents which are not so close to the FFT limit.
I didn't dive into the CUFFTW docs; perhaps the rounding (and the rounding errors) is not as accurate as in the CPU versions of MaclucasFFTW, and you need to lower the FFT boundaries?

TheJudger

P.S. 22 million checks per second for TF :)
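To illustrate the idea of starting with exponents whose status is already known, here is a tiny sketch in Python. It uses plain big-integer arithmetic rather than an FFT, so it is only practical for small exponents; the function name and the table are my own choices for illustration, not part of any of the programs discussed.

```python
def lucas_lehmer(p):
    """Lucas-Lehmer test: True if 2**p - 1 is prime (p an odd prime)."""
    m = (1 << p) - 1
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s == 0

# Small exponents with well-known outcomes (True means 2**p - 1 is prime).
KNOWN = {3: True, 5: True, 7: True, 11: False, 13: True,
         17: True, 19: True, 23: False, 29: False, 31: True}

# Any exponent listed here means the implementation disagrees with a
# known result and should not be trusted on new exponents.
mismatches = [p for p, expected in KNOWN.items()
              if lucas_lehmer(p) != expected]
print("mismatches:", mismatches)
```

The same approach works for a GPU version: run it on exponents with verified residues first, and only move on to unverified ones once the list of mismatches is empty.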
