I think the benchmark should run longer than it does. Each run is so short right now that any other activity on the box can skew the result. If it ran each FFT size for, say, one to five minutes, those irregularities would average out. For now I work around it by running the benchmark a few times and keeping the best result.
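
A rough sketch of the idea, not the actual benchmark code: it repeats a workload until a minimum wall-clock time has passed, then reports both the best and the median timing. Everything here is an assumption made for illustration. `bench`, `min_seconds`, and `min_runs` are hypothetical names, and `naive_dft` is just a slow placeholder standing in for whatever FFT the real benchmark calls.

```python
import cmath
import statistics
import time


def naive_dft(samples):
    """Placeholder O(n^2) transform; stands in for the real FFT under test."""
    n = len(samples)
    return [
        sum(samples[k] * cmath.exp(-2j * cmath.pi * j * k / n) for k in range(n))
        for j in range(n)
    ]


def bench(workload, min_seconds=1.0, min_runs=5):
    """Call workload() repeatedly until both min_seconds and min_runs are met."""
    timings = []
    start = time.perf_counter()
    while len(timings) < min_runs or time.perf_counter() - start < min_seconds:
        t0 = time.perf_counter()
        workload()
        timings.append(time.perf_counter() - t0)
    return {
        "runs": len(timings),
        "best": min(timings),                  # least disturbed single run
        "median": statistics.median(timings),  # robust against a few slow outliers
    }


if __name__ == "__main__":
    # Short durations so the sketch finishes quickly; the suggestion above
    # would correspond to min_seconds somewhere between 60 and 300.
    for size in (64, 128, 256):
        data = [complex(i % 7, 0) for i in range(size)]
        result = bench(lambda: naive_dft(data), min_seconds=0.2)
        print(size, result)
```

Taking the minimum matches the "run it a few times and keep the best" workaround, while the median shows how much the box was disturbed during the run. If the two are far apart, the machine was probably busy.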
