View Single Post
Old 2006-11-16, 06:51   #4
S485122
 
S485122's Avatar
 
Sep 2006
Brussels, Belgium

2·5·167 Posts
Default

I did a lot of benchmarks in safe mode. No software running execpt Prime95. There is no significant difference between running one or two instances of Prime95. But the differences between "best times" from one run to another can go up to 2% on LL tests and up to 6% on factorisation tests.

This is why I think that an average of times would be more significant if enough iterations are used for each test.

It is possible that the policy of "best times" has to do with the development of the software see ftp://mersenne.org/gimps/p4notes.doc specially the part "Trace cache and branch prediction" (there are some parts of the algorithm that use a variable number of cycles...)

A possible compromise solution could be that the benchmarks outputs:
- best time.
- worst time.
- average time.
-standard deviation.

But this would give a lot of data to process.
S485122 is offline   Reply With Quote