mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2003-10-26, 00:26   #1
carltao
 
Oct 2003

10102 Posts
Default Why Athlon XP benchmarks much more slowly?

I don't think Athlon Xp 2.2G is slower than P4 1.8G, why the benchmark results show that?
Is it possible that the program was not well optimized for AMD CPU's?
carltao is offline   Reply With Quote
Old 2003-10-26, 01:18   #2
GP2
 
GP2's Avatar
 
Sep 2003

5×11×47 Posts
Default

P4 CPUs have SSE2 instructions, which are much more efficient for this program. Athlon 64s also have these, but not older Athlons.
GP2 is offline   Reply With Quote
Old 2003-10-26, 03:53   #3
carltao
 
Oct 2003

128 Posts
Default


iSSE2 so magic? I have heard that P4 without optimization is far slower than XP, and comparable with XP with optimization. I still think there should be more done on XP optimization for the program. Thanks.
This benchmark makes me feel reluctant to run prime95 anymore on my XP machine, because its low efficiency.
carltao is offline   Reply With Quote
Old 2003-10-26, 04:05   #4
carltao
 
Oct 2003

2×5 Posts
Default

Athlon XP supports MMX(+), iSSE and 3DNow(+), is it possible for prime95 to use these instruction sets?
carltao is offline   Reply With Quote
Old 2003-10-26, 06:49   #5
QuintLeo
 
QuintLeo's Avatar
 
Oct 2002
Lost in the hills of Iowa

1C016 Posts
Default

Prime is one of the very few cases where the SSE2 optimizations make a CPU a *lot* more efficient, similar to the case with Altivec and RC5 (G4 and later AltiVec CPUs blow EVERYTHING else away on RC5 per Mhz, and in the upper speed ranges in absolute keyrate terms as well).

It's also VERY memory-intensive, thus also favoring the P4 and it's very high front-side bus speed.


Some preliminary observations have shown that the Opteron gains quite a bit from it's SSE2 support, but still lags the P4 somewhat due to much lower clock speed than current P4s. The Athlon64 should also close the gap on the P4 quite a bit, for the same reason.


Distributed computing projects tend to use a very small set of the instructions in a CPU, so they are VERY sensisive to how efficient those particular instructions are. They are NOT a good indication of the "general purpose" power or capabilities of any CPU.

Again, as an example, the Athlon and K5 and Pentium-III benefit in RC5 work because they implimented a "barrel shifter" in hardware - RC5 relies a LOT on one particular instruction that uses that barrel shifter. The P4 suffers a LOT from having it's barrel shifter implimented in microcode, which makes that particular instruction a LOT slower.

The Altivec CPUs benefit from being able to work on something like 16 different RC5 keys AT THE SAME TIME, so even though each key is processed by the AltiVec unit a lot slower, the overall average is that it gets through a lot MORE keys in a given period of time.
QuintLeo is offline   Reply With Quote
Old 2003-10-26, 06:54   #6
carltao
 
Oct 2003

128 Posts
Default

thanks for the detailed explanation,
But I am still confused at whether Prime95 takes full power of Athlon XP's...

Last fiddled with by carltao on 2003-10-26 at 06:56
carltao is offline   Reply With Quote
Old 2003-10-26, 07:05   #7
QuintLeo
 
QuintLeo's Avatar
 
Oct 2002
Lost in the hills of Iowa

26·7 Posts
Default

BTW - MMX is an integer-only instructions set. This does NOT help on Prime LL work, which is floating-point based. It DID help the Pentium-MMX somewhat on RC5 work, for similar reasons to why AltiVec CPUs are efficient in that project - but doesn't handle nearly as MANY keys, so was only a 50% or so speedup as opposed to a 300%+ speedup for AltiVec (and the K5 was STILL faster at 117.5 Mhz than a Pentium-MMX was at 166Mhz ).

SSE and 3DNow are single-precision floating point instruction sets (IIRC) - Prime LL work needs a lot higher precision than these sets are capable of.
QuintLeo is offline   Reply With Quote
Old 2003-10-26, 07:09   #8
QuintLeo
 
QuintLeo's Avatar
 
Oct 2002
Lost in the hills of Iowa

7008 Posts
Default

I run Prime on all of my Athlons here - they may not be as fast as the 'leading edge" of P4s, but they CAN argue with any P4 in their price range (they lose, but it's fairly close)....

P4's vs. Athlons are a long-running debate for Prime cost-effectiveness - if you stay away from the "bleeding edge", like my trio of Thuroughbred XP1800s (all running 1800Mhz or faster) - the closest cost-effective P4 is the Northwood 1.7Ghz or 1.8Ghz, 'cause those can be overclocked about as much. Depends mostly on how expen$ive your electric is - mine here is cheap, so it's a tossup over a 3 year period, other places where electric is a dime per KWH or more the P4s win easily.
QuintLeo is offline   Reply With Quote
Old 2003-10-26, 11:24   #9
carltao
 
Oct 2003

2·5 Posts
Default

My XP 1800+ can run @2.3G with AVC112C86 stably on nForce2. Athlon XPs are really great general-purpose CPUs.
carltao is offline   Reply With Quote
Old 2003-10-26, 12:42   #10
dsouza123
 
dsouza123's Avatar
 
Sep 2002

12268 Posts
Default

An Athlon is most efficient for one part of Prime95.

For Trial Factoring to a bit depth of 2^64 Athlons are the most efficient (x86 architecture). At an equal clock speed Athlons will beat the other CPUs ( will calculate it faster ).
Though given a P4 with a sufficiently higher clock speed, the P4 can finish quicker than an Athlon.

At 2^65 and above P4s are top. The SSE2 instructions are more efficient with 64 bit doubles. Combined with the clock speed advantage make it unbeatable.

I am not sure what speed a P4 would need to beat an Athlon @2.3 Ghz up to 2^64 bits, it may require one faster than the highest unclocked available ( 3.2 at this point ).
dsouza123 is offline   Reply With Quote
Old 2003-10-28, 08:11   #11
carltao
 
Oct 2003

10102 Posts
Default

Thanks for giving me confidence to run prime95 on my machine.
carltao is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Athlon II 620 x4 henryzz Hardware 5 2009-10-28 03:25
Prime-search apps run slowly on Ubuntu 8.04 mdettweiler Linux 13 2009-03-21 08:04
amd athlon 64 bug? Unregistered Hardware 17 2004-06-01 22:57
Athlon 64 ThomRuley Hardware 7 2003-09-28 13:31
Athlon vs. P4 ThomRuley Lone Mersenne Hunters 1 2003-07-11 23:15

All times are UTC. The time now is 08:09.


Sat Jul 17 08:09:09 UTC 2021 up 50 days, 5:56, 1 user, load averages: 1.25, 1.41, 1.36

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.