mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2004-09-06, 08:35   #12
Dresdenboy
 
Dresdenboy's Avatar
 
Apr 2003
Berlin, Germany

192 Posts
Default

Quote:
Originally Posted by Dresdenboy
The long pipeline causes really high latencies for SSE2 instructions (compared to Northwood).
Correction: I forgot, that the new version of this measuring tool adds a lot of latency/throughput results for combined instructions, so the high times at the end misled me a bit The actual SSE2 instruction latencies just increased by 1. But that is additionally to increased L1 and L2 latencies.
Dresdenboy is offline   Reply With Quote
Old 2004-09-08, 03:53   #13
SalemTheCat100
 
Oct 2002

2·13 Posts
Default

Quote:
Originally Posted by Angular
I have been reading the new Prescott reviews with a bit of shock. I expected the prescott doubles both the L1 and L2 cache. The biggest surprise to me is that the integer pipeline increased from 21 stages to 31 stages. This compares to the 10/15 stage (integer/FP) pipeline of the Athlon XP. I am not an assembly programmer but this sounds like it could be trouble for Prime95's highly optimized asm code. Given the market dominance of Intel it may be more critical to fix this problem (if it exists) than to fix the Operteron.
Here we go again. If it's an Intel problem then drop anything that might be of value to Prime95 contributors who happen to be using AMD processors.

Keep doing that and maybe there won't be any AMD Prime95 contributors left.

SALEM
SalemTheCat100 is offline   Reply With Quote
Old 2004-09-08, 15:55   #14
db597
 
db597's Avatar
 
Jan 2003

CE16 Posts
Default Prescott faster than Northwood over 3.7GHz

Can't remember where I first saw these numbers, but someone did some benchmarks and found that the Prescott finally overtakes the Northwood at around 3.7Gig. That's for overall performance, no idea about for Prime95 specifically.

IMHO, reason for the overtake seems to be the cache size. At over 3.7Gig, the memory and bus can't keep up, so the bigger cache allows the Prescott to pull ahead. It's got nothing to do with Prescotts new pipeline or design though - which will forever be worse at no matter what clockspeed.

And if you're overclocking, the Prescott will take even longer to overtake the Northwood. When I say overclock, I'm thinking about 2.4Gig northwoods being run at 3gig with 1000MHz FSB and 500MHz DDR (lots of enthusiasts seem to have this). In such a case, it will be well over 4GHz before Prescott will catch up with Northwood.

Also, don't count on compiler optimisations to do much to help prescott. Though some might argue that speed will increase as software gets optimised for the new 31 stage pipeline, it won't. There's some results on ixbt using an optimised intel fortran compiler (-xP switch) :

http://www.ixbt.com/cpu/insidespeccpu2000-part-e.shtml

As you can see, using the optimised compiler, Prescott only benefits by more than 1% in 2 out of the 14 benchmarks. And of those 2, 1 improvement is very marginal (191.fma3d). The other is about 15% (168.wupwise). This is possibily due to the use of SSE3.

Until we get into 4GHz+ territory, there's no point to get a Prescott. And this won't happen until earliest Jaunary next year when E0 stepping Prescotts hit the shops - should run cooler as well. But what's worse, a 4GHz+ Prescott will need a LGA775 board with DDR2 and PCI Express. So you're looking at changing your entire setup, not just a simple upgrade. Even the casing will be changing to the BTX format to enable better cooling for the nearly 100W that Prescott will generate. I just don't see Intel doing 4GHz+ on Socket 478, they can't cool it enough.
db597 is offline   Reply With Quote
Old 2004-09-26, 03:37   #15
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

22×863 Posts
Default

I got a prescott 3.4 Ghz processor. No matter what priority I set Prime95 at, it only uses 50% CPU, any ideas? Also Prime95 has found L2 cache correct at 1024kb but have Unknown at L1 cache, is that important?

Last fiddled with by ATH on 2004-09-26 at 03:40
ATH is offline   Reply With Quote
Old 2004-09-26, 04:30   #16
JuanTutors
 
JuanTutors's Avatar
 
"Juan Tutors"
Mar 2004

571 Posts
Default

Quote:
Originally Posted by ATH
I got a prescott 3.4 Ghz processor. No matter what priority I set Prime95 at, it only uses 50% CPU, any ideas? Also Prime95 has found L2 cache correct at 1024kb but have Unknown at L1 cache, is that important?
About the 50% thing, look in http://www.mersenneforum.org/showthread.php?t=3064. I don't know about the rest of the stuff.
JuanTutors is offline   Reply With Quote
Old 2004-09-26, 17:01   #17
moo
 
moo's Avatar
 
Jul 2004
Nowhere

80910 Posts
Default

u proibly have ht enabled its not bad read around the forums theres a lot on it there.
moo is offline   Reply With Quote
Old 2004-11-15, 04:08   #18
RickC
 
Mar 2003

4510 Posts
Default Prescott vs. Northwood performance varies

A multi-page article with many benchmarks:

http://tech-report.com/reviews/2004q1/p4-prescott/

Here's an interesting quote from page 2 of the article:

"Five new instructions for complex arithmetic allow for better handling of tasks like Fast Fourier Transforms; these instructions should enhance the Pentium 4's potential in scientific and distributed computing scenarios."


I also found the "Prescott New Instructions Software Developer’s Guide":

http://www.intel.com/cd/ids/develope...tion/66756.htm

The htm page has a link to the pdf.

I'm a software engineer with a rusty math degree but I don't know assembler.
RickC is offline   Reply With Quote
Old 2004-11-15, 07:04   #19
ET_
Banned
 
ET_'s Avatar
 
"Luigi"
Aug 2002
Team Italia

5·7·139 Posts
Default

Quote:
Originally Posted by RickC
A multi-page article with many benchmarks:

http://tech-report.com/reviews/2004q1/p4-prescott/

Here's an interesting quote from page 2 of the article:

"Five new instructions for complex arithmetic allow for better handling of tasks like Fast Fourier Transforms; these instructions should enhance the Pentium 4's potential in scientific and distributed computing scenarios."


I also found the "Prescott New Instructions Software Developer’s Guide":

http://www.intel.com/cd/ids/develope...tion/66756.htm

The htm page has a link to the pdf.

I'm a software engineer with a rusty math degree but I don't know assembler.
SSE3 instructions have been deeply studied by George Woltman and the people involved in Prime95 developing.

They sadly reported here in the forum that SSE3 registers and instructions are of little or no use to enhance Prime95 throughput.

Luigi
ET_ is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
How to use prime95 for stage 1 & GMP-ECM for stage 2 Prime95 Lone Mersenne Hunters 118 2022-07-04 18:19
GMP-ECM & Prime95 Stage 1 Files Gordon GMP-ECM 3 2016-01-08 12:44
The news giveth, the news taketh away... NBtarheel_33 Hardware 17 2009-05-04 15:52
Stage 1 with mprime/prime95, stage 2 with GMP-ECM D. B. Staple Factoring 2 2007-12-14 00:21
Prescott impact to prime95. nucleon Hardware 35 2003-07-22 06:02

All times are UTC. The time now is 13:16.


Fri Jul 7 13:16:27 UTC 2023 up 323 days, 10:45, 0 users, load averages: 1.36, 1.23, 1.15

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔