mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2011-10-12, 12:40   #12
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

2·7·461 Posts
Default

The dual-Xeon has six memory controllers and the 1100T has two; msieve in my experience scales with the number of memory controllers. I haven't made measurements of the difference between -t4 and -t6 on a six-core Intel or AMD machine, but other people on this forum I think have.

Of course the dual-Xeon machine costs at least four times as much as the 1100T machine.

I would wait a couple of months and see how well Sandy Bridge E does, since it has four memory controllers at a not-unreasonable price.
fivemack is offline   Reply With Quote
Old 2011-10-12, 18:04   #13
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
San Diego, Calif.

32×7×163 Posts
Default

Quote:
Originally Posted by em99010pepe View Post
For msieve use in LA phase do you recommend one AMD 1100T with 16 GB or one dual Intel® Xeon™ X5650 SixCore 2.26GHz 12M 6,4 QPI 12 MB with 4x Dimm 8GB 1333MHz DDR3 ECC Reg w/Parity CL9 DIMM? Does msieve scale well with increase of number of cores? Sorry for the question but I have not experience running msieve on a CPU with more than 4 cores.

Thanks in advance,

Carlos
Alas, for linear algebra, 4 to 6-core transition (940 to 1055T) already left me disappointed, scaling was not too good - it was existent but significantly levelled. Arguably part of the reason is mentioned by Tom (especially with the DDR2 board and memory that I kept from 940). Dual Xeon did much better (but I only had a short window to test and not own it); it was a $10K+ system though, not far to compare.

1055T was not a total waste, because while running LA with -t 4 one may run some two extra processes on the backburner. (I'll build two computers for kids out of all these parts a bit later.)

I am no longer too excited about this release; just revived the thread. Now, after reviews, the power consumption seems to be quite a nail in its OC potential's coffin. (300W+ with light OC? gaaah!)
Batalov is offline   Reply With Quote
Old 2011-10-12, 21:43   #14
nucleon
 
nucleon's Avatar
 
Mar 2003
Melbourne

5·103 Posts
Default

Quote:
Originally Posted by fivemack View Post

I would wait a couple of months and see how well Sandy Bridge E does, since it has four memory controllers at a not-unreasonable price.
SNB-E first looks:

http://www.tomshardware.com/reviews/...ance,3026.html

6cores, 50GB/sec mem speed, drool.

It is a _demon_ with memory performance. This thing is a P-1 specialist. I found with my pseudo-scientific testing, that memory speed was important for P-1 to a point.

-- Craig
nucleon is offline   Reply With Quote
Old 2011-10-12, 21:47   #15
nucleon
 
nucleon's Avatar
 
Mar 2003
Melbourne

5×103 Posts
Default

Bulldozer with it's 1FP per 2 int cores, will be interesting to see what's the optimal prime95 run. I know on the '8' core cpu, it won't be 8x LL. Maybe 4x LL and 4x TF perhaps? Depends on memory bandwidth, ala core2 quads.

-- Craig
nucleon is offline   Reply With Quote
Old 2011-10-12, 21:48   #16
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

As for the Bulldozer shared FPU thing, the first graph here shows that it actually manages to keep apiece the 2600k. Considering it's other failings though, I'd just go with a 2600k, 2500k, or better yet, save up for Ivy Bridge. That's supposed to be a 30% performance gain over SB at the same power. I want to see that!
Dubslow is offline   Reply With Quote
Old 2011-10-12, 23:40   #17
Christenson
 
Christenson's Avatar
 
Dec 2010
Monticello

34038 Posts
Default

What if we set the first integer core to feed factor candidates to mfaktc, and the second one and the FPU to do LL tests? That sound like a good mix?
Christenson is offline   Reply With Quote
Old 2011-10-13, 00:09   #18
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Good luck coding that
Would the scheduler be smart enough to lock down the FPU to the LL?
Dubslow is offline   Reply With Quote
Old 2011-10-13, 01:47   #19
Christenson
 
Christenson's Avatar
 
Dec 2010
Monticello

34038 Posts
Default

I think it would be a matter of auditing mfaktc on the CPU side and making sure it made relatively few floating point calls...or possibly none at all, saving the context switches...the rest is done in task manager when we tell it how we want to use the cores.

I do such things in motor control codes, and mfaktc isn't all that complicated.
Christenson is offline   Reply With Quote
Old 2011-10-13, 09:00   #20
nucleon
 
nucleon's Avatar
 
Mar 2003
Melbourne

5·103 Posts
Default

Quote:
Originally Posted by Christenson View Post
What if we set the first integer core to feed factor candidates to mfaktc, and the second one and the FPU to do LL tests? That sound like a good mix?
Definitely. Sounds like a plan. The hard part would be to work out the core numbering. Say the cores go like this: fp core split over core0 and core1, then core2 and core 3 etc.... So your work distribution would be:

Core0: Prime95 LL testing
Core1: mfaktc instance0
Core2: Prime95 LL testing
Core3: mfaktc instance1
Core4: Prime95 LL testing
Core5: mfaktc instance2
etc...

That is easily mapped under windows.

That would suit up to say 2x GTX560Ti cards. That would suit 1x GTX580 easily. Depends on clock speed of CPU and available IO. You might get away with 2x GTX570s, YMMV. I had a 2600k clocked to 4.5GHz and that was almost enough to drive 2x GTX580s. It can drive a GTX580 and GTX460 though.

-- Craig
*By drive I mean get at least 95% GPU utilization with mfaktc.
nucleon is offline   Reply With Quote
Old 2011-10-13, 19:02   #21
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Even though this would work fine, it still seems you'd get better value/throughput by waiting for Ivy Bridge
Dubslow is offline   Reply With Quote
Old 2011-10-13, 21:20   #22
nucleon
 
nucleon's Avatar
 
Mar 2003
Melbourne

10038 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Even though this would work fine, it still seems you'd get better value/throughput by waiting for Ivy Bridge
:)

Ivy Bridge will still only be 4x cores, hopefully they'll increase the production clock rate a bit.

I'm keen on Sandy Bridge-E CPUs. Granted a bit more on the pricey, by oodles of memory bandwidth will make them a P-1 beast. And they'll be 6 cores.

-- Craig
nucleon is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
(windows) How to launch mfaktO/C with processor affinity swl551 Software 1 2012-09-24 23:29
Intel unveils 50-core maths co-processor card stars10250 Hardware 16 2011-06-25 17:24
Prime fatal error on 3D launch kaeptn_kork Hardware 11 2009-03-17 01:58
Product launch xilman Factoring 12 2006-07-11 16:23
Intel Core Duo processor drew Hardware 5 2006-05-29 07:00

All times are UTC. The time now is 14:42.


Fri Jul 7 14:42:35 UTC 2023 up 323 days, 12:11, 0 users, load averages: 1.58, 1.32, 1.12

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔