mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2012-05-14, 09:23   #1
beans
 
May 2012

12 Posts
Default 2x Xeon x5450 QuadCore Workstation for factorization?

Hi,

I want to buy a computer and want to use it for large integer factorization. I have the opportunity to get an old FSC Celsius R550 workstation with a D2569 Dual Socket 771 board and 2 Xeon 5450 for about 500€. Is it useful to invest in such a machine ore will do a single Intel i7 Quad a better job on gnfs-lasieve?

Thanks.
beans is offline   Reply With Quote
Old 2012-05-14, 09:53   #2
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

642310 Posts
Default

That's an interesting decision point.

Sieving the same region with the same polynomial on a Xeon 5420 and an i7-2600S I get

(0.10065 sec/rel) (Xeon 5420, one thread) - note this is 2.5GHz rather than the 3.0GHz of a 5450
(0.05327 sec/rel) (i7-2600S, one turbo-boosted thread)
(0.06987 sec/rel) (i7-2600S, four threads running on one core each, time for median thread)
(0.11194 sec/rel) (i7-2600S, eight threads running on eight hypercores, time for median thread)

So the Xeon workstation may be a bit faster in aggregate (though note that the i7/2600S is a low-power model at 2.8GHz whilst you'd probably be using a 3770K at 3.5GHz), at the price of using a *lot* more electricity. I would get a minimal Ivy Bridge box instead - I think that can be done for 500€

Last fiddled with by fivemack on 2012-05-14 at 17:41 Reason: update with eight-jobs figures from i7-2600S
fivemack is offline   Reply With Quote
Old 2012-05-14, 16:31   #3
henryzz
Just call me Henry
 
henryzz's Avatar
 
"David"
Sep 2007
Cambridge (GMT/BST)

2×33×109 Posts
Default

Not mentioning that you would have to handle twice as many cores.
henryzz is offline   Reply With Quote
Old 2012-05-14, 17:35   #4
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

3×2,141 Posts
Default

(my colleague turns out to be out of the office today so I was able to run the eight-threads test)
fivemack is offline   Reply With Quote
Old 2012-05-14, 17:38   #5
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

642310 Posts
Default

Quote:
Originally Posted by henryzz View Post
Not mentioning that you would have to handle twice as many cores.
No, for GNFS sieving you want to run eight threads on either a dual quad-Core2-Xeon or a single quad-{Sandy,Ivy}Bridge with hyperthreading.
fivemack is offline   Reply With Quote
Old 2012-05-15, 10:30   #6
henryzz
Just call me Henry
 
henryzz's Avatar
 
"David"
Sep 2007
Cambridge (GMT/BST)

2·33·109 Posts
Default

Quote:
Originally Posted by fivemack View Post
No, for GNFS sieving you want to run eight threads on either a dual quad-Core2-Xeon or a single quad-{Sandy,Ivy}Bridge with hyperthreading.
How much gain does hyperthreading gain you?
henryzz is offline   Reply With Quote
Old 2012-05-15, 10:49   #7
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

3·2,141 Posts
Default

Quote:
Originally Posted by henryzz View Post
How much gain does hyperthreading gain you?
(0.06987 sec/rel) (i7-2600S, four threads running on one core each, time for median thread)
(0.11194 sec/rel) (i7-2600S, eight threads running on eight hypercores, time for median thread)

so with four threads you get 57.2 relations per second and with eight you get 71.4; so hyperthreading is gaining about 25%

You know, you could have done that arithmetic yourself.

Last fiddled with by fivemack on 2012-05-15 at 10:50
fivemack is offline   Reply With Quote
Old 2012-05-15, 16:42   #8
henryzz
Just call me Henry
 
henryzz's Avatar
 
"David"
Sep 2007
Cambridge (GMT/BST)

2·33·109 Posts
Default

Quote:
Originally Posted by fivemack View Post
(0.06987 sec/rel) (i7-2600S, four threads running on one core each, time for median thread)
(0.11194 sec/rel) (i7-2600S, eight threads running on eight hypercores, time for median thread)

so with four threads you get 57.2 relations per second and with eight you get 71.4; so hyperthreading is gaining about 25%

You know, you could have done that arithmetic yourself.
Sorry I didn't see the numbers above when I made that post.
henryzz is offline   Reply With Quote
Old 2012-05-15, 22:20   #9
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

3·2,141 Posts
Default

On an i7/960 at the office (so hyper-threading but not much turboing)

one thread 0.06429s 15.6r/s
six threads 0.06853s 87.6r/s
twelve threads 0.10810s 111r/s

so here I'm getting a 27% boost from HT, about the same as the SNB.

(the Q6600 downstairs gets me 0.10033s/r, one thread on the 48-thread K10 gets 0.12355)

The test is /home/nfsworld/gnfs-batalov-old/gnfs-lasieve4I14e -a testcase.txt -f 16000000 -c 2000 with the testcase.txt file attached; I'd be interested to see figures from people with Sandy/Ivy Bridges, and particularly I'd be interested in seeing N-thread and 2N-thread runs on Bulldozers. 64-bit Linux results would be comparable with the ones I've given here.
Attached Files
File Type: txt testcase.txt (558 Bytes, 77 views)

Last fiddled with by fivemack on 2012-05-15 at 22:45
fivemack is offline   Reply With Quote
Old 2012-05-15, 22:22   #10
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Why would HT help? In theory, one full core should be able to do the same work as two HThreads on one core.
Dubslow is offline   Reply With Quote
Old 2012-05-15, 22:35   #11
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

3·2,141 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Why would HT help? In theory, one full core should be able to do the same work as two HThreads on one core.
That's not true in theory, which is why it's not true in practice. It's not a matter of alternating issuing a line of micro-ops from each thread; what happens is that each thread pushes micro-ops into a separate queue, and the issue unit can pull micro-ops out of either queue to fill the execution units.

If the code were sufficiently superbly optimised that every execution unit could be filled at every cycle by either thread, then this would come out the same as alternation; but there's very little code (though prime95 is probably an example that comes close) which is that optimised. Generally there are opportunities for things from thread 2 to fill holes in thread 1 and vice versa.

Last fiddled with by fivemack on 2012-05-15 at 22:38
fivemack is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Motherboard for Xeon ATH Hardware 7 2015-10-10 02:13
New Xeon firejuggler Hardware 8 2014-09-10 06:37
Xeon Phi TObject Hardware 34 2013-10-17 20:52
Advice wanted - HP workstation with XEON processors AntonVrba Hardware 4 2006-06-04 11:01
Dual Xeon Workstation RickC Hardware 15 2003-12-17 01:35

All times are UTC. The time now is 19:30.


Sun Aug 1 19:30:07 UTC 2021 up 9 days, 13:59, 0 users, load averages: 1.86, 1.79, 1.84

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.