mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2012-05-15, 22:45   #12
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

1C3516 Posts
Default

Quote:
Originally Posted by fivemack View Post
That's not true in theory, which is why it's not true in practice. It's not a matter of alternating issuing a line of micro-ops from each thread; what happens is that each thread pushes micro-ops into a separate queue, and the issue unit can pull micro-ops out of either queue to fill the execution units.

If the code were sufficiently superbly optimised that every execution unit could be filled at every cycle by either thread, then this would come out the same as alternation; but there's very little code (though prime95 is probably an example that comes close) which is that optimised. Generally there are opportunities for things from thread 2 to fill holes in thread 1 and vice versa.
Ah, okay. Prime95 forms the core of my experiences with high-speed computing :-)

So despite my monitor showing full CPU load, that doesn't necessarily mean that each cycle is being used to execute the program? (Currently running GMP-ECM via YAFU, but it appears the same (fully loaded) whether it's Msieve, GMP-ECM, YAFU/SIQS, or lasieve/ that's running.)
Dubslow is offline   Reply With Quote
Old 2012-05-15, 22:59   #13
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

144278 Posts
Default

It's not so much a question of every cycle being used, as of every sub-slot within every cycle being used, and I don't think performance monitors generally monitor at that level.

For ecm on the i7/960:

1 copy does a 1e7 curve on (102^128+1)/7443446978046721 in 85.9s
6 copies take average 89.4s
12 copies take average 169.7s

so ecm gains only about 5% from HT - still not to be sniffed at

When I wrote a program whose purpose in life was entirely to stall on memory access, I get a 60% gain from HT - six copies run in 16 seconds each, twelve copies run in 20 seconds each

Last fiddled with by fivemack on 2012-05-15 at 23:43
fivemack is offline   Reply With Quote
Old 2012-05-21, 13:13   #14
R.D. Silverman
 
R.D. Silverman's Avatar
 
Nov 2003

22·5·373 Posts
Default

Quote:
Originally Posted by fivemack View Post
No, for GNFS sieving you want to run eight threads on either a dual quad-Core2-Xeon or a single quad-{Sandy,Ivy}Bridge with hyperthreading.
Running 8 GNFS threads requires a LOT of memory........
R.D. Silverman is offline   Reply With Quote
Old 2012-05-21, 13:20   #15
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

3×2,141 Posts
Default

Quote:
Originally Posted by R.D. Silverman View Post
Running 8 GNFS threads requires a LOT of memory........
Only gnfs-lasieve4I16e uses more than 1GB per thread, and even 16GB is a reasonable configuration for a desktop bought nowadays - order of $100.
fivemack is offline   Reply With Quote
Old 2012-05-21, 14:09   #16
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

965310 Posts
Default

Quote:
Originally Posted by fivemack View Post
Only gnfs-lasieve4I16e uses more than 1GB per thread, and even 16GB is a reasonable configuration for a desktop bought nowadays - order of $100.
sieving with 16 threads with 4I13e, for a C127 of aliquot 4290.i2321 in this right moment, it takes 29MB per thread (I may be again parallel with the subject...)
Attached Thumbnails
Click image for larger version

Name:	taskman.JPG
Views:	90
Size:	33.1 KB
ID:	8016  

Last fiddled with by LaurV on 2012-05-21 at 14:10
LaurV is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Motherboard for Xeon ATH Hardware 7 2015-10-10 02:13
New Xeon firejuggler Hardware 8 2014-09-10 06:37
Xeon Phi TObject Hardware 34 2013-10-17 20:52
Advice wanted - HP workstation with XEON processors AntonVrba Hardware 4 2006-06-04 11:01
Dual Xeon Workstation RickC Hardware 15 2003-12-17 01:35

All times are UTC. The time now is 19:30.


Sun Aug 1 19:30:08 UTC 2021 up 9 days, 13:59, 0 users, load averages: 1.86, 1.79, 1.84

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.