![]() |
|
|
#1 |
|
"J. Gareth Moreton"
Feb 2015
Nomadic
2×32×5 Posts |
Hi everyone,
I have a quad-core Intel Core i7-4770 running at 3.40GHz running the 64-bit version of Windows 7, and with 4 worker threads running, I'm currently averaging about 24 to 25 ms/iteration for first-time checks (in the 60m range), and about 14 ms/iteration for double checks (in the 35m range). If I turn off one of the worker threads, these times drop to 20 ms/iteration for first-time checks and 11 ms/iteration for double checks. Of course, I'm still learning the quirks of the thread scheduler, but for a computer that is used as a work machine as well, this seems to be quite a boost in efficiency, although I'm not entirely certain if it equates or surpasses having an extra worker thread. What are people's experience with this? |
|
|
|
|
|
#2 |
|
Jan 2015
25310 Posts |
Don't have time to post an explanation, but being very succinct: google interrupts, hyperthreading, and turbo boost.
|
|
|
|
|
|
#3 | |
|
1976 Toyota Corona years forever!
"Wayne"
Nov 2006
Saskatchewan, Canada
111228 Posts |
Quote:
And don't worry about the machine doing "real" work too. Prime95 runs at the lowest priority and unless you are using memory intensive work like P-1; or Large ECM you will NOT notice any impact on your work. |
|
|
|
|
|
|
#4 |
|
1976 Toyota Corona years forever!
"Wayne"
Nov 2006
Saskatchewan, Canada
2·5·7·67 Posts |
|
|
|
|
|
|
#5 |
|
Jan 2015
11×23 Posts |
I didn't propose that he use HT, just to look it up. If you don't leave a physical core free, you'll be interrupting p95 so that your computer can do its everyday tasks which will be threaded in a logical core. I'd recommend trying 'intel power gadget' a try so you can see how turbo boost, heat, and load will affect your CPU performance in real time.
|
|
|
|
|
|
#6 |
|
"/X\(‘-‘)/X\"
Jan 2013
55628 Posts |
I also have a 4770 and I have the same experience, with hyperthreading enabled.
3 cores slightly outperforms 4. |
|
|
|
|
|
#7 | |
|
Serpentine Vermin Jar
Jul 2014
3,313 Posts |
Quote:
For me, I would run two workers, each one using all the cores on it's own CPU (it's a dual CPU system). I experienced exactly what you're describing if one of the two workers was doing an LL test on any exponent > 58M. If that's the case, the other worker would need to be doing a DC on something below 38M otherwise they would both slow down a lot. My theory had to do with memory contention or something like that, but whatever... I just know to either keep both workers doing something below 58M, or if I must do something larger, set the other one to a small double-check task. I recently set some of my "lower end" systems to do a single core per worker, which means I have some with 8 or 12 workers going. I've found that the same basic rule applies... if any single worker is doing a test > 58M then it'll slow down anything else doing something > 38M. As long as I have all of them doing small work (double checks below 50M or so) I don't have to worry about it. There is an exception to this, and that's on my newer system with dual Xeon E5 2697 v3 and DDR4 RAM. Not sure if it's the larger L1/L2/L3 caching or the faster DDR4 or just something else, but that > 58M limit doesn't apply. Right now I've got it set for two workers (using all the cores on either CPU)... one is doing a 100M test and the other a 60M and they don't interfere with each other. I haven't really pushed it yet to see what the limit is for it... if I ever have the need I'll experiment to see where it winds up. |
|
|
|
|
|
|
#8 | |
|
Jun 2003
10011110110102 Posts |
Quote:
20 ms/iter x 3 workers = 150 iters/sec thruput. 14 ms/iter x 4 workers = 285 iters/sec thruput 11 ms/iter x 3 workers = 272 iters/sec thruput Suggesting that you're very nearly memory bottlenecked, but 4 workers is still better than 3 workers in terms of total productivity. What is your RAM spec? |
|
|
|
|
|
|
#9 |
|
"J. Gareth Moreton"
Feb 2015
Nomadic
2·32·5 Posts |
I have 16 GB of RAM on the machine in question.
|
|
|
|
|
|
#10 |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
262716 Posts |
Dual channel? As in, pair(s) of "sticks"? I assume so based on the amount of RAM you have, but it's an important variable.
Also, what speed? Another thing which is absolutely critical in optimizing Prime95/mprime is to get the affinity correct. At least under Linux (mprime) I have found that without explicitly setting the affinity manually (via the AffinityScramble2 parameter in local.txt) the processes can often jump around, sometimes ending up with hyperthreaded "virtual" cores being used. |
|
|
|
|
|
#11 |
|
1976 Toyota Corona years forever!
"Wayne"
Nov 2006
Saskatchewan, Canada
2×5×7×67 Posts |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| reduce to 108119486 relation sets and 0 unique ideals | Alfred | Msieve | 2 | 2017-04-02 07:01 |
| Worker #5 and Worker#7 not running (Error ILLEGAL SUMOUT | skrupian08 | Information & Answers | 9 | 2016-08-23 16:35 |
| How to reduce number of worker windows? | Chuck | PrimeNet | 7 | 2011-07-03 19:17 |
| Reduce your debt!! ... I'm curious. | petrw1 | Lounge | 59 | 2009-01-21 12:48 |
| Any way to reduce CPU usage? | Jarl | Hardware | 5 | 2007-03-30 19:13 |