![]() |
![]() |
#1 |
Mar 2014
24·5 Posts |
![]()
I'm trying to help out with the strategic double check project a bit and I want to figure out the best way to run 1 or two test at a time. currently I'm running 1 test on my 5930k which is 6 physical cores with hyperthreading to make 12 virtual cores; I have the worker windows setting with #workers set to 1, CPU affinity: "run on any CPU", CPUs to use:"12". Is this the best way to do it? is there any way to do it better? and I assume that if I'm running two tests I would want to set it to 2 workers and 6 CPUs each.
|
![]() |
![]() |
![]() |
#2 | |
If I May
"Chris Halsall"
Sep 2002
Barbados
2×3×1,741 Posts |
![]() Quote:
The next critical thing is "real, on-chip" CPU affinity. Madpoo can speak to what is needed under Windows. If you have more than one physical CPU, then (and only then) is when you go to multiple tests. Last fiddled with by chalsall on 2015-09-24 at 00:11 Reason: Closed quote. |
|
![]() |
![]() |
![]() |
#3 | |
Mar 2014
24·5 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
#4 | |
"Curtis"
Feb 2005
Riverside, CA
2×7×13×29 Posts |
![]() Quote:
You also may find that, say, 5 LL tests is as much production as 6 tests, allowing that 6th core to do something else from this forum- ECM in particular is not very memory-bandwidth intensive. |
|
![]() |
![]() |
![]() |
#5 | |
If I May
"Chris Halsall"
Sep 2002
Barbados
2×3×1,741 Posts |
![]() Quote:
The best way to find the "sweet spot" for optimal configuration is empirical testing. Try different settings, note the speeds achieved. Do the math (read: build a spreadsheet to record data). Rince and repeat (as necessary). Prime95/mprime's benchmarking option helps a lot in this analysis, but I found that it ignored custom Affinity settings (at least under Linux) so I ended up running the same candidate dozens of times just to establish a baseline. And, to put on the table, I /only/ do Linux. Again, Madpoo is a better man to speak about optimizing Windows. |
|
![]() |
![]() |
![]() |
#6 | |
Romulan Interpreter
"name field"
Jun 2011
Thailand
100110111010012 Posts |
![]() Quote:
Accordingly, for two workers use "CPUs to use: 3" The HT will only succeed to muddy the waters, without bringing any benefits. P95 is enough optimized to take advantage of the full core, without HT. HT is only for programs that don't use the core fully (having idle ticks), so splitting it in two gives other processes/tasks/programs the opportunity to use the free ticks. It is not the case for P95, in its case one of the logical core is waiting, because the other one uses all cycles, then after the first finishes it is waiting for the second, then some more time is used to synchronize them each-other, etc. So, HT does not bring any benefits. Contrarily. Remark that I didn't say "the best way", but "a better way". The best way may be for you to reduce the number of cores more, if you have a slow memory, for example, or not enough channels. But for sure, using 6 cores with a single worker is a better way than using 12 cores. You can use Options/Benchmark from the P95 menu, compute the output speed and the time you need in each case, and see for yourself which way is better for your system. |
|
![]() |
![]() |
![]() |
#7 |
Mar 2014
8010 Posts |
![]()
hmm so I've been doing a bit of testing and it seems that 12 CPUs is a 15% faster than 6CPUs the cause of this seems to be that P95 is only hitting about 50% CPU utilization any idea how to get better utilization? Also when running more than one worker what is the difference between smart assignment and run on any CPU both technically and performance wise?
|
![]() |
![]() |
![]() |
#8 | |
Aug 2002
North San Diego County
2·3·112 Posts |
![]() Quote:
For example, the 6C12T box I use to quickly check single exponents is setup like this: Code:
CPU Affinity: CPU #1 CPUs to use: 6 AffinityScramble2=13579B02468A (in the local.txt file) If I turn hyperthreading off in BIOS (and take out the AffinityScramble2 line), then task manager shows +-100% CPU use, and the iteration times don't change materially. Last fiddled with by sdbardwick on 2015-09-25 at 05:21 |
|
![]() |
![]() |
![]() |
#9 |
Romulan Interpreter
"name field"
Jun 2011
Thailand
7×1,423 Posts |
![]()
That is a windows "bug", is not really a bug, but it is how windows count the cores, you only use 6 from 12 possible, so no matter wht you do with them, you will not be able to see more than 50% occupancy. Your CPU is used close to 100% in that case, don't worry. Windoze makes a distinction between "single physical core" and "two logical cores running on a single physical core", he sees that as two cores (which is arguable wrong, but that is the case). If you disable HT in bios, you will still get your 15% more speed, in 6 cores only, most probably. In fact, it depends of many other things, including what else you are running in the (other) logical cores in this time. But the result may also mean that you are not memory BW limited, which is good. You may also make tests and see if your 15% higher speed doesn't come with 70% higher heat and/or current consumption (which it is usually the case with HT, here the consensus is that for LL, HT is not good - but of course it is up to you, and all computers/systems are different).
Edit: Crosspost, I started to reply before the last post was made. We are in a divergent agreement, like someone here used to say, so I will let my post be. Last fiddled with by LaurV on 2015-09-25 at 05:42 |
![]() |
![]() |
![]() |
#10 |
"Curtis"
Feb 2005
Riverside, CA
149E16 Posts |
![]()
I think you mean "violent agreement".
dragonbud should learn to not trust what windows tells him. |
![]() |
![]() |
![]() |
#11 |
Romulan Interpreter
"name field"
Jun 2011
Thailand
26E916 Posts |
![]() |
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
How-to guide for running LL tests on the Amazon EC2 cloud | GP2 | Cloud Computing | 68 | 2021-11-17 16:39 |
Fast and robust error checking on Proth/Pepin tests | R. Gerbicz | Number Theory Discussion Group | 15 | 2018-09-01 13:23 |
Is it possible to disable benchmarking while torture tests are running? | ZFR | Software | 4 | 2018-02-02 20:18 |
LL tests running at different speeds | GARYP166 | Information & Answers | 11 | 2009-07-13 19:39 |
4 checkins in a single calendar month from a single computer | Gary Edstrom | Lounge | 7 | 2003-01-13 22:35 |