![]() |
|
|
#584 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
1D6F16 Posts |
Setting affinity should not be necessary
|
|
|
|
|
|
#585 |
|
Dec 2011
After milion nines:)
1,451 Posts |
Yes, but please answer.
I ask this since this CPU have 12 cores Can I use this scheme (under linux) AffinityScramble2=0123456789ABCDEFGHIJKLMNOPQRSTUV Worker #1] Affinity=0,1 [Worker #2] Affinity=2 Worker #3] Affinity=3,4 [Worker #4] Affinity=5 Worker #5] Affinity=6,7 [Worker #6] Affinity=8 Worker #7] Affinity=9,A [Worker #8] Affinity=B Is this looks correct? Last fiddled with by pepi37 on 2020-08-16 at 22:16 |
|
|
|
|
|
#586 | |
|
Random Account
Aug 2009
111101001002 Posts |
Quote:
[Worker #1] Affinity=0,(1,2) [Worker #2] Affinity=3,(4,5) [Worker #3] Affinity=6,(7,8) [Worker #4] Affinity=9,(A,B) 4 worker with 2 helpers each.
|
|
|
|
|
|
|
#587 | |
|
P90 years forever!
Aug 2002
Yeehaw, FL
5×11×137 Posts |
Quote:
Your settings are OK except "9,A" should be "9,10" and B should be 11. |
|
|
|
|
|
|
#588 |
|
Dec 2011
After milion nines:)
1,451 Posts |
Since there is no rush doing CRUS sequence at home I am concentrated to best output, and 4 workers with 3 core each is not that in this case. But thanks for advice
Last fiddled with by pepi37 on 2020-08-18 at 20:31 |
|
|
|
|
|
#589 | ||
|
Random Account
Aug 2009
7A416 Posts |
Quote:
George is correct. You should not have to use the "Affinity" settings. Closer to the top of local.txt, you might have something like this: Quote:
What I have is an i7. Four physical cores and four logical cores. When I put this together, a long-time member here suggested I use one worker thread and four cores per test, the working being one of the four. This is 50% of the CPU's capacity. I have no "Affinity" settings. You can experiment with those two settings until you find what you feel does the best. If those two settings are not there, then you can add them. They are case-sensitive and must be written as I have them above. |
||
|
|
|
|
|
#590 | |
|
"Curtis"
Feb 2005
Riverside, CA
22×1,217 Posts |
Quote:
To wit: If you have a 4-lane bridge with 8 lanes of highway that merge into the bridge, you claim the bridge is only half used when I close 4 highway lanes and leave 4 lanes open to flow smoothly onto the bridge. Is the bridge half-used? |
|
|
|
|
|
|
#591 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
541910 Posts |
Prime95 & mprime and other GIMPS primality test codes are typically memory bandwidth limited. So say their authors. George has given examples of using MORE instructions to use LESS data memory transfers. TF is less demanding of memory bandwidth. I've found I can run Ernst's Mfactor program on most HT cores alongside, for exponents too large to factor with mfaktx, with modest impact on prime95's primality test throughput, ~15%. https://en.wikipedia.org/wiki/Hyper-threading does not increase memory bandwidth, only certain parts of a core are duplicated. Intel indicated 15-30% performance increase, not 100%, from HT.
I've burned out two motherboards, with an i7-4790 running prime95 and its igp running mfakto at full tilt at the same time. Seems like rather full utilization to me, to be able to take the chip beyond the power rating the board was designed for and presumably would tolerate. The Windows Task Manager display of core utilization, for n core & HT showing us 2n core utilization graphs, can be misleading. Compare the attached Task Manager performance pane screen captures of a 4-core&HT i7-4790 and a dual-6-core-Xeon-x5650 (no HT), each running prime95 at its optimally benchmarked configuration;. The i7-4790 1 worker 4 cores no HT use by prime95 primality testing, 50% cpu utilization indicated for prime95, 13% for gpuowl, 63% total. The x5650s 2 workers 6 cores each chip package & the prime95 process indicates 98-99% cpu utilization. Also Core 2 Duo, no HT, prime95 process is 98% cpu utilization indicated. Prime95 automatically handles the helper thread core affinity for us. Last fiddled with by kriesel on 2020-08-19 at 03:52 |
|
|
|
|
|
#592 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Prime95 runs at low priority and is preempted by the OS or assorted user processes. On a HT system, the virtual cores can come into play and lessen the impact. A prime95 worker is preempted as needed. This happens with gpuowl's GCD on a cpu core for example; one prime95 worker yields to it.
|
|
|
|
|
|
#593 |
|
Random Account
Aug 2009
22·3·163 Posts |
With everything above read, explain this (image attached). This is all I can get using multiple configurations. I only use HT on recommended processes. Others, it seems to have no affect. This is on a P-1, if that makes any differences. So, what is it that I have done wrong all this time.
Last fiddled with by storm5510 on 2020-08-19 at 16:30 |
|
|
|
|
|
#594 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
5·11·137 Posts |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Prime95 version 29.2 | Prime95 | Software | 71 | 2017-09-16 16:55 |
| Prime95 version 29.1 | Prime95 | Software | 95 | 2017-08-22 22:46 |
| Prime95 version 26.5 | Prime95 | Software | 175 | 2011-04-04 22:35 |
| Prime95 version 25.9 | Prime95 | Software | 143 | 2010-01-05 22:53 |
| Prime95 version 25.8 | Prime95 | Software | 159 | 2009-09-21 16:30 |