mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2020-08-15, 19:49   #584
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

1D6F16 Posts
Default

Setting affinity should not be necessary
Prime95 is online now   Reply With Quote
Old 2020-08-16, 22:16   #585
pepi37
 
pepi37's Avatar
 
Dec 2011
After milion nines:)

1,451 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Setting affinity should not be necessary
Yes, but please answer.
I ask this since this CPU have 12 cores
Can I use this scheme (under linux)
AffinityScramble2=0123456789ABCDEFGHIJKLMNOPQRSTUV


Worker #1]
Affinity=0,1

[Worker #2]
Affinity=2

Worker #3]
Affinity=3,4

[Worker #4]
Affinity=5

Worker #5]
Affinity=6,7

[Worker #6]
Affinity=8

Worker #7]
Affinity=9,A

[Worker #8]
Affinity=B

Is this looks correct?

Last fiddled with by pepi37 on 2020-08-16 at 22:16
pepi37 is offline   Reply With Quote
Old 2020-08-17, 01:17   #586
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009

195610 Posts
Default

Quote:
Originally Posted by pepi37 View Post
Yes, but please answer.
I ask this since this CPU have 12 cores
Can I use this scheme (under linux)
AffinityScramble2=0123456789ABCDEFGHIJKLMNOPQRSTUV

Worker #1]
Affinity=0,1

[Worker #2]
Affinity=2

Worker #3]
Affinity=3,4

[Worker #4]
Affinity=5

Worker #5]
Affinity=6,7

[Worker #6]
Affinity=8

Worker #7]
Affinity=9,A

[Worker #8]
Affinity=B

Is this looks correct?
You might actually get better performance if you cut this back to four workers:

[Worker #1]
Affinity=0,(1,2)

[Worker #2]
Affinity=3,(4,5)

[Worker #3]
Affinity=6,(7,8)

[Worker #4]
Affinity=9,(A,B)


4 worker with 2 helpers each.
storm5510 is offline   Reply With Quote
Old 2020-08-17, 01:51   #587
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

11101011011112 Posts
Default

Quote:
Originally Posted by pepi37 View Post
Yes, but please answer.
I ask this since this CPU have 12 cores
Can I use this scheme (under linux)
AffinityScramble2=0123456789ABCDEFGHIJKLMNOPQRSTUV


Worker #1]
Affinity=0,1

[Worker #2]
Affinity=2

Worker #3]
Affinity=3,4

[Worker #4]
Affinity=5

Worker #5]
Affinity=6,7

[Worker #6]
Affinity=8

Worker #7]
Affinity=9,A

[Worker #8]
Affinity=B

Is this looks correct?
AffinityScramble is deprecated.

Your settings are OK except "9,A" should be "9,10" and B should be 11.
Prime95 is online now   Reply With Quote
Old 2020-08-18, 20:31   #588
pepi37
 
pepi37's Avatar
 
Dec 2011
After milion nines:)

1,451 Posts
Default

Quote:
Originally Posted by storm5510 View Post
You might actually get better performance if you cut this back to four workers:

[Worker #1]
Affinity=0,(1,2)

[Worker #2]
Affinity=3,(4,5)

[Worker #3]
Affinity=6,(7,8)

[Worker #4]
Affinity=9,(A,B)


4 worker with 2 helpers each.
Since there is no rush doing CRUS sequence at home I am concentrated to best output, and 4 workers with 3 core each is not that in this case. But thanks for advice

Last fiddled with by pepi37 on 2020-08-18 at 20:31
pepi37 is offline   Reply With Quote
Old 2020-08-19, 00:00   #589
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009

195610 Posts
Default

Quote:
Originally Posted by pepi37 View Post
Since there is no rush doing CRUS sequence at home I am concentrated to best output, and 4 workers with 3 core each is not that in this case. But thanks for advice
You are most welcome.

George is correct. You should not have to use the "Affinity" settings. Closer to the top of local.txt, you might have something like this:

Quote:
WorkerThreads=x
CoresPerTest=x
Where x is a number. I started using Prime95, the Windows equivalent of mprime, in 2005. To date, Prime95 extremely rarely uses more than 50% of of a CPU's capability. I have tried to force it in the past without success. You could set WorkerThreads=12 and CoresPerTest=1, but I doubt it would use this much. Something has to be left for the operating system and other background processes.

What I have is an i7. Four physical cores and four logical cores. When I put this together, a long-time member here suggested I use one worker thread and four cores per test, the working being one of the four. This is 50% of the CPU's capacity. I have no "Affinity" settings. You can experiment with those two settings until you find what you feel does the best. If those two settings are not there, then you can add them. They are case-sensitive and must be written as I have them above.
storm5510 is offline   Reply With Quote
Old 2020-08-19, 02:43   #590
VBCurtis
 
VBCurtis's Avatar
 
"Curtis"
Feb 2005
Riverside, CA

22×1,217 Posts
Default

Quote:
Originally Posted by storm5510 View Post
To date, Prime95 extremely rarely uses more than 50% of of a CPU's capability. .
You are very mistaken. Prime95 uses more of a CPU's capability than nearly any other software, period. It is so efficient at using available capacity that using hyperthreads makes the software run slower. Even so, your belief that it's only using 50% of the CPU capability reflects a severe misunderstanding of what logical processors are- a misunderstanding you ought to remedy.

To wit: If you have a 4-lane bridge with 8 lanes of highway that merge into the bridge, you claim the bridge is only half used when I close 4 highway lanes and leave 4 lanes open to flow smoothly onto the bridge. Is the bridge half-used?
VBCurtis is offline   Reply With Quote
Old 2020-08-19, 02:59   #591
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

10101001010112 Posts
Default

Prime95 & mprime and other GIMPS primality test codes are typically memory bandwidth limited. So say their authors. George has given examples of using MORE instructions to use LESS data memory transfers. TF is less demanding of memory bandwidth. I've found I can run Ernst's Mfactor program on most HT cores alongside, for exponents too large to factor with mfaktx, with modest impact on prime95's primality test throughput, ~15%. https://en.wikipedia.org/wiki/Hyper-threading does not increase memory bandwidth, only certain parts of a core are duplicated. Intel indicated 15-30% performance increase, not 100%, from HT.
I've burned out two motherboards, with an i7-4790 running prime95 and its igp running mfakto at full tilt at the same time. Seems like rather full utilization to me, to be able to take the chip beyond the power rating the board was designed for and presumably would tolerate.
The Windows Task Manager display of core utilization, for n core & HT showing us 2n core utilization graphs, can be misleading.
Compare the attached Task Manager performance pane screen captures of a 4-core&HT i7-4790 and a dual-6-core-Xeon-x5650 (no HT), each running prime95 at its optimally benchmarked configuration;. The i7-4790 1 worker 4 cores no HT use by prime95 primality testing, 50% cpu utilization indicated for prime95, 13% for gpuowl, 63% total. The x5650s 2 workers 6 cores each chip package & the prime95 process indicates 98-99% cpu utilization. Also Core 2 Duo, no HT, prime95 process is 98% cpu utilization indicated. Prime95 automatically handles the helper thread core affinity for us.
Attached Thumbnails
Click image for larger version

Name:	i7-4790 loaded.png
Views:	98
Size:	76.5 KB
ID:	23107   Click image for larger version

Name:	dual-x5650 load.png
Views:	107
Size:	47.7 KB
ID:	23108   Click image for larger version

Name:	core2 load.png
Views:	100
Size:	30.2 KB
ID:	23109  

Last fiddled with by kriesel on 2020-08-19 at 03:52
kriesel is offline   Reply With Quote
Old 2020-08-19, 03:34   #592
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by storm5510 View Post
Something has to be left for the operating system and other background processes.
Prime95 runs at low priority and is preempted by the OS or assorted user processes. On a HT system, the virtual cores can come into play and lessen the impact. A prime95 worker is preempted as needed. This happens with gpuowl's GCD on a cpu core for example; one prime95 worker yields to it.
kriesel is offline   Reply With Quote
Old 2020-08-19, 16:27   #593
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009

22×3×163 Posts
Default

With everything above read, explain this (image attached). This is all I can get using multiple configurations. I only use HT on recommended processes. Others, it seems to have no affect. This is on a P-1, if that makes any differences. So, what is it that I have done wrong all this time.
Attached Thumbnails
Click image for larger version

Name:	Capture.JPG
Views:	137
Size:	62.3 KB
ID:	23113  

Last fiddled with by storm5510 on 2020-08-19 at 16:30
storm5510 is offline   Reply With Quote
Old 2020-08-19, 16:46   #594
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

165578 Posts
Default

Quote:
Originally Posted by storm5510 View Post
So, what is it that I have done wrong all this time.
That looks perfect!

It shows each physical core has a prime95 worker on it.
Prime95 is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Prime95 version 29.2 Prime95 Software 71 2017-09-16 16:55
Prime95 version 29.1 Prime95 Software 95 2017-08-22 22:46
Prime95 version 26.5 Prime95 Software 175 2011-04-04 22:35
Prime95 version 25.9 Prime95 Software 143 2010-01-05 22:53
Prime95 version 25.8 Prime95 Software 159 2009-09-21 16:30

All times are UTC. The time now is 20:48.


Sun Aug 1 20:48:16 UTC 2021 up 9 days, 15:17, 0 users, load averages: 1.24, 1.41, 1.57

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.