mersenneforum.org  

2016-04-17, 23:45   #1
halahup
 
Apr 2016
Austin, TX

2·3 Posts
i7 6700K 50% load with 4 workers

My i7-6700K processor only shows 50% load while using 4 workers. If I change the settings to 8 workers it loads to 100%, but it then splits every physical core into 2 logical ones. Is there a setting or a way to have 4 workers load 100% on just those 4 physical cores? Please advise, thanks in advance!
2016-04-18, 00:25   #2
ewmayer
2ω=0
 
Sep 2002
República de California

103·113 Posts

"50%" of what? If the OS calibrates things such that 100% represents all 8 logical cores fully busy, then 4-threaded will indeed appear as 50%. Above 50% the percentage will likely have very little to do with the total throughput in that case, though -- 100% means there are as many threads wanting to use a core as there are logical cores.

If you go to 8 workers, do you see 100% (or close to it)? And how do the runtimes compare to the 4-worker case? If going to 8 workers causes the per-iteration time to roughly double, that means "50%" does indeed mean "all 4 physical cores maxed out".

Assuming the above is the case, you should probably also try some 1-worker/4-thread timings. If the resulting (timing*4) is greater than the average per-worker timing with 4 single-threaded workers, that means the latter run mode maximizes your total throughput.
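
The comparison boils down to simple arithmetic, which can be sketched in Python as below. The function names and the sample timings are made up for illustration; they are not prime95 output or measurements from a real machine.

```python
def total_throughput(workers: int, ms_per_iter: float) -> float:
    """Iterations per second summed over all workers."""
    return workers * 1000.0 / ms_per_iter


def best_mode(one_worker_4_threads_ms: float, four_workers_avg_ms: float) -> str:
    """Pick the run mode with the higher total throughput.

    Same rule as above: if (1-worker timing * 4) exceeds the average
    per-iteration timing of 4 single-threaded workers, 4 workers win.
    """
    multi_threaded = total_throughput(1, one_worker_4_threads_ms)
    multi_worker = total_throughput(4, four_workers_avg_ms)
    if multi_worker > multi_threaded:
        return "4 workers x 1 thread"
    return "1 worker x 4 threads"


# Hypothetical timings in ms per iteration, not real measurements.
print(best_mode(one_worker_4_threads_ms=6.0, four_workers_avg_ms=20.0))
```

With these invented numbers, 6.0 * 4 = 24 ms is more than 20 ms, so the 4-worker mode comes out ahead. Plug in your own timings to see which way it goes on your machine.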
2016-04-18, 01:05   #3
Mark Rose
 
"/X\(‘-‘)/X\"
Jan 2013

5564₈ Posts

I'm not familiar with the Windows version, but what you probably want is processor affinity. I bet Windows is randomly running the four threads across the eight hyperthreads, resulting in 50% usage across the board.
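
To make the affinity idea concrete, here is a rough Python sketch that works out which logical CPUs to pin to so that each worker lands on its own physical core. It assumes the two hyperthreads of a core are numbered next to each other (0 and 1 on core 0, 2 and 3 on core 1, and so on), which is not guaranteed on every OS. The helper names are hypothetical and nothing here calls into prime95 or the OS.

```python
from typing import List


def one_thread_per_core(physical_cores: int, threads_per_core: int = 2) -> List[int]:
    """Logical CPU numbers that cover each physical core exactly once.

    Assumption: sibling hyperthreads are numbered adjacently.
    """
    return [core * threads_per_core for core in range(physical_cores)]


def affinity_mask(cpus: List[int]) -> int:
    """Bitmask with one bit set per selected logical CPU."""
    mask = 0
    for cpu in cpus:
        mask |= 1 << cpu
    return mask


cpus = one_thread_per_core(4)          # 4 cores, 2 hyperthreads each
print(cpus, hex(affinity_mask(cpus)))
```

The resulting list or mask is the kind of value an affinity setting expects; check how your system actually numbers its hyperthreads before relying on it.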
2016-04-18, 01:30   #4
bgbeuning
 
Dec 2014

255₁₀ Posts

Short answer: prime95 runs best with 4 workers on this i7.

Long answer: the i7-6700K has 4 cores but 8 hardware threads. prime95 uses the floating point unit (FPU) a lot, and the i7 has 1 FPU per core. If you configure prime95 to use 8 workers on this i7, you will see the time per iteration roughly double because 2 workers are sharing one FPU.
2016-04-18, 03:01   #5
halahup
 
Apr 2016
Austin, TX

2·3 Posts

Quote:
Originally Posted by ewmayer View Post
"50%" of what? If the OS calibrates things such that 100% represents all 8 logical cores fully busy, then 4-threaded will indeed appear as 50%. Above 50% the percentage will likely have very little to do with the total throughput in that case, though -- 100% means there are as many threads wanting to use a core as there are logical cores.

If you go to 8 workers, do you see 100% (or close to it)? And how do the runtimes compare to the 4-worker case? If going to 8 workers causes the per-iteration time to roughly double, that means "50%" does indeed mean "all 4 physical cores maxed out".

Assuming the above is the case, you should probably also try some 1-worker/4-thread timings. If the resulting (timing*4) is greater than the average per-worker timing with 4 single-threaded workers, that means the latter run mode maximizes your total throughput.
Yeah, if I use 8 workers then Task Manager shows 100% load, but the timing increases drastically (the ETA is almost the same as on my FX-8370 processor). If I use 4 workers, the timing is faster but the load is 50%. Also, the status field says prime95 is checking 16 exponents with 8 workers, versus 8 exponents with 4 workers.
2016-04-18, 03:03   #6
halahup
 
Apr 2016
Austin, TX

6₁₀ Posts

So would you suggest using just 4 workers even if it only loads 50% of CPU?
2016-04-18, 03:09   #7
halahup
 
Apr 2016
Austin, TX

2·3 Posts

I've attached a screenshot of Task Manager.
Attachment: prime.PNG (201.6 KB)
2016-04-18, 03:21   #8
axn
 
Jun 2003

11675₈ Posts

Quote:
Originally Posted by halahup View Post
So would you suggest using just 4 workers even if it only loads 50% of CPU?
Yes. The OS _thinks_ it is only using 50% of CPU, but it is in fact using 100% of the floating point execution units (which is what is relevant for LL testing). So you're getting the maximum out of your CPU with 4 workers.
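
A back-of-the-envelope version of that in Python, in case it helps. This is only a sketch: it assumes the OS percentage is simply busy threads divided by logical CPUs, and that each busy thread gets its own physical core until the cores run out. The function names are invented for the example.

```python
def os_reported_load(busy_threads: int, logical_cpus: int) -> float:
    """Percent load as a task manager would show it (assumed formula)."""
    return 100.0 * min(busy_threads, logical_cpus) / logical_cpus


def fpu_load(busy_threads: int, physical_cores: int) -> float:
    """Percent of per-core floating point units kept busy.

    Assumption: threads are spread one per physical core first.
    """
    return 100.0 * min(busy_threads, physical_cores) / physical_cores


# 4 workers on a 4-core / 8-thread CPU
print(os_reported_load(4, 8), fpu_load(4, 4))
# 8 workers on the same CPU
print(os_reported_load(8, 8), fpu_load(8, 4))
```

Under these assumptions the OS figure goes from 50% to 100% when you double the workers, while the FPU figure is already at 100% with 4.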
2016-04-18, 03:23   #9
halahup
 
Apr 2016
Austin, TX

2×3 Posts

Quote:
Originally Posted by axn View Post
Yes. The OS _thinks_ it is only using 50% of CPU, but it is in fact using 100% of the floating point execution units (which is what is relevant for LL testing). So you're getting the maximum out of your CPU with 4 workers.
Would you suggest using 4 workers with 2 threads per worker? With that setup the system shows 100% load.
2016-04-18, 03:24   #10
axn
 
Jun 2003

31×163 Posts

Quote:
Originally Posted by halahup View Post
Would you suggest using 2 threads per worker and 4 workers? Because then the system shows 100% work load.
It would show 100% load. It might even make it a _tiny_ bit faster (or not). But it will increase heat/power consumption, and will make the computer slightly less responsive. It isn't worth it.
2016-04-18, 03:40   #11
halahup
 
Apr 2016
Austin, TX

110₂ Posts

Quote:
Originally Posted by axn View Post
It would show 100% load. It might even make it a _tiny_ bit faster (or not). But it will increase heat/power consumption, and will make the computer slightly less responsive. It isn't worth it.
OK, yeah, it runs even faster with 4 workers and only 1 thread per worker (50% load) than with 2 threads per worker (100% load). I guess the architecture is not as simple as I expected it to be. Thank you everyone!

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.