mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2020-01-09, 22:57   #1739
PhilF
 
PhilF's Avatar
 
Feb 2005
Colorado

23·3·19 Posts
Default

Quote:
Originally Posted by mrh View Post
I'm not an amd expert, but I think that indicates you are using the "amd pro" drivers (I think that is what they are called) vs. rocm.

-mike
That would make sense. If so, I highly recommend the pro drivers to others.
PhilF is offline   Reply With Quote
Old 2020-01-10, 00:13   #1740
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

381410 Posts
Default Windows build for gpuowl v6.11-116-g5ca090d

For anyone who'd like to give it a try on Windows, this build of the latest available commit was done on Windows 7 x64 minutes ago. I haven't tried it past the help function. Make gpuown-win again generated the usual shower of warnings; see build-log.txt attached.
Quote:
Originally Posted by preda View Post
For a task of the form:
PRP=XXXXXXXX,1,2,91408469,-1,77,1
i.e. note the final integer, let's call it "wantsPm1", being "1" instead of the usual "0" -- this indicates that P-1 testing is desired;

gpuowl will automatically expand the task into a P-1 and a PRP with the "wantsPm1" set to 0.

It works like this:
- gpuowl reads the first good line from worktodo.txt
- if that line is a PRP with wantsPm1 non-zero, two new tasks are *appended* to the worktodot.txt (i.e. at the end)
- after which the PRP task that was having wantPm1 is deleted from worktodo.txt
- loop to find the first task in workdoto.txt

I.e. this would result in a re-ordering of the tasks in worktodo.txt because the "expanded" tasks are always added to the end.

It is likely there are some bugs, please let me know if you see any.
Attached Files
File Type: 7z gpuowl-v6.11-116-g5ca090d.7z (441.1 KB, 19 views)
File Type: txt build-log.txt (5.5 KB, 16 views)
kriesel is offline   Reply With Quote
Old 2020-01-10, 00:56   #1741
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

6,833 Posts
Default

Quote:
Originally Posted by preda View Post
gpuowl will automatically expand the task into a P-1 and a PRP with the "wantsPm1" set to 0.
If P-1 finds a factor are both the P-1 and PRP lines deleted?
Prime95 is offline   Reply With Quote
Old 2020-01-10, 01:42   #1742
wfgarnett3
 
wfgarnett3's Avatar
 
"William Garnett III"
Oct 2002
Bensalem, PA

22×3×7 Posts
Default

Quote:
Originally Posted by kriesel View Post
An unfortunate feature of prime95 is that without hyperthreading enabled on the host, occupying one cpu core with something else can cost an entire prime95 worker's output, however many cores that is. A little cpu usage by gpuowl even with -yield is normal. Some cpu cycles are used for save checkpoints to disk, screen output, doing the GEC, etc. But if -yield is in the config.txt or the command line, it should be reduced from the full cpu core or hyperthread that occurs without that option. How much were you seeing without -yield, and how much with?
Hyperthreading is enabled on my 2 core i3-4150.

-yield does help somewhat.

Here are my tests.

Screenshot 1 is Prime95 by itself PRP testing 90519811 with a 14.3 iteration time and 53% CPU usage.
Screenshot 2 is gpuOwL by itself without -yield PRP testing 81943843 with a 17.7 iteration time and 27% CPU usage.
Screenshot 3 is gpuOwL by itself with -yield with a 17.8 iteration time and 31% CPU usage.
Screenshot 4 is both Prime95 and gpuOwL (without -yield) showing Prime95 has a 19.1 iteration time and 81% CPU usage (thus gpuOwL slowed Prime95 down from 14.3 to 19.1).
Screenshot 5 is both Prime95 and gpuOwL (with -yield) showing Prime95 now has a 17.7 iteration time and 77% CPU usage so the -yield option helped some.

Thanks.
Attached Thumbnails
Click image for larger version

Name:	screenshot1.png
Views:	15
Size:	139.1 KB
ID:	21584   Click image for larger version

Name:	screenshot2.png
Views:	13
Size:	123.2 KB
ID:	21585   Click image for larger version

Name:	screenshot3.png
Views:	9
Size:	126.8 KB
ID:	21586   Click image for larger version

Name:	screenshot4.png
Views:	12
Size:	145.4 KB
ID:	21587   Click image for larger version

Name:	screenshot5.png
Views:	12
Size:	157.5 KB
ID:	21588  


Last fiddled with by wfgarnett3 on 2020-01-10 at 01:43
wfgarnett3 is offline   Reply With Quote
Old 2020-01-10, 04:15   #1743
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

73468 Posts
Default

Quote:
Originally Posted by wfgarnett3 View Post
Hyperthreading is enabled on my 2 core i3-4150.

-yield does help somewhat.

Here are my tests.

Screenshot 1 is Prime95 by itself PRP testing 90519811 with a 14.3 iteration time and 53% CPU usage.
Screenshot 2 is gpuOwL by itself without -yield PRP testing 81943843 with a 17.7 iteration time and 27% CPU usage.
Screenshot 3 is gpuOwL by itself with -yield with a 17.8 iteration time and 31% CPU usage.
Screenshot 4 is both Prime95 and gpuOwL (without -yield) showing Prime95 has a 19.1 iteration time and 81% CPU usage (thus gpuOwL slowed Prime95 down from 14.3 to 19.1).
Screenshot 5 is both Prime95 and gpuOwL (with -yield) showing Prime95 now has a 17.7 iteration time and 77% CPU usage so the -yield option helped some.

Thanks.
See https://www.mersenneforum.org/showpo...postcount=1403 for how much difference -yield made in my case; the difference between one core saturated, and 2% of a core. What else is using cpu during your gpuowl-only runs? Windows Task Manager, Processes tab, sort by cpu % usage. On my systems, the ratio of accumulated cpu time prime95 to gpuowl-win are >20:1.
kriesel is offline   Reply With Quote
Old 2020-01-10, 08:13   #1744
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

17578 Posts
Default

Quote:
Originally Posted by Prime95 View Post
If P-1 finds a factor are both the P-1 and PRP lines deleted?
No, not yet (an oversight on my part). What should I do with the AID and the assignment relative to primeNet?

- should I put the AID (of the PRP) on the P-1 factor-found result?
- if I simply drop the PRP assignment from worktodo.txt on P-1 factor found, it would still be assigned on the server even if the factor is reported?
preda is offline   Reply With Quote
Old 2020-01-10, 08:48   #1745
wfgarnett3
 
wfgarnett3's Avatar
 
"William Garnett III"
Oct 2002
Bensalem, PA

22·3·7 Posts
Default

Quote:
Originally Posted by kriesel View Post
See https://www.mersenneforum.org/showpo...postcount=1403 for how much difference -yield made in my case; the difference between one core saturated, and 2% of a core. What else is using cpu during your gpuowl-only runs? Windows Task Manager, Processes tab, sort by cpu % usage. On my systems, the ratio of accumulated cpu time prime95 to gpuowl-win are >20:1.
See attached screenshots -- gpuOwL-only runs are basically using all of the 27% CPU usage
Attached Thumbnails
Click image for larger version

Name:	screenshot6.png
Views:	16
Size:	130.2 KB
ID:	21589   Click image for larger version

Name:	screenshot7.png
Views:	15
Size:	126.9 KB
ID:	21590  

Last fiddled with by wfgarnett3 on 2020-01-10 at 08:52
wfgarnett3 is offline   Reply With Quote
Old 2020-01-10, 12:26   #1746
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

11111011112 Posts
Default Abort PRP test on P-1 factor found

I added some untested code that is supposed to:

1. when a P-1 factor is found, all PRP entries from worktodo.txt for the same exponent are removed. No result is written (to results.txt) for these deleted tasks.
2. when a P-1 factor is found in the background (GCD) while a PRP test for the same exponent is ongoing, the PRP test is aborted early and the point 1. above is applied.

I think this solution [in addition to bugs] has the problem of leaving PRP assignments "hanging" on primenet. Maybe the server could implement auto-release of the PRP assignments of a user when that user submits a factor for the same exponent (because, after a factor found, it does not make sense for the user that found the factor to pursue the PRP tests)

Quote:
Originally Posted by preda View Post
No, not yet (an oversight on my part). What should I do with the AID and the assignment relative to primeNet?

- should I put the AID (of the PRP) on the P-1 factor-found result?
- if I simply drop the PRP assignment from worktodo.txt on P-1 factor found, it would still be assigned on the server even if the factor is reported?
preda is offline   Reply With Quote
Old 2020-01-10, 12:36   #1747
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

19×53 Posts
Default

Quote:
Originally Posted by kriesel View Post
Make gpuown-win again generated the usual shower of warnings; see build-log.txt attached.
Ken, I'm aware of your complaint agains those warnings, and I did look into them. IMO those warnings are invalid, a compiler problem. They could be silenced with some effort, but again IMO that effort is not worth expending because the [invalid] warnings are an incovenience only for the person building the program (Ken) but not for the users.
preda is offline   Reply With Quote
Old 2020-01-10, 14:00   #1748
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

381410 Posts
Default -yield effect varies; Windows version?

Quote:
Originally Posted by wfgarnett3 View Post
See attached screenshots -- gpuOwL-only runs are basically using all of the 27% CPU usage
I don't see any of the recent performance-enhancing -USE options on your runs. See https://www.mersenneforum.org/showpo...postcount=1654 for tuning data on a GTX1050Ti
-CARRY32 may also help.
A couple of my systems' cpu overhead for gpuowl-win are shown in the screen captures. Roa (Windows 10) is running just over a full HT "core" of overhead (of total 24 cores plus HT, one "core" = 1/48 =2.08%) while condorella (Windows 7) is running a tiny fraction of that. I don't know why there's such a difference.

wfgarnett3's 27% utilization is also ~one HT "core" on a dual-core HT Windows 10 system IIRC.

Does -yield work on Window 7 and not on Windows 10? Any Windows 8.x users out there?
Attached Thumbnails
Click image for larger version

Name:	roa-gpuowl-cpu-overhead.png
Views:	6
Size:	117.1 KB
ID:	21592   Click image for larger version

Name:	condorella-gpuowl-cpu-overhead.png
Views:	7
Size:	95.1 KB
ID:	21593  
kriesel is offline   Reply With Quote
Old 2020-01-10, 14:07   #1749
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

2×1,907 Posts
Default

Quote:
Originally Posted by preda View Post
- should I put the AID (of the PRP) on the P-1 factor-found result?
It might be the easiest way to tell the PrimeNet server what to clean up.
It's sort of analogous to the case of a multibitlevel factoring assignment returning a factor in an early bit level; no point in soldiering on needlessly once the exponent has a factor discovered and reported.
kriesel is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1615 2020-05-16 23:55
GPUOWL AMD Windows OpenCL issues xx005fs GPU Computing 0 2019-07-26 21:37
Testing an expression for primality 1260 Software 17 2015-08-28 01:35
Testing Mersenne cofactors for primality? CRGreathouse Computer Science & Computational Number Theory 18 2013-06-08 19:12
Primality-testing program with multiple types of moduli (PFGW-related) Unregistered Information & Answers 4 2006-10-04 22:38

All times are UTC. The time now is 21:59.

Thu May 28 21:59:23 UTC 2020 up 64 days, 19:32, 0 users, load averages: 1.24, 1.65, 1.66

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.