![]() |
|
|
#1376 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,437 Posts |
Quote:
If there was a way for the user to look at manual results with or without error, DC mismatch etc., divided by the various computer/gpuinstance identifiers that would be great. |
|
|
|
|
|
|
#1377 | |
|
"Mihai Preda"
Apr 2015
137110 Posts |
Quote:
The worktodo.bak left behind is sort of intended -- for the situation that GpuOwl makes a mess out of worktodo.txt for some reason, the user can recover. |
|
|
|
|
|
|
#1378 | |
|
"Mihai Preda"
Apr 2015
137110 Posts |
Quote:
|
|
|
|
|
|
|
#1379 |
|
"mrh"
Oct 2018
Temecula, ca
22×3×5 Posts |
Does anyone know if P-1 works using nvidia? PRP seems to work fine for me, but when trying -pm1 it looks like stage1 makes it to about 100% and then the host (not gpu) tries to allocate 70GB or so of memory and gets kill by oom_reaper. I can dig deeper, but maybe someone already knows what is wrong?
|
|
|
|
|
|
#1380 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,437 Posts |
Quote:
See the previous few pages of this thread and particularly https://www.mersenneforum.org/showpo...postcount=1360 https://www.mersenneforum.org/showpo...postcount=1358 Perhaps also some of https://www.mersenneforum.org/showpo...22&postcount=1, including the new Gpuowl P-1 run time scaling on AMD and NVIDIA https://www.mersenneforum.org/showpo...5&postcount=17 TF has dainty memory requirements, LL or PRP significant, and P-1 the biggest. Last fiddled with by kriesel on 2019-09-17 at 01:04 |
|
|
|
|
|
|
#1381 | |
|
"mrh"
Oct 2018
Temecula, ca
22×3×5 Posts |
Quote:
|
|
|
|
|
|
|
#1382 | |
|
"mrh"
Oct 2018
Temecula, ca
22·3·5 Posts |
Quote:
|
|
|
|
|
|
|
#1383 |
|
"Mihai Preda"
Apr 2015
3×457 Posts |
In the most recent commit I added savefile support for P-1 *first-stage only*. There will be files created, with the extension ".p1.owl". A save is done every 5min and on manual exit (ctrl-C). There should be messages in the log indicating the save being done and the iteration.
The B1 is saved in the savefile. If the actual B1 does not match the saved B1 the savefile will not be loaded, but also should not be overwritten -- instead the program will indicate the mismatch and exit. |
|
|
|
|
|
#1384 |
|
"Mihai Preda"
Apr 2015
3·457 Posts |
Finally added Ken's pet feature, savefiles for P-1, both stages. Here's a bit how they're implemented:
For exponent N, there are: stage1 savefile(s): <workdir>/N/N.p1.owl stage2 savefile(s): <workdir>/N/N.p2.owl stage1 is saved periodically (every 5minutes), on Ctrl-C, and at the one-to-last iteration of stage1. stage2 is only saved when a "round" is completed (log lines like "86316851 P2 774/2880" indicate rounds being completed). Notably, stage2 is not saved on Ctrl-C. Feedback welcome; there may be bugs. Please test a few known-factors exponents and check that the factors are found; especially across reloads in stage2. |
|
|
|
|
|
#1385 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,437 Posts |
Quote:
Built no problem in msys2/mingw64, ran ok on Win7-x64 Pro RX480. M24000577 P-1 stage 1 save at 5 minute intervals confirmed; stage 2 save at rounds confirmed; stage 2 gcd factor found confirmed; rerun with existing save files begins at very late stage 1, redoes stage 2. Presumably this could be used to fairly efficiently run a second stage 2 with larger bound (might require deleting the earlier stage 2 files, or code modification to run b2-old to b2-new-larger without duplicating B1 to B2-old.) M1257787 PRP test went correctly, no file or folder test or rename etc issues. M95m P-1 run in progress, stage one save on Ctrl-C confirmed P-1 save and resume is useful as larger exponents increase total P-1 run time to days or weeks each. See https://www.mersenneforum.org/showpo...5&postcount=17 Last fiddled with by kriesel on 2019-09-19 at 18:27 |
|
|
|
|
|
|
#1386 |
|
"Eric"
Jan 2018
USA
22·53 Posts |
It looks like when using an Nvidia GPU to do any work with GPUOWL maxes out 1 of my CPU cores, which does have a decent impact on my CPU crunching performance and waste unnecessary heat and power. I have found some links that supposedly fixes the issue (only for CUDA), but I don't know if it is going to be able to work with OpenCL apps like GPUOWL. If one of the CPU core can be freed then that would waste less compute cycles overall.
Here's one of the link anyways. https://devtalk.nvidia.com/default/t...ns-its-kernel/ |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| GPUOWL AMD Windows OpenCL issues | xx005fs | GpuOwl | 0 | 2019-07-26 21:37 |
| Testing an expression for primality | 1260 | Software | 17 | 2015-08-28 01:35 |
| Testing Mersenne cofactors for primality? | CRGreathouse | Computer Science & Computational Number Theory | 18 | 2013-06-08 19:12 |
| Primality-testing program with multiple types of moduli (PFGW-related) | Unregistered | Information & Answers | 4 | 2006-10-04 22:38 |