mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GpuOwl (https://www.mersenneforum.org/forumdisplay.php?f=171)
-   -   gpuOwL: an OpenCL program for Mersenne primality testing (https://www.mersenneforum.org/showthread.php?t=22204)

SELROC 2019-04-03 05:50

[QUOTE=GP2;512503]You can find some PRPs that need DC from the following users:

Warning: not all are type 4. There's no way to filter by residue type, although you can click on the "Residue Type" column header to sort by it.

Also, I think gpuOwL only produces residues with shift count zero, so the double check will also have shift count zero, and some might insist that it's not a proper double check unless it's with a different shift count.
[LIST][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=Mihai+Preda&exdchk=1&dispdate=1&B1="]Mihai Preda[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=Kriesel&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]Kriesel[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=kwe5ykdf&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]kwe5ykdf[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=tServo&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]tServo[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=Franklin+Webber&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]Franklin Webber[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=Xebecer&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]Xebecer[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=xx005fs&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]xx005fs[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=SEL-ROC&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]SEL-ROC[/URL][*][URL="https://www.mersenne.org/report_prp/?exp_lo=82000000&exp_hi=999999999&exp_date=&end_date=&user_only=1&user_id=kracker&exdchk=1&dispdate=1&exbad=1&exfactor=1&B1="]kracker[/URL][/LIST][/QUOTE]


At this point gpuOwl is good for discovering new primes, the double-check should be done with a different program, maybe mprime with fast cpu and memory.

LaurV 2019-04-04 17:12

[offtopic]
hehe, we love the new thread title, some supermods are really creative :razz:

[/offtopic]

SELROC 2019-04-05 09:57

[QUOTE=SELROC;512276]It was with the wrong set of system software, not enough up to date, gpuowl started but after a while the residue was zeroed out, and a page fault occurred.


After installing the latest kernel, modules and headers, v. 5.0, then gpuowl works without problems and the zero-residue error is gone.[/QUOTE]

[QUOTE=SELROC;512290]The same error has occurred again, zero-residue, but this time gpuowl reloaded the last checkpoint and continued.


I found that I was running a version of openowl compiled against old libs. Now I have recompiled and waiting to see if the error occurs again.[/QUOTE]

[QUOTE=SELROC;512361]After some hour no error. Will see today the continuation.[/QUOTE]




The error occurred again today, after X days of work. Gpuowl has reloaded last checkpoint and is continuing.

preda 2019-04-05 11:57

[QUOTE=SELROC;512744]The error occurred again today, after X days of work. Gpuowl has reloaded last checkpoint and is continuing.[/QUOTE]

On what hardware -- is the hardware setup verified reliable?

What do you suspect as the reason for errors? (GPUowl, driver, hardware, something else)

SELROC 2019-04-05 12:27

[QUOTE=preda;512747]On what hardware -- is the hardware setup verified reliable?

What do you suspect as the reason for errors? (GPUowl, driver, hardware, something else)[/QUOTE]


Gigabyte mainboard

radeon VII
Ubuntu
ROCm 2.2

kernel 5.0.5


the previous time I hit a page fault but not this time (after recompiling gpuowl)

preda 2019-04-05 12:29

[QUOTE=SELROC;512750]Gigabyte mainboard

radeon VII
Ubuntu
ROCm 2.2

kernel 5.0.5


the previous time I hit a page fault but not this time (after recompiling gpuowl)[/QUOTE]

Congrats on the R7! Did you undervolt/overclock? (if yes, that may explain the errors from time to time)

SELROC 2019-04-05 12:36

[QUOTE=preda;512752]Congrats on the R7! Did you undervolt/overclock? (if yes, that may explain the errors from time to time)[/QUOTE]


PS: the setup has been validated against known prime.


the gpu is in automatic mode.

preda 2019-04-05 12:38

[QUOTE=SELROC;512755]PS: the setup has been validated against known prime.

the gpu is in automatic mode.[/QUOTE]

So you did not undervolt/overclock? (when you see the errors you report)

SELROC 2019-04-05 12:55

[QUOTE=preda;512756]So you did not undervolt/overclock? (when you see the errors you report)[/QUOTE]


No.


the voltage is ~1.04 V


SCLK 1802 MHz
MCLK 1001 MHz

SELROC 2019-04-06 07:45

[QUOTE=preda;512747]On what hardware -- is the hardware setup verified reliable?

What do you suspect as the reason for errors? (GPUowl, driver, hardware, something else)[/QUOTE]


No no, gpuowl on his part has done a very good job of reloading the checkpoint and continuing.


The error manifests with a zero residue, so it may be that gpuowl is sometimes reading from the wrong location, and this may correspond to a page fault.

SELROC 2019-04-07 07:12

[QUOTE=preda;512747]On what hardware -- is the hardware setup verified reliable?

What do you suspect as the reason for errors? (GPUowl, driver, hardware, something else)[/QUOTE]

[QUOTE=SELROC;512841]No no, gpuowl on his part has done a very good job of reloading the checkpoint and continuing.


The error manifests with a zero residue, so it may be that gpuowl is sometimes reading from the wrong location, and this may correspond to a page fault.[/QUOTE]




It may be my negligence, I was running kernel 5.0.5 but with an outdated initramfs. After updating initramfs the error is not occurring.


All times are UTC. The time now is 23:13.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.