mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing > GpuOwl

Reply
 
Thread Tools
Old 2017-08-30, 06:16   #188
axn
 
axn's Avatar
 
Jun 2003

2·3·7·112 Posts
Default

Quote:
Originally Posted by preda View Post
- all LL error protections have been removed: loop detection, Jacobi-check, rounding-error, etc. (as the new check is stronger).
Hopefully you've put in a "zero check". Without it, the whole thing becomes worthless.

EDIT:- Also, rounding error still needs to be detected, as it could mean you need to go to higher FFT

Last fiddled with by axn on 2017-08-30 at 06:19
axn is offline   Reply With Quote
Old 2017-08-30, 08:01   #189
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

3×457 Posts
Default

Quote:
Originally Posted by axn View Post
Hopefully you've put in a "zero check". Without it, the whole thing becomes worthless.

EDIT:- Also, rounding error still needs to be detected, as it could mean you need to go to higher FFT
Yes, zero check is in (as required by the new check). Rounding errors should be caught by the normal check though, why should there be dedicated rounding detection?

It's true that rounding suggests strongly "highter FFT", but that same idea can arise from a simple bits/word computation as well.
preda is offline   Reply With Quote
Old 2017-09-05, 01:46   #190
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23·271 Posts
Default

Windows binaries from latest commit(55d094a)... not tested, sorry!
Attached Files
File Type: zip gpuowl-v1.0-55d094a.zip (129.7 KB, 102 views)
kracker is offline   Reply With Quote
Old 2017-09-05, 03:15   #191
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

3·457 Posts
Default

Quote:
Originally Posted by kracker View Post
Windows binaries from latest commit(55d094a)... not tested, sorry!
Thanks Kracker!

A summary of the changes in 1.1:
- savefile name change, for exponent NNNN they're now called:
NNNN.ll
NNNN-prev.ll
NNNN-temp.ll
NNNN.<iteration>.ll

i.e. all the savefile for some exponent start with that exponent.
To see details about a savefile just print the last line of the file, like this:
tail -n1 file.ll
(the last line is human-readable text, all before that is binary).

- savefile format change. See the savefile signature that is now changed to "LL4".
It is possible to bring over an LL3 savefile to LL4 format, if it is edited with care by appending a "0" on the last line, and updating the signature to LL4. (this "0" that was added is the number of error rollbacks).

And not news, it does PRP-3. It writes JSON-formatted result to results.txt . Soon it will be possible to submit this result format.

Last fiddled with by preda on 2017-09-05 at 03:17
preda is offline   Reply With Quote
Old 2017-09-11, 00:06   #192
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

55B16 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
Updated timings from the Windows driver (Which I'm told is still using the previous closed compiler, while Linux has transitioned to LLVM/rocm - apparently regrettably)

RX64 Air:
ms/iter: 1.639

This is with stock clocks. If I use the "stable" 1000Mhz memory clocks that pass self test I get 1.600

Both kernels work as expected in Windows, so likely an llvm regression.
The recent head version now works correctly on VEGA on Linux with amdgpu-pro 17.30, in both modes (-legacy or not).

It appears the "workaround" that fixed the behavior was the removal of the "max-error" computation in the amalgamation kernel (which kernel is only used in non-legacy mode).

Also other small improvements, bring the speed to 1.54 ms/it on Vega air standard (but with quite some heat generated).
preda is offline   Reply With Quote
Old 2017-09-11, 18:52   #193
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

124558 Posts
Default

Quote:
Originally Posted by preda View Post
Thanks Kracker!

A summary of the changes in 1.1:
- savefile name change, for exponent NNNN they're now called:
NNNN.ll
NNNN-prev.ll
NNNN-temp.ll
NNNN.<iteration>.ll

i.e. all the savefile for some exponent start with that exponent.
To see details about a savefile just print the last line of the file, like this:
tail -n1 file.ll
(the last line is human-readable text, all before that is binary).

- savefile format change. See the savefile signature that is now changed to "LL4".
It is possible to bring over an LL3 savefile to LL4 format, if it is edited with care by appending a "0" on the last line, and updating the signature to LL4. (this "0" that was added is the number of error rollbacks).

And not news, it does PRP-3. It writes JSON-formatted result to results.txt . Soon it will be possible to submit this result format.
Why file extension .ll if you're doing a PRP-3 computation not Lucas-Lehmer?
kriesel is online now   Reply With Quote
Old 2017-09-11, 22:07   #194
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

137110 Posts
Default

Quote:
Originally Posted by kriesel View Post
Why file extension .ll if you're doing a PRP-3 computation not Lucas-Lehmer?
Legacy? -- it simply remained unchanged from LL times. I can change it if desired.
preda is offline   Reply With Quote
Old 2017-09-12, 00:30   #195
GP2
 
GP2's Avatar
 
Sep 2003

A1916 Posts
Default

Quote:
Originally Posted by preda View Post
Legacy? -- it simply remained unchanged from LL times. I can change it if desired.
It does create confusion.
GP2 is offline   Reply With Quote
Old 2017-09-14, 04:18   #196
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

55B16 Posts
Default

Quote:
Originally Posted by GP2 View Post
It does create confusion.
OK. I plan to change the file extension to "owl".
preda is offline   Reply With Quote
Old 2017-09-20, 21:04   #197
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

1D7116 Posts
Default

I'm trying to make sure gpuOwl interim residues match prime95 interim residues. As the person that implemented completely non-standard iteration numbers (off by 2) for LL testing, let me check on the standard for PRP tests.

I view a squaring and optional mul-by-3 as one iteration. Thus, when PRPing a Mersenne number, I think the interim residue for iteration 1 is (3^2)*3 = 27. There are N-1 iterations to PRP 2^N-1.
Prime95 is offline   Reply With Quote
Old 2017-09-20, 23:35   #198
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

25338 Posts
Default

Quote:
Originally Posted by Prime95 View Post
I'm trying to make sure gpuOwl interim residues match prime95 interim residues. As the person that implemented completely non-standard iteration numbers (off by 2) for LL testing, let me check on the standard for PRP tests.

I view a squaring and optional mul-by-3 as one iteration. Thus, when PRPing a Mersenne number, I think the interim residue for iteration 1 is (3^2)*3 = 27. There are N-1 iterations to PRP 2^N-1.
A brief description of my implem is here:
http://www.mersenneforum.org/showthr...655#post466655

In the residue computation, there is no mul-by-3 at any point.

The mul-by-3 is only involved in the verification, thus does not affect the residue.

My values are:
iteration 0: residue is 3.
iteration 1: residue is 3^2.
iteration 2: residue is 3^4.

For M = 2^p - 1,
the final residue is 3^(2^(p-1)). This final residue is -3 for a PRP.
Note, computing 3^(2^(p-1)) does not require any mul-by-3, only squarings.


See also http://www.mersenneforum.org/showpos...&postcount=138
where I propose adding 3 to the final residue to make the final value for a PRP == 0, an idea that I abandoned in the end (adding the 3) [because it would make the final residue non-uniform with the res at other positions].

Last fiddled with by preda on 2017-09-20 at 23:52 Reason: clarification, add reference
preda is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
GPUOWL AMD Windows OpenCL issues xx005fs GpuOwl 0 2019-07-26 21:37
Testing an expression for primality 1260 Software 17 2015-08-28 01:35
Testing Mersenne cofactors for primality? CRGreathouse Computer Science & Computational Number Theory 18 2013-06-08 19:12
Primality-testing program with multiple types of moduli (PFGW-related) Unregistered Information & Answers 4 2006-10-04 22:38

All times are UTC. The time now is 16:57.


Mon Aug 2 16:57:27 UTC 2021 up 10 days, 11:26, 0 users, load averages: 2.17, 2.31, 2.21

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.