mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing > GpuOwl

Reply
Thread Tools
Old 2019-03-15, 11:13   #1013
SELROC
 

22·3·5·23 Posts
Default

Quote:
Originally Posted by kriesel View Post
I tested v4.3 a total of one little 11 minute m1257787 run to see if my edit left it still functional. Why would you object to that?
Worked through a backlog of builds, giving the newest at the time (V5.0-f604bb1) priority.
Execution speed seems to me to have been declining since v3.8, and V5.0 apparently continues that trend even when no P-1 is done. https://www.mersenneforum.org/showpo...&postcount=830

I'm still running V3.8 for production because of its speed advantage. I think I'm not alone in that.
I may run V5 for exponents needing P-1, after it's shown sufficiently reliable, if the run time of PRP-1 in V5 is shown to be less than the run time of P-1 in CUDAPm1 plus PRP in V3.8 separately on comparable gpus. My recent benchmarking indicates V5 is often 4-5% slower (PRP only, not PRP-1) than V3.8. That difference is longer than an entire typical P-1, which is 2-2.5% of a CUDALucas LL test, in the 100M to 500M range, unless PRP was ~ twice as fast as LL.

Case in point: 87m P-1 on GTX1060, ~5 hours, PRP in V3.8, 3d 22 hours; so combined time is 4d 3h. Compare to 4d 12h for gpuowl V5, 9 hours slower.
Again: 171m P-1 on GTX1060, ~20 hours, PRP in V3.8, 14d 21h, combined 15d 17 h; Estimated V5.0 PRP time 15d 16h. If PRP-1 is within ~1 hour of PRP, V5 wins.
Quote:
Originally Posted by SELROC View Post
The fastest version was 3.5, performance regression after that, and little performance recovery in 4.6.



To my great surprise the gpuowl performance regression on amdgpu-pro is now gone with amdgpu-pro version 18.50


I have the same performance on amdgpu-pro 18.50 and ROCm 2.2


I am using the latest gpuowl version aka GitHub master branch.
  Reply With Quote
Old 2019-03-26, 18:31   #1014
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

827 Posts
Default

Will the below DC definitely DC and not do a first time test of the wrong type? gpuowl did that once before so I avoided DC but now I want to verify that this card is producing correct results:

PRP=N/A,1,2,79335979,-1,75,0,3,1
https://www.mersenne.org/report_expo...xp_lo=79335979
gpuowl 6.2-3a95f98-mod, a recent version.
M344587487 is offline   Reply With Quote
Old 2019-03-26, 19:28   #1015
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

19·397 Posts
Default

That looks like the correct worktodo.txt entry to me.
Prime95 is offline   Reply With Quote
Old 2019-03-26, 19:47   #1016
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

19·613 Posts
Default

Quote:
Originally Posted by kriesel View Post
Mihai is using a nomenclature of height, width, and middle for the components of the individual fft transforms in gpuowl, and up to 3 components. See https://www.mersenneforum.org/showpo...&postcount=956

Mlucas does differently, with up to 5, if I recall correctly, and possibly more to come. Here's an excerpt from an mlucas.cfg file, with 5, and it appears room to go up to 10:
Code:
radices =  36 16 16 32 32  0  0  0  0  0
for an 18M fft length. (36 x 16 x 16 x 32 x 32, x 2 for real plus imaginary I guess, = 18432K)
Correct re. Mlucas - those entries are complex-FFT radices. You may see 6 or even 7 for some FFT lengths of current-GIMPS-wavefront interest, depending on what works best on your hardware, as determined by the standard post-install self-tests, but 4-5 radices is more common.

Last fiddled with by ewmayer on 2019-03-26 at 19:49
ewmayer is offline   Reply With Quote
Old 2019-03-27, 01:31   #1017
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

33B16 Posts
Default

M344587487 is offline   Reply With Quote
Old 2019-03-27, 06:39   #1018
SELROC
 

2·3·233 Posts
Default

  Reply With Quote
Old 2019-03-27, 06:46   #1019
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

3×457 Posts
Default

preda is offline   Reply With Quote
Old 2019-03-27, 08:43   #1020
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

827 Posts
Default preda if you're saying that the DC will erroneously do a type 4 blink twice

M344587487 is offline   Reply With Quote
Old 2019-03-27, 10:23   #1021
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

25BF16 Posts
Default haha, good one!

LaurV is offline   Reply With Quote
Old 2019-03-27, 12:44   #1022
SELROC
 

47·79 Posts
Default benchmarking

  Reply With Quote
Old 2019-03-27, 16:49   #1023
SELROC
 

114816 Posts
Default last attempt: benchmarking gpus is subject to temperature. when benchmarking please note gpu temp.

test advanced editor.


OK

Last fiddled with by SELROC on 2019-03-27 at 16:50
  Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
GPUOWL AMD Windows OpenCL issues xx005fs GpuOwl 0 2019-07-26 21:37
Testing an expression for primality 1260 Software 17 2015-08-28 01:35
Testing Mersenne cofactors for primality? CRGreathouse Computer Science & Computational Number Theory 18 2013-06-08 19:12
Primality-testing program with multiple types of moduli (PFGW-related) Unregistered Information & Answers 4 2006-10-04 22:38

All times are UTC. The time now is 06:56.


Fri Aug 6 06:56:35 UTC 2021 up 14 days, 1:25, 1 user, load averages: 2.68, 2.65, 2.70

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.