mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing > GpuOwl

Reply
 
Thread Tools
Old 2019-09-14, 03:37   #1365
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

72×197 Posts
Default

Quote:
Originally Posted by Prime95 View Post
I'd like a volunteer with a non-Radeon VII to test a gpuowl version for me.
Do Nvidia qualifies, or must be AMD card?
If it does, tell me what to do.
LaurV is offline   Reply With Quote
Old 2019-09-14, 03:42   #1366
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

1D6F16 Posts
Default

Quote:
Originally Posted by LaurV View Post
Do Nvidia qualifies, or must be AMD card?
If it does, tell me what to do.
I don't know if I forked from an nVidia-capable gpuowl version. Best would probably be an AMD gpu. If no volunteers appear, we'll revisit your kind offer.
Prime95 is online now   Reply With Quote
Old 2019-09-14, 17:17   #1367
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by Prime95 View Post
I don't know if I forked from an nVidia-capable gpuowl version. Best would probably be an AMD gpu. If no volunteers appear, we'll revisit your kind offer.
Judging by file dates and help output, it looks to be a variant of v6.5. Is there a particular -use that your requested test requires? FYI it does compile and run on NVIDIA.

For the same 87M exponent, starting from zero, each separate folder:

Win7 Pro x64, RX480, GW variant:
Code:
>gpuowl-win -h
2019-09-14 11:10:12 gpuowl

Command line options:

-dir <folder>      : specify work directory (containing worktodo.txt, results.txt, config.txt, gpuowl.log)
-user <name>       : specify the user name.
-cpu  <name>       : specify the hardware name.
-time              : display kernel profiling information.
-fft <size>        : specify FFT size, such as: 5000K, 4M, +2, -1.
-block <value>     : PRP GEC block size. Default 1000. Smaller block is slower but detects errors sooner.
-log <step>        : log every <step> iterations, default 20000. Multiple of 10000.
-carry long|short  : force carry type. Short carry may be faster, but requires high bits/word.
-B1                : P-1 B1 bound, default 500000
-B2                : P-1 B2 bound, default B1 * 30
-rB2               : ratio of B2 to B1. Default 30, used only if B2 is not explicitly set
-prp <exponent>    : run a single PRP test and exit, ignoring worktodo.txt
-pm1 <exponent>    : run a single P-1 test and exit, ignoring worktodo.txt
-results <file>    : name of results file, default 'results.txt'
-iters <N>         : run next PRP test for <N> iterations and exit. Multiple of 10000.
-use NEW_FFT8,OLD_FFT5,NEW_FFT10: comma separated list of defines, see the #if tests in gpuowl.cl (used for perf tuning).
-device <N>        : select a specific device:
 0 : Ellesmere-36x1266-@28:0.0 Radeon (TM) RX 480 Graphics
 1 : gfx804-8x1203-@3:0.0 Radeon 550 Series

FFT Configurations:
FFT    8K [  0.01M -    0.18M]  64-64
FFT   32K [  0.05M -    0.68M]  64-256 256-64
FFT   64K [  0.10M -    1.34M]  64-512 512-64
FFT  128K [  0.20M -    2.63M]  1K-64 64-1K 256-256
FFT  192K [  0.29M -    3.91M]  64-256-6
FFT  224K [  0.34M -    4.54M]  64-256-7
FFT  256K [  0.39M -    5.18M]  64-2K 256-512 512-256 2K-64
FFT  288K [  0.44M -    5.81M]  64-256-9
FFT  320K [  0.49M -    6.44M]  64-256-10
FFT  352K [  0.54M -    7.06M]  64-256-11
FFT  384K [  0.59M -    7.69M]  64-256-12 64-512-6
FFT  448K [  0.69M -    8.94M]  64-512-7
FFT  512K [  0.79M -   10.18M]  1K-256 256-1K 512-512 4K-64
FFT  576K [  0.88M -   11.42M]  64-512-9
FFT  640K [  0.98M -   12.66M]  64-512-10
FFT  704K [  1.08M -   13.89M]  64-512-11
FFT  768K [  1.18M -   15.12M]  64-512-12 64-1K-6 256-256-6
FFT  896K [  1.38M -   17.57M]  64-1K-7 256-256-7
FFT    1M [  1.57M -   20.02M]  1K-512 256-2K 512-1K 2K-256
FFT 1152K [  1.77M -   22.45M]  64-1K-9 256-256-9
FFT 1280K [  1.97M -   24.88M]  64-1K-10 256-256-10
FFT 1408K [  2.16M -   27.31M]  64-1K-11 256-256-11
FFT 1536K [  2.36M -   29.72M]  64-1K-12 64-2K-6 256-256-12 256-512-6 512-256-6
FFT 1792K [  2.75M -   34.54M]  64-2K-7 256-512-7 512-256-7
FFT    2M [  3.15M -   39.34M]  1K-1K 512-2K 2K-512 4K-256
FFT 2304K [  3.54M -   44.13M]  64-2K-9 256-512-9 512-256-9
FFT 2560K [  3.93M -   48.90M]  64-2K-10 256-512-10 512-256-10
FFT 2816K [  4.33M -   53.66M]  64-2K-11 256-512-11 512-256-11
FFT    3M [  4.72M -   58.41M]  1K-256-6 64-2K-12 256-512-12 256-1K-6 512-256-12 512-512-6
FFT 3584K [  5.51M -   67.87M]  1K-256-7 256-1K-7 512-512-7
FFT    4M [  6.29M -   77.30M]  1K-2K 2K-1K 4K-512
FFT 4608K [  7.08M -   86.70M]  1K-256-9 256-1K-9 512-512-9
FFT    5M [  7.86M -   96.07M]  1K-256-10 256-1K-10 512-512-10
FFT 5632K [  8.65M -  105.41M]  1K-256-11 256-1K-11 512-512-11
FFT    6M [  9.44M -  114.74M]  1K-256-12 1K-512-6 256-1K-12 256-2K-6 512-512-12 512-1K-6 2K-256-6
FFT    7M [ 11.01M -  133.32M]  1K-512-7 256-2K-7 512-1K-7 2K-256-7
FFT    8M [ 12.58M -  151.83M]  2K-2K 4K-1K
FFT    9M [ 14.16M -  170.28M]  1K-512-9 256-2K-9 512-1K-9 2K-256-9
FFT   10M [ 15.73M -  188.68M]  1K-512-10 256-2K-10 512-1K-10 2K-256-10
FFT   11M [ 17.30M -  207.02M]  1K-512-11 256-2K-11 512-1K-11 2K-256-11
FFT   12M [ 18.87M -  225.32M]  1K-512-12 1K-1K-6 256-2K-12 512-1K-12 512-2K-6 2K-256-12 2K-512-6 4K-256-6
FFT   14M [ 22.02M -  261.80M]  1K-1K-7 512-2K-7 2K-512-7 4K-256-7
FFT   16M [ 25.17M -  298.13M]  4K-2K
FFT   18M [ 28.31M -  334.34M]  1K-1K-9 512-2K-9 2K-512-9 4K-256-9
FFT   20M [ 31.46M -  370.44M]  1K-1K-10 512-2K-10 2K-512-10 4K-256-10
FFT   22M [ 34.60M -  406.43M]  1K-1K-11 512-2K-11 2K-512-11 4K-256-11
FFT   24M [ 37.75M -  442.34M]  1K-1K-12 1K-2K-6 512-2K-12 2K-512-12 2K-1K-6 4K-256-12 4K-512-6
FFT   28M [ 44.04M -  513.91M]  1K-2K-7 2K-1K-7 4K-512-7
FFT   36M [ 56.62M -  656.22M]  1K-2K-9 2K-1K-9 4K-512-9
FFT   40M [ 62.91M -  727.03M]  1K-2K-10 2K-1K-10 4K-512-10
FFT   44M [ 69.21M -  797.64M]  1K-2K-11 2K-1K-11 4K-512-11
FFT   48M [ 75.50M -  868.07M]  1K-2K-12 2K-1K-12 2K-2K-6 4K-512-12 4K-1K-6
FFT   56M [ 88.08M - 1008.44M]  2K-2K-7 4K-1K-7
FFT   72M [113.25M - 1287.53M]  2K-2K-9 4K-1K-9
FFT   80M [125.83M - 1426.38M]  2K-2K-10 4K-1K-10
FFT   88M [138.41M - 1564.83M]  2K-2K-11 4K-1K-11
FFT   96M [150.99M - 1702.92M]  2K-2K-12 4K-1K-12 4K-2K-6
FFT  112M [176.16M - 1978.12M]  4K-2K-7
FFT  144M [226.49M - 2525.23M]  4K-2K-9
FFT  160M [251.66M - 2797.39M]  4K-2K-10
FFT  176M [276.82M - 3068.76M]  4K-2K-11
FFT  192M [301.99M - 3339.40M]  4K-2K-12
2019-09-14 11:10:17 Exiting because "help"
2019-09-14 11:10:17 Bye

C:\msys64\home\ken\gpuowl-compile\gw>gw

C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0
2019-09-14 11:19:48 gpuowl
2019-09-14 11:19:48 Note: no config.txt file found
2019-09-14 11:19:48 config: -device 0
2019-09-14 11:19:48 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:19:48 using short carry kernels
2019-09-14 11:19:55 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:19:55 OpenCL compilation error -11 (args -DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIG
HT_STEP=0xc.1551b6b1158dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4  -I. -cl-fast-relaxed-math -cl-std=CL2.0)
2019-09-14 11:19:55 C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: error: implicit declaration of function '__asm' is invalid in C99
  X2(u[0], u[2]);
  ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:174:2: note: expanded from macro 'X2'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.x) : "v" (t.x), "v" (b.x)); \
        ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: error: expected ')'
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:174:35: note: expanded from macro 'X2'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.x) : "v" (t.x), "v" (b.x)); \
                                         ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: note: to match this '('
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:174:7: note: expanded from macro 'X2'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.x) : "v" (t.x), "v" (b.x)); \
             ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: error: expected ')'
  X2(u[0], u[2]);
  ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:175:35: note: expanded from macro 'X2'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.y) : "v" (t.y), "v" (b.y)); \
                                         ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: note: to match this '('
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:175:7: note: expanded from macro 'X2'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.y) : "v" (t.y), "v" (b.y)); \
             ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:198:3: error: expected ')'
  X2_mul_t4(u[1], u[3]);
  ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:180:35: note: expanded from macro 'X2_mul_t4'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (t.x) : "v" (b.x), "v" (t.x)); \
                                         ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:198:3: note: to match this '('
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:180:7: note: expanded from macro 'X2_mul_t4'
        __asm( "v_add_f64 %0, %1, -%2\n" : "=v" (t.x) : "v" (b.x), "v" (t.x)); \
             ^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:1982019-09-14 11:19:55 Exception 9gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at clwrap.cpp:215 build
2019-09-14 11:19:55 Bye

C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -use ORIG_X2
2019-09-14 11:20:30 gpuowl
2019-09-14 11:20:30 Note: no config.txt file found
2019-09-14 11:20:30 config: -device 0 -use ORIG_X2
2019-09-14 11:20:30 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:20:30 using short carry kernels
2019-09-14 11:20:35 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:20:39 OpenCL compilation in 3389 ms
2019-09-14 11:20:40 87005279.owl not found, starting from the beginning.
2019-09-14 11:21:08 87005279 OK     2000  0.00%; 6501 us/sq; ETA 6d 13:06; e944fcb41cb63c80 (check 6.71s)
2019-09-14 11:23:06 87005279       20000  0.02%; 6557 us/sq; ETA 6d 14:26; 77e12e401949f647
2019-09-14 11:25:17 87005279       40000  0.05%; 6549 us/sq; ETA 6d 14:13; 3ccb222b85a3780d
2019-09-14 11:26:42 Stopping, please wait..
2019-09-14 11:26:49 87005279 OK    53000  0.06%; 6579 us/sq; ETA 6d 14:54; 4a2c9b719dd7f2c1 (check 6.74s)
2019-09-14 11:26:49 Exiting because "stop requested"
2019-09-14 11:26:49 Bye
Terminate batch job (Y/N)? y

C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -use ORIG_X2 -time
2019-09-14 11:27:09 gpuowl
2019-09-14 11:27:09 Note: no config.txt file found
2019-09-14 11:27:09 config: -device 0 -use ORIG_X2 -time
2019-09-14 11:27:09 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:27:09 using short carry kernels
2019-09-14 11:27:16 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:27:19 OpenCL compilation in 3207 ms
2019-09-14 11:27:20 87005279.owl loaded: k 53000, block 1000, res64 4a2c9b719dd7f2c1
2019-09-14 11:28:27 87005279 OK    55000  0.06%; 13095 us/sq; ETA 13d 04:17; e5617f81e2a4387a (check 25.20s)
2019-09-14 11:28:27 32.25% fftMiddleIn    :   5051 us/call x  4259 calls
2019-09-14 11:28:27 18.56% carryFused     :   3201 us/call x  3869 calls
2019-09-14 11:28:27 17.15% tailFused      :   2860 us/call x  3999 calls
2019-09-14 11:28:27 14.56% fftMiddleOut   :   2352 us/call x  4129 calls
2019-09-14 11:28:27 13.64% transposeH     :   2203 us/call x  4129 calls
2019-09-14 11:28:27  0.93% fftH           :   1585 us/call x   390 calls
2019-09-14 11:28:27  0.88% fftP           :   1503 us/call x   390 calls
2019-09-14 11:28:27  0.67% carryA         :   1725 us/call x   258 calls
2019-09-14 11:28:27  0.61% fftW           :   1569 us/call x   260 calls
2019-09-14 11:28:27  0.36% multiply       :   1862 us/call x   130 calls
2019-09-14 11:28:27  0.36% carryB         :    915 us/call x   260 calls
2019-09-14 11:28:27
2019-09-14 11:30:20 87005279       60000  0.07%; 22484 us/sq; ETA 22d 15:02; 6d81443958902b6b
2019-09-14 11:30:20 28.82% fftMiddleIn    :   6456 us/call x  5010 calls
2019-09-14 11:30:20 19.99% carryFused     :   4490 us/call x  4995 calls
2019-09-14 11:30:20 19.00% tailFused      :   4263 us/call x  5000 calls
2019-09-14 11:30:20 16.22% fftMiddleOut   :   3636 us/call x  5005 calls
2019-09-14 11:30:20 15.80% transposeH     :   3542 us/call x  5005 calls
2019-09-14 11:30:20  0.05% fftH           :   3400 us/call x    15 calls
2019-09-14 11:30:20  0.03% fftP           :   2533 us/call x    15 calls
2019-09-14 11:30:20  0.03% carryB         :   3800 us/call x    10 calls
2019-09-14 11:30:20  0.03% fftW           :   3600 us/call x    10 calls
2019-09-14 11:30:20  0.02% carryA         :   2200 us/call x    10 calls
2019-09-14 11:30:20  0.01% multiply       :   3200 us/call x     5 calls
2019-09-14 11:30:20
2019-09-14 11:31:57 Stopping, please wait..
2019-09-14 11:32:17 87005279 OK    64000  0.07%; 24296 us/sq; ETA 24d 10:45; a5a4adb2509d792a (check 20.19s)
2019-09-14 11:32:17 29.29% fftMiddleIn    :   6837 us/call x  5008 calls
2019-09-14 11:32:17 19.30% carryFused     :   4517 us/call x  4995 calls
2019-09-14 11:32:17 18.58% tailFused      :   4344 us/call x  5000 calls
2019-09-14 11:32:17 17.53% fftMiddleOut   :   4096 us/call x  5004 calls
2019-09-14 11:32:17 15.22% transposeH     :   3555 us/call x  5004 calls
2019-09-14 11:32:17  0.02% carryB         :   2844 us/call x     9 calls
2019-09-14 11:32:17  0.02% multiply       :   4650 us/call x     4 calls
2019-09-14 11:32:17  0.01% carryA         :   1875 us/call x     8 calls
2019-09-14 11:32:17  0.01% fftP           :   1000 us/call x    13 calls
2019-09-14 11:32:17  0.01% fftH           :   1083 us/call x    12 calls
2019-09-14 11:32:17
2019-09-14 11:32:17 Exiting because "stop requested"
2019-09-14 11:32:17 Bye
Terminate batch job (Y/N)? y

C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -carry short -use ORIG_X2 -time
2019-09-14 11:45:24 gpuowl
2019-09-14 11:45:24 Note: no config.txt file found
2019-09-14 11:45:24 config: -device 0 -carry short -use ORIG_X2 -time
2019-09-14 11:45:24 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:45:24 using short carry kernels
2019-09-14 11:45:31 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:45:34 OpenCL compilation in 3229 ms
2019-09-14 11:45:35 87005279.owl loaded: k 64000, block 1000, res64 a5a4adb2509d792a
2019-09-14 11:47:05 87005279 OK    66000  0.08%; 19765 us/sq; ETA 19d 21:19; cd6fbb4ea4a33c97 (check 24.90s)
2019-09-14 11:47:05 28.97% fftMiddleIn    :   6073 us/call x  4259 calls
2019-09-14 11:47:05 17.84% tailFused      :   3983 us/call x  3999 calls
2019-09-14 11:47:05 17.09% carryFused     :   3943 us/call x  3869 calls
2019-09-14 11:47:05 15.32% fftMiddleOut   :   3313 us/call x  4129 calls
2019-09-14 11:47:05 15.15% transposeH     :   3276 us/call x  4129 calls
2019-09-14 11:47:05  1.40% fftH           :   3200 us/call x   390 calls
2019-09-14 11:47:05  1.26% fftP           :   2880 us/call x   390 calls
2019-09-14 11:47:05  1.01% carryA         :   3507 us/call x   258 calls
2019-09-14 11:47:05  0.87% fftW           :   3000 us/call x   260 calls
2019-09-14 11:47:05  0.70% carryB         :   2400 us/call x   260 calls
2019-09-14 11:47:05  0.37% multiply       :   2520 us/call x   130 calls
2019-09-14 11:47:05  0.03% carryM         :  15600 us/call x     2 calls
2019-09-14 11:47:05
2019-09-14 11:48:19 Stopping, please wait..
2019-09-14 11:48:31 87005279 OK    69000  0.08%; 24611 us/sq; ETA 24d 18:21; 80ce9777c6f885e9 (check 12.89s)
2019-09-14 11:48:31 31.37% fftMiddleIn    :   6756 us/call x  4006 calls
2019-09-14 11:48:31 18.90% carryFused     :   4080 us/call x  3996 calls
2019-09-14 11:48:31 17.00% tailFused      :   3666 us/call x  4000 calls
2019-09-14 11:48:31 16.31% transposeH     :   3515 us/call x  4003 calls
2019-09-14 11:48:31 16.20% fftMiddleOut   :   3492 us/call x  4003 calls
2019-09-14 11:48:31  0.07% fftP           :   6240 us/call x    10 calls
2019-09-14 11:48:31  0.05% fftW           :   6686 us/call x     7 calls
2019-09-14 11:48:31  0.04% fftH           :   3467 us/call x     9 calls
2019-09-14 11:48:32  0.02% carryB         :   2229 us/call x     7 calls
2019-09-14 11:48:32  0.02% multiply       :   5200 us/call x     3 calls
2019-09-14 11:48:32  0.02% isEqual        :  15600 us/call x     1 calls
2019-09-14 11:48:32
2019-09-14 11:48:32 Exiting because "stop requested"
2019-09-14 11:48:32 Bye
Terminate batch job (Y/N)? n

C:\msys64\home\ken\gpuowl-compile\gw>gw

C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -carry short -use ORIG_X2
2019-09-14 11:48:40 gpuowl
2019-09-14 11:48:40 Note: no config.txt file found
2019-09-14 11:48:40 config: -device 0 -carry short -use ORIG_X2
2019-09-14 11:48:40 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:48:40 using short carry kernels
2019-09-14 11:48:48 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:48:51 OpenCL compilation in 3276 ms
2019-09-14 11:48:52 87005279.owl loaded: k 69000, block 1000, res64 80ce9777c6f885e9
2019-09-14 11:49:20 87005279 OK    71000  0.08%; 6497 us/sq; ETA 6d 12:54; cb6cb22058171054 (check 6.72s)
2019-09-14 11:50:19 87005279       80000  0.09%; 6501 us/sq; ETA 6d 12:59; e989bcf6f98d3c02
2019-09-14 11:52:30 87005279      100000  0.11%; 6550 us/sq; ETA 6d 14:07; 4ba1f423b8c71b64
2019-09-14 11:54:41 87005279      120000  0.14%; 6552 us/sq; ETA 6d 14:07; 74525140cca3e28c
2019-09-14 11:56:26 Stopping, please wait..
2019-09-14 11:56:33 87005279 OK   136000  0.16%; 6564 us/sq; ETA 6d 14:24; 32900173c562435a (check 6.75s)
2019-09-14 11:56:33 Exiting because "stop requested"
 2019-09-14 11:56:33 Bye
Win7 Pro x64, RX480 (same gpu and system as above), gpuowl-v6.5-76-g1ca08e2 (dirty is because I edited the makefile slightly):
Code:
>gpuowl-win -device 0 -carry short -fft +0 -use ORIG_X2
2019-09-14 11:33:44 gpuowl v6.5-76-g1ca08e2-dirty
2019-09-14 11:33:44 Note: no config.txt file found
2019-09-14 11:33:44 config: -device 0 -carry short -fft +0 -use ORIG_X2
2019-09-14 11:33:44 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:33:44 using short carry kernels
2019-09-14 11:33:46 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:33:49 OpenCL compilation in 3057 ms
2019-09-14 11:33:50 87005279.owl not found, starting from the beginning.
2019-09-14 11:34:08 87005279 OK     2000  0.00%; 4.108 ms/sq; ETA 4d 03:16; e944fcb41cb63c80 (check 4.35s)
2019-09-14 11:35:22 87005279       20000  0.02%; 4.157 ms/sq; ETA 4d 04:27; 77e12e401949f647
2019-09-14 11:36:45 87005279       40000  0.05%; 4.147 ms/sq; ETA 4d 04:10; 3ccb222b85a3780d
2019-09-14 11:37:31 Stopping, please wait..
2019-09-14 11:37:36 87005279 OK    51000  0.06%; 4.124 ms/sq; ETA 4d 03:36; 7b72f5d50e454610 (check 4.88s)
2019-09-14 11:37:36 Exiting because "stop requested"
2019-09-14 11:37:36 Bye

C:\msys64\home\ken\gpuowl-compile\v6.5-latest\gpuowl>gpuowl-win -device 0 -carry short -fft +0 -use ORIG_X2 -time
2019-09-14 11:38:07 gpuowl v6.5-76-g1ca08e2-dirty
2019-09-14 11:38:07 Note: no config.txt file found
2019-09-14 11:38:07 config: -device 0 -carry short -fft +0 -use ORIG_X2 -time
2019-09-14 11:38:07 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:38:07 using short carry kernels
2019-09-14 11:38:15 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:38:18 OpenCL compilation in 3151 ms
2019-09-14 11:38:19 87005279.owl loaded: k 51000, block 1000, res64 7b72f5d50e454610
2019-09-14 11:40:08 87005279 OK    53000  0.06%; 25.631 ms/sq; ETA 25d 19:04; 4a2c9b719dd7f2c1 (check 21.59s)
2019-09-14 11:40:08 16.79% carryFused     :   4709 us/call x  3869 calls
2019-09-14 11:40:08 16.17% tailFused      :   4389 us/call x  3999 calls
2019-09-14 11:40:08 15.41% fftMiddleIn    :   3927 us/call x  4259 calls
2019-09-14 11:40:08 15.33% transposeW     :   3908 us/call x  4259 calls
2019-09-14 11:40:08 15.31% transposeH     :   4024 us/call x  4129 calls
2019-09-14 11:40:08 14.65% fftMiddleOut   :   3850 us/call x  4129 calls
2019-09-14 11:40:08  1.58% fftH           :   4400 us/call x   390 calls
2019-09-14 11:40:08  1.42% fftP           :   3960 us/call x   390 calls
2019-09-14 11:40:08  1.06% carryB         :   4440 us/call x   260 calls
2019-09-14 11:40:08  0.91% carryA         :   3809 us/call x   258 calls
2019-09-14 11:40:08  0.85% fftW           :   3540 us/call x   260 calls
2019-09-14 11:40:08  0.53% multiply       :   4440 us/call x   130 calls
2019-09-14 11:40:08
2019-09-14 11:42:47 87005279       60000  0.07%; 22.751 ms/sq; ETA 22d 21:28; 6d81443958902b6b
2019-09-14 11:42:47 19.17% carryFused     :   4359 us/call x  6993 calls
2019-09-14 11:42:48 17.20% tailFused      :   3909 us/call x  7000 calls
2019-09-14 11:42:48 16.18% transposeH     :   3673 us/call x  7007 calls
2019-09-14 11:42:48 16.15% transposeW     :   3663 us/call x  7014 calls
2019-09-14 11:42:48 16.11% fftMiddleIn    :   3652 us/call x  7014 calls
2019-09-14 11:42:48 14.98% fftMiddleOut   :   3400 us/call x  7007 calls
2019-09-14 11:42:48  0.06% fftP           :   4457 us/call x    21 calls
2019-09-14 11:42:48  0.04% fftW           :   4457 us/call x    14 calls
2019-09-14 11:42:48  0.04% fftH           :   2971 us/call x    21 calls
2019-09-14 11:42:48  0.04% carryA         :   4457 us/call x    14 calls
2019-09-14 11:42:48  0.02% multiply       :   4457 us/call x     7 calls
2019-09-14 11:42:48
2019-09-14 11:43:01 Stopping, please wait..
2019-09-14 11:43:24 87005279 OK    61000  0.07%; 13.993 ms/sq; ETA 14d 01:57; be2af92c309064ef (check 22.32s)
2019-09-14 11:43:24 21.11% carryFused     :   3795 us/call x  1998 calls
2019-09-14 11:43:24 17.03% tailFused      :   3058 us/call x  2000 calls
2019-09-14 11:43:24 16.12% fftMiddleOut   :   2892 us/call x  2001 calls
2019-09-14 11:43:24 15.86% fftMiddleIn    :   2844 us/call x  2002 calls
2019-09-14 11:43:24 14.99% transposeW     :   2688 us/call x  2002 calls
2019-09-14 11:43:24 14.51% transposeH     :   2604 us/call x  2001 calls
2019-09-14 11:43:24  0.09% fftP           :   7800 us/call x     4 calls
2019-09-14 11:43:24  0.09% fftH           :  10400 us/call x     3 calls
2019-09-14 11:43:24  0.04% fftW           :   5200 us/call x     3 calls
2019-09-14 11:43:24  0.04% carryM         :  15600 us/call x     1 calls
2019-09-14 11:43:24  0.04% transposeIn    :  15600 us/call x     1 calls
2019-09-14 11:43:24  0.04% readResidue    :  15600 us/call x     1 calls
2019-09-14 11:43:24  0.04% isNotZero      :  15600 us/call x     1 calls
2019-09-14 11:43:24
2019-09-14 11:43:24 Exiting because "stop requested"
2019-09-14 11:43:24 Bye
Win7 Pro x64, GTX1080Ti, GW variant:
Code:
>gpuowl-win -device 0 -carry short -use ORIG_X2
2019-09-14 11:47:06 gpuowl
2019-09-14 11:47:06 Note: no config.txt file found
2019-09-14 11:47:06 config: -device 0 -carry short -use ORIG_X2
2019-09-14 11:47:06 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:47:06 using short carry kernels
2019-09-14 11:47:06 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.
1551b6b1158dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-st
d=CL2.0"
2019-09-14 11:47:10

2019-09-14 11:47:10 OpenCL compilation in 3474 ms
2019-09-14 11:47:11 87005279.owl not found, starting from the beginning.
2019-09-14 11:47:27 87005279 OK     2000  0.00%; 3483 us/sq; ETA 3d 12:10; e944fcb41cb63c80 (check 3.89s)
2019-09-14 11:48:30 87005279       20000  0.02%; 3522 us/sq; ETA 3d 13:06; 77e12e401949f647
2019-09-14 11:49:41 87005279       40000  0.05%; 3557 us/sq; ETA 3d 13:55; 3ccb222b85a3780d
2019-09-14 11:50:10 Stopping, please wait..
2019-09-14 11:50:14 87005279 OK    48000  0.06%; 3573 us/sq; ETA 3d 14:18; a316078024d009b0 (check 3.97s)
2019-09-14 11:50:14 Exiting because "stop requested"
2019-09-14 11:50:14 Bye
Win7 Pro x64, GTX1080Ti, v6.7-4-g278407a:
Code:
>gpuowl-win -device 0 -use ORIG_X2 -maxAlloc 10240 -user kriesel -cpu dodo-gtx1080ti
2019-09-14 11:51:36 gpuowl v6.7-4-g278407a
2019-09-14 11:51:36 Note: no config.txt file found
2019-09-14 11:51:36 config: -device 0 -use ORIG_X2 -maxAlloc 10240 -user kriesel -cpu dodo-gtx1080ti
2019-09-14 11:51:36 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:51:36 using short carry kernels
2019-09-14 11:51:36 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.
1551b6b1158dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:51:40

2019-09-14 11:51:40 OpenCL compilation in 3650 ms
2019-09-14 11:51:41 87005279.owl not found, starting from the beginning.
2019-09-14 11:51:49 87005279 OK     1000  0.00%; 3400 us/sq; ETA 3d 10:10; 00fdfddc9aeaa71f (check 2.09s)
2019-09-14 11:54:38 87005279       50000  0.06%; 3438 us/sq; ETA 3d 11:03; d3c2d8af5e987770
2019-09-14 11:57:32 87005279      100000  0.11%; 3478 us/sq; ETA 3d 11:58; 4ba1f423b8c71b64
2019-09-14 12:00:27 87005279      150000  0.17%; 3503 us/sq; ETA 3d 12:30; 229fc24f15398a56
2019-09-14 12:03:22 87005279      200000  0.23%; 3507 us/sq; ETA 3d 12:34; 75fc31e283600e79
2019-09-14 12:06:20 87005279 OK   250000  0.29%; 3506 us/sq; ETA 3d 12:30; 2d95d14b64b3f424 (check 2.11s)
2019-09-14 12:09:15 87005279      300000  0.34%; 3509 us/sq; ETA 3d 12:30; 543c72d2989ffcac
2019-09-14 12:12:11 87005279      350000  0.40%; 3510 us/sq; ETA 3d 12:30; 0e1f3273842b2f55
kriesel is online now   Reply With Quote
Old 2019-09-14, 17:58   #1368
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

11101011011112 Posts
Default

So it is slower. Thanks for the data.

Oddly the -time option shows my variant spending less time in fftMiddleIn than the production version spends in TransposeW + fftMiddleIn. So -time says it should be faster but the wall clock shows it isn't.

Back to the drawing board.
Prime95 is online now   Reply With Quote
Old 2019-09-14, 18:04   #1369
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by Prime95 View Post
So it is slower. Thanks for the data.
You're welcome. On GTX1080Ti it seems very close. There may be gpus where it is faster now.
kriesel is online now   Reply With Quote
Old 2019-09-14, 19:13   #1370
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by preda View Post
Looking for bug reports. P-1 savefile not implemented yet.
Be careful what you ask for?
Code:
>gpuowl-win -h
2019-09-14 13:24:58 gpuowl v6.10-0-gc1d0025

Command line options:

-dir <folder>      : specify work directory (containing worktodo.txt, results.txt, config.txt, gpuowl.log)
-user <name>       : specify the user name.
-cpu  <name>       : specify the hardware name.
-time              : display kernel profiling information.
-fft <size>        : specify FFT size, such as: 5000K, 4M, +2, -1.
-block <value>     : PRP GEC block size. Default 500. Smaller block is slower but detects errors sooner.
-log <step>        : log every <step> iterations, default 50000. Multiple of 10000.
-carry long|short  : force carry type. Short carry may be faster, but requires high bits/word.
-B1                : P-1 B1 bound, default 500000
-B2                : P-1 B2 bound, default B1 * 30
-rB2               : ratio of B2 to B1. Default 30, used only if B2 is not explicitly set
-prp <exponent>    : run a single PRP test and exit, ignoring worktodo.txt
-pm1 <exponent>    : run a single P-1 test and exit, ignoring worktodo.txt
-results <file>    : name of results file, default 'results.txt'
-iters <N>         : run next PRP test for <N> iterations and exit. Multiple of 10000.
-maxAlloc          : limit GPU memory usage to this value in MB
-use NEW_FFT8,OLD_FFT5,NEW_FFT10: comma separated list of defines, see the #if tests in gpuowl.cl (used for perf tuning).
-device <N>        : select a specific device:
 0 : Ellesmere-36@1266-28:00.0 Radeon (TM) RX 480 Graphics
 1 : gfx804-8@1203-03:00.0 Radeon 550 Series

FFT Configurations:
FFT    8K [  0.01M -    0.17M]  64-64
FFT   32K [  0.05M -    0.68M]  64-256 256-64
FFT   64K [  0.10M -    1.33M]  64-512 512-64
FFT  128K [  0.20M -    2.62M]  1K-64 64-1K 256-256
FFT  192K [  0.29M -    3.89M]  64-256-6
FFT  224K [  0.34M -    4.52M]  64-256-7
FFT  256K [  0.39M -    5.15M]  64-2K 256-512 512-256 2K-64
FFT  288K [  0.44M -    5.77M]  64-256-9
FFT  320K [  0.49M -    6.40M]  64-256-10
FFT  352K [  0.54M -    7.02M]  64-256-11
FFT  384K [  0.59M -    7.64M]  64-256-12 64-512-6
FFT  448K [  0.69M -    8.88M]  64-512-7
FFT  512K [  0.79M -   10.12M]  1K-256 256-1K 512-512 4K-64
FFT  576K [  0.88M -   11.35M]  64-512-9
FFT  640K [  0.98M -   12.58M]  64-512-10
FFT  704K [  1.08M -   13.81M]  64-512-11
FFT  768K [  1.18M -   15.03M]  64-512-12 64-1K-6 256-256-6
FFT  896K [  1.38M -   17.47M]  64-1K-7 256-256-7
FFT    1M [  1.57M -   19.89M]  1K-512 256-2K 512-1K 2K-256
FFT 1152K [  1.77M -   22.32M]  64-1K-9 256-256-9
FFT 1280K [  1.97M -   24.73M]  64-1K-10 256-256-10
FFT 1408K [  2.16M -   27.14M]  64-1K-11 256-256-11
FFT 1536K [  2.36M -   29.54M]  64-1K-12 64-2K-6 256-256-12 256-512-6 512-256-6
FFT 1792K [  2.75M -   34.33M]  64-2K-7 256-512-7 512-256-7
FFT    2M [  3.15M -   39.10M]  1K-1K 512-2K 2K-512 4K-256
FFT 2304K [  3.54M -   43.85M]  64-2K-9 256-512-9 512-256-9
FFT 2560K [  3.93M -   48.59M]  64-2K-10 256-512-10 512-256-10
FFT 2816K [  4.33M -   53.32M]  64-2K-11 256-512-11 512-256-11
FFT    3M [  4.72M -   58.04M]  1K-256-6 64-2K-12 256-512-12 256-1K-6 512-256-12 512-512-6
FFT 3584K [  5.51M -   67.44M]  1K-256-7 256-1K-7 512-512-7
FFT    4M [  6.29M -   76.81M]  1K-2K 2K-1K 4K-512
FFT 4608K [  7.08M -   86.15M]  1K-256-9 256-1K-9 512-512-9
FFT    5M [  7.86M -   95.46M]  1K-256-10 256-1K-10 512-512-10
FFT 5632K [  8.65M -  104.74M]  1K-256-11 256-1K-11 512-512-11
FFT    6M [  9.44M -  114.00M]  1K-256-12 1K-512-6 256-1K-12 256-2K-6 512-512-12 512-1K-6 2K-256-6
FFT    7M [ 11.01M -  132.46M]  1K-512-7 256-2K-7 512-1K-7 2K-256-7
FFT    8M [ 12.58M -  150.85M]  2K-2K 4K-1K
FFT    9M [ 14.16M -  169.18M]  1K-512-9 256-2K-9 512-1K-9 2K-256-9
FFT   10M [ 15.73M -  187.45M]  1K-512-10 256-2K-10 512-1K-10 2K-256-10
FFT   11M [ 17.30M -  205.67M]  1K-512-11 256-2K-11 512-1K-11 2K-256-11
FFT   12M [ 18.87M -  223.85M]  1K-512-12 1K-1K-6 256-2K-12 512-1K-12 512-2K-6 2K-256-12 2K-512-6 4K-256-6
FFT   14M [ 22.02M -  260.08M]  1K-1K-7 512-2K-7 2K-512-7 4K-256-7
FFT   16M [ 25.17M -  296.17M]  4K-2K
FFT   18M [ 28.31M -  332.13M]  1K-1K-9 512-2K-9 2K-512-9 4K-256-9
FFT   20M [ 31.46M -  367.98M]  1K-1K-10 512-2K-10 2K-512-10 4K-256-10
FFT   22M [ 34.60M -  403.74M]  1K-1K-11 512-2K-11 2K-512-11 4K-256-11
FFT   24M [ 37.75M -  439.40M]  1K-1K-12 1K-2K-6 512-2K-12 2K-512-12 2K-1K-6 4K-256-12 4K-512-6
FFT   28M [ 44.04M -  510.47M]  1K-2K-7 2K-1K-7 4K-512-7
FFT   36M [ 56.62M -  651.81M]  1K-2K-9 2K-1K-9 4K-512-9
FFT   40M [ 62.91M -  722.13M]  1K-2K-10 2K-1K-10 4K-512-10
FFT   44M [ 69.21M -  792.25M]  1K-2K-11 2K-1K-11 4K-512-11
FFT   48M [ 75.50M -  862.18M]  1K-2K-12 2K-1K-12 2K-2K-6 4K-512-12 4K-1K-6
FFT   56M [ 88.08M - 1001.57M]  2K-2K-7 4K-1K-7
FFT   72M [113.25M - 1278.70M]  2K-2K-9 4K-1K-9
FFT   80M [125.83M - 1416.57M]  2K-2K-10 4K-1K-10
FFT   88M [138.41M - 1554.04M]  2K-2K-11 4K-1K-11
FFT   96M [150.99M - 1691.15M]  2K-2K-12 4K-1K-12 4K-2K-6
FFT  112M [176.16M - 1964.39M]  4K-2K-7
FFT  144M [226.49M - 2507.57M]  4K-2K-9
FFT  160M [251.66M - 2777.78M]  4K-2K-10
FFT  176M [276.82M - 3047.18M]  4K-2K-11
FFT  192M [301.99M - 3315.86M]  4K-2K-12
2019-09-14 13:25:02 Exiting because "help"
2019-09-14 13:25:02 Bye

C:\msys64\home\ken\gpuowl-compile\v6.10-0-gc1d0025>gpuowl-win -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-14 13:38:16 gpuowl v6.10-0-gc1d0025
2019-09-14 13:38:16 Note: no config.txt file found
2019-09-14 13:38:16 config: -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-14 13:38:16 24000577 FFT 1280K: Width 8x8, Height 256x4, Middle 10; 18.31 bits/word
2019-09-14 13:38:16 using short carry kernels
2019-09-14 13:38:21 OpenCL args "-DEXP=24000577u -DWIDTH=64u -DSMALL_HEIGHT=1024u -DMIDDLE=10u -DWEIGHT_STEP=0xc.e5beac96a0b88p-3 -DIWEIGHT_STEP=0x9.eca8ba4660a
fp-4 -DWEIGHT_BIGSTEP=0xe.ac0c6e7dd2438p-3 -DIWEIGHT_BIGSTEP=0x8.b95c1e3ea8bd8p-4 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 13:38:24 OpenCL compilation in 3712 ms
2019-09-14 13:38:25 24000577 P1 B1=220000, B2=3960000, stage1 317550 bits
2019-09-14 13:38:54 24000577 P1    10000   3.15%; 2886 us/sq; ETA 0d 00:15; 7f995dc7dff7f8e0
2019-09-14 13:39:23 24000577 P1    20000   6.30%; 2885 us/sq; ETA 0d 00:14; f705474c0ac30c16
2019-09-14 13:39:52 24000577 P1    30000   9.45%; 2910 us/sq; ETA 0d 00:14; 3fc336b60ee971a2
2019-09-14 13:40:21 24000577 P1    40000  12.60%; 2890 us/sq; ETA 0d 00:13; 87c9fcec37cd0a71
2019-09-14 13:40:50 24000577 P1    50000  15.75%; 2885 us/sq; ETA 0d 00:13; f64948f68fb1d67b
2019-09-14 13:41:19 24000577 P1    60000  18.89%; 2894 us/sq; ETA 0d 00:12; c37c2d473cb0ea06
2019-09-14 13:41:48 24000577 P1    70000  22.04%; 2885 us/sq; ETA 0d 00:12; 5bd384b917eabb12
2019-09-14 13:42:17 24000577 P1    80000  25.19%; 2899 us/sq; ETA 0d 00:11; 91ea4d5d92dc1c29
2019-09-14 13:42:46 24000577 P1    90000  28.34%; 2904 us/sq; ETA 0d 00:11; 9c85386920ff8b45
2019-09-14 13:43:15 24000577 P1   100000  31.49%; 2898 us/sq; ETA 0d 00:11; 438848c849a426c8
2019-09-14 13:43:44 24000577 P1   110000  34.64%; 2898 us/sq; ETA 0d 00:10; 495bc594a2150ed6
2019-09-14 13:44:13 24000577 P1   120000  37.79%; 2885 us/sq; ETA 0d 00:09; 1bd1712dcb680f0d
2019-09-14 13:44:42 24000577 P1   130000  40.94%; 2898 us/sq; ETA 0d 00:09; d03e2db3fd19c843
2019-09-14 13:45:11 24000577 P1   140000  44.09%; 2891 us/sq; ETA 0d 00:09; 9fc5fa31b4959aed
2019-09-14 13:45:40 24000577 P1   150000  47.24%; 2891 us/sq; ETA 0d 00:08; ae6304c818c1f83e
2019-09-14 13:46:08 24000577 P1   160000  50.39%; 2883 us/sq; ETA 0d 00:08; fe8f0bada295328d
2019-09-14 13:46:37 24000577 P1   170000  53.53%; 2890 us/sq; ETA 0d 00:07; 3fd5a4ddb6841e9b
2019-09-14 13:47:07 24000577 P1   180000  56.68%; 2899 us/sq; ETA 0d 00:07; a6234de954685799
2019-09-14 13:47:35 24000577 P1   190000  59.83%; 2894 us/sq; ETA 0d 00:06; c873c91deeefba27
2019-09-14 13:48:04 24000577 P1   200000  62.98%; 2893 us/sq; ETA 0d 00:06; eb92d0b622962612
2019-09-14 13:48:34 24000577 P1   210000  66.13%; 2901 us/sq; ETA 0d 00:05; a64dbff6290ed34a
2019-09-14 13:49:03 24000577 P1   220000  69.28%; 2891 us/sq; ETA 0d 00:05; 7f49b2efd2a795fe
2019-09-14 13:49:32 24000577 P1   230000  72.43%; 2893 us/sq; ETA 0d 00:04; 9884971a1fc42886
2019-09-14 13:50:00 24000577 P1   240000  75.58%; 2893 us/sq; ETA 0d 00:04; ba30a7d0f33bde93
2019-09-14 13:50:30 24000577 P1   250000  78.73%; 2898 us/sq; ETA 0d 00:03; bb8984fecf1af62a
2019-09-14 13:50:58 24000577 P1   260000  81.88%; 2891 us/sq; ETA 0d 00:03; efb3c97f53545dbb
2019-09-14 13:51:28 24000577 P1   270000  85.03%; 2901 us/sq; ETA 0d 00:02; 405373760718e67c
2019-09-14 13:51:57 24000577 P1   280000  88.18%; 2894 us/sq; ETA 0d 00:02; a612ab69e780c283
2019-09-14 13:52:25 24000577 P1   290000  91.32%; 2890 us/sq; ETA 0d 00:01; 740645b16c6380fe
2019-09-14 13:52:55 24000577 P1   300000  94.47%; 2894 us/sq; ETA 0d 00:01; 50ed1a7837d59607
2019-09-14 13:53:24 24000577 P1   310000  97.62%; 2910 us/sq; ETA 0d 00:00; 21a38a3fd1fa6582
2019-09-14 13:53:46 24000577 P1   317550 100.00%; 2893 us/sq; ETA 0d 00:00; 7acca8667b4d2492
2019-09-14 13:53:46 P-1 (B1=220000, B2=3960000, D=30030): primes 260946, expanded 262000, doubles 47491 (left 166492), singles 165964, total 213455 (82%)
2019-09-14 13:53:46 24000577 P2 using blocks [7 - 132] to cover 213455 primes
2019-09-14 13:53:46 24000577 P2 using 770 buffers of 10.0 MB each
2019-09-14 13:56:49 24000577 P2  770/2880: setup 11809 ms; 3029 us/prime, 56682 primes
2019-09-14 13:56:49 24000577 P1 GCD: no factor
2019-09-14 13:59:54 24000577 P2 1540/2880: setup 11793 ms; 3028 us/prime, 57130 primes
2019-09-14 14:02:59 24000577 P2 2310/2880: setup 11856 ms; 3030 us/prime, 57186 primes
2019-09-14 14:05:17 24000577 P2 2880/2880: setup 8720 ms; 3036 us/prime, 42457 primes
2019-09-14 14:05:17 1257787 FFT 64K: Width 8x8, Height 64x8; 19.19 bits/word
2019-09-14 14:05:17 using short carry kernels
2019-09-14 14:05:17 OpenCL args "-DEXP=1257787u -DWIDTH=64u -DSMALL_HEIGHT=512u -DMIDDLE=1u -DWEIGHT_STEP=0xe.00d75658c47c8p-3 -DIWEIGHT_STEP=0x9.2405b0b5f2d88p
-4 -DWEIGHT_BIGSTEP=0xc.5672a115506d8p-3 -DIWEIGHT_BIGSTEP=0xa.5fed6a9b15138p-4 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 14:05:21 OpenCL compilation in 3634 ms
2019-09-14 14:05:21 C:\msys64\home\ken\gpuowl-compile\v6.10-0-gc1d0025\1257787\1257787.owl not found
2019-09-14 14:05:21 C:\msys64\home\ken\gpuowl-compile\v6.10-0-gc1d0025\1257787\1257787-old.owl not found
2019-09-14 14:05:21 starting from the beginning.
2019-09-14 14:05:21 1257787 OK     1000   0.08%;  202 us/sq; ETA 0d 00:04; 91d0e6e562cb2541 (check 0.11s)
2019-09-14 14:05:32 1257787       50000   3.97%;  212 us/sq; ETA 0d 00:04; d7ea0488d047e5e4
2019-09-14 14:05:34 24000577 P2 GCD: 13504596665207
2019-09-14 14:05:34 {"exponent":"24000577", "worktype":"PM1", "status":"F", "program":{"name":"gpuowl", "version":"v6.10-0-gc1d0025"}, "timestamp":"2019-09-14 1
9:05:34 UTC", "user":"kriesel", "computer":"condorella/rx480", "aid":"0", "fft-length":1310720, "B1":220000, "B2":3960000, "factors":["13504596665207"]}
2019-09-14 14:05:42 1257787      100000   7.95%;  216 us/sq; ETA 0d 00:04; 09f25999ff3326ca
2019-09-14 14:05:53 1257787      150000  11.92%;  214 us/sq; ETA 0d 00:04; 367d63ab9a7b46d5
2019-09-14 14:06:04 1257787      200000  15.90%;  215 us/sq; ETA 0d 00:04; 25ebe34e39ca647b
2019-09-14 14:06:15 1257787 OK   250000  19.87%;  215 us/sq; ETA 0d 00:04; 564fdae0bb5a37b1 (check 0.12s)
2019-09-14 14:06:26 1257787      300000  23.85%;  215 us/sq; ETA 0d 00:03; 79b4d6cb0169a9b0
2019-09-14 14:06:36 1257787      350000  27.82%;  217 us/sq; ETA 0d 00:03; 0b9b51c4f7638fd3
2019-09-14 14:06:47 1257787      400000  31.80%;  216 us/sq; ETA 0d 00:03; fe2bfeea5734dd7c
2019-09-14 14:06:58 1257787      450000  35.77%;  216 us/sq; ETA 0d 00:03; 16fa53053e566011
2019-09-14 14:07:09 1257787 OK   500000  39.75%;  215 us/sq; ETA 0d 00:03; 7838f365c8c78d0c (check 0.14s)
terminate called after throwing an instance of 'std::invalid_argument'
  what():  stoi

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.
Code:
Problem signature:
  Problem Event Name:    APPCRASH
  Application Name:    gpuowl-win.exe
  Application Version:    0.0.0.0
  Application Timestamp:    00000000
  Fault Module Name:    gpuowl-win.exe
  Fault Module Version:    0.0.0.0
  Fault Module Timestamp:    00000000
  Exception Code:    40000015
  Exception Offset:    000000000005e386
  OS Version:    6.1.7601.2.1.0.256.48
  Locale ID:    1033
  Additional Information 1:    91c7
  Additional Information 2:    91c775c91db222fe910a2744dc0825a6
  Additional Information 3:    de11
  Additional Information 4:    de11727f51f0f73173ea2f6b995e9dc2

Read our privacy statement online:
  http://go.microsoft.com/fwlink/?linkid=104288&clcid=0x0409

If the online privacy statement is not available, please read our privacy statement offline:
  C:\Windows\system32\en-US\erofflps.txt
kriesel is online now   Reply With Quote
Old 2019-09-15, 12:03   #1371
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

101010110112 Posts
Default

Quote:
Originally Posted by kriesel View Post
Be careful what you ask for?
I wonder what happened at that point, 39% into the PRP test. Can you trigger it reliably? I pushed an attempted fix, could you check whether it still happens?
preda is offline   Reply With Quote
Old 2019-09-15, 12:48   #1372
preda
 
preda's Avatar
 
"Mihai Preda"
Apr 2015

3·457 Posts
Default

Quote:
Originally Posted by Prime95 View Post
Bug report, but not P-1: Gerbicz error count not reported in JSON result.

It used to, see this commit:

https://github.com/preda/gpuowl/comm...37df13563a0f0f
I suspect that what is needed is a boolean, indicating whether the PRP test was was done with/without the Gerbicz error check. Would a bool be enough? Given that gpuowl never did PRP without GEC, that bool is always true for GpuOwl (i.e. implied by the program info, which is part of the result).

My dislike of "error-count" is caused by the fact that, with GEC and roll-backs, the error-count of the result is always 0, as there is no error *included* in the chain of computation begin-to-result. Let me give an example:

Let's say the user start a PRP test of some exponent N. At 50% in the test, a GEC error is detected. The user now starts a whole new PRP(N) test again, from the beginning (e.g. by deleting all the savefiles for N). This second test runs to completion without incident. What should the error-count reported in the result of the second test be? (I suppose 0?)

But what if the user, when a GEC error is detected at 50%, instead of starting from the beginnig (0%) starts from a savefile at 10%, and the computation runs without incident to completion, what should the error-count be? (the savefile at 10% is GEC verified good) Again I suppose 0, beause there was no error in this test result -- no error from beginning to 10%, and no error from 10% to end.

But what the software does automatically on a GEC error is similar to that user restarting from 10% -- it loads a good savefile, verified, with 0 errors in it, and runs from there to completion without incident (or cancels the test and starts another in the case of another GEC error, etc).

[Another way to see it, is that the state of a test should be contained fully in the savefile. Loading a savefile, manually or on a rollback, should re-instate the state from the savefile. In addition, GpuOwl never creates a savefile that didn't pass GEC. Reasoning this way, an "error-count" that is stored in the savefile can never be different from 0]

I suppose it would not be useful if GpuOwl added invariably an information "error-count":"0" to every PRP result?

Another problem is that GEC errors can also originate from a too-small FFT size (in GpuOwl's case), but that is no indication on the health of the hardware.

Is the goal to put a bearing on the "health" of a particular GPU? -- but that would still not affect the validity of the PRP result. And the health of a GPU is not limited to a single test -- e.g. a GPU that often produces GEC errors may still have full runs without errors from time to time, how does that affect the reliability of the result?

So, is in fact what is needed a bool indicating whether the GEC was performed or not?
preda is offline   Reply With Quote
Old 2019-09-15, 14:29   #1373
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by preda View Post
I suspect that what is needed is a boolean, indicating whether the PRP test was was done with/without the Gerbicz error check. Would a bool be enough? Given that gpuowl never did PRP without GEC, that bool is always true for GpuOwl (i.e. implied by the program info, which is part of the result).

My dislike of "error-count" is caused by the fact that, with GEC and roll-backs, the error-count of the result is always 0, as there is no error *included* in the chain of computation begin-to-result.
The nonzero integer count of GEC errors detected during a run is still useful information. It indicates a confounded combination of hardware issues affecting raw reliability and throughput, and fft length exponent limits allowing roundoff error to generate significant error. That is useful information even if the errors are detected and corrected by retries and so the gpuowl result is only delayed and otherwise unaffected. The delay effect can be considerable. I have seen a case in prime95 where the hardware was so unreliable that progress in PRP/GEC could no longer be made. In one gpuowl V1.9 case there was a GEC error count of around 150, when near the stated limits of its fft lengths. A gpu or cpu that generates a lot of PRP GEC errors should not be running LL with or without the Jacobi check. Not all users examine logs or console for clues to error rate. Omitting detected error count from results makes it easier for the less attentive users to submit results from less reliable hardware and remain unaware their hardware is unreliable.
kriesel is online now   Reply With Quote
Old 2019-09-15, 15:15   #1374
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

5×11×137 Posts
Default

Quote:
Originally Posted by preda View Post
My dislike of "error-count" is caused by the fact that, with GEC and roll-backs, the error-count of the result is always 0, as there is no error *included* in the chain of computation begin-to-result.
The count is useful for two reasons that I can see:

1) It lets the user monitor hardware health. This is especially nice for headless operations. Rather than ssh into each GPU machine and grepping the log files, I can program the server to email a user whenever a non-zero error count is reported (this feature exists now for prime95 LL tests).

2) It lets us spot double check these PRP results someday. The first prime95 implementation had some windows of vulnerability. If there any vulnerabilities remaining, these machines would be the most likely to find them.
Prime95 is online now   Reply With Quote
Old 2019-09-15, 15:52   #1375
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by preda View Post
I wonder what happened at that point, 39% into the PRP test. Can you trigger it reliably? I pushed an attempted fix, could you check whether it still happens?
Approximately reproduced in 6.10-1. Renaming to a filename that already exists, in this case in subfolder <exponent> is a problem.

Code:
>gpuowl-win -h
2019-09-15 09:55:59 gpuowl v6.10-1-gea7d51c

Command line options:

-dir <folder>      : specify work directory (containing worktodo.txt, results.txt, config.txt, gpuowl.log)
-user <name>       : specify the user name.
-cpu  <name>       : specify the hardware name.
-time              : display kernel profiling information.
-fft <size>        : specify FFT size, such as: 5000K, 4M, +2, -1.
-block <value>     : PRP GEC block size. Default 500. Smaller block is slower but detects errors sooner.
-log <step>        : log every <step> iterations, default 50000. Multiple of 10000.
-carry long|short  : force carry type. Short carry may be faster, but requires high bits/word.
-B1                : P-1 B1 bound, default 500000
-B2                : P-1 B2 bound, default B1 * 30
-rB2               : ratio of B2 to B1. Default 30, used only if B2 is not explicitly set
-prp <exponent>    : run a single PRP test and exit, ignoring worktodo.txt
-pm1 <exponent>    : run a single P-1 test and exit, ignoring worktodo.txt
-results <file>    : name of results file, default 'results.txt'
-iters <N>         : run next PRP test for <N> iterations and exit. Multiple of 10000.
-maxAlloc          : limit GPU memory usage to this value in MB
-use NEW_FFT8,OLD_FFT5,NEW_FFT10: comma separated list of defines, see the #if tests in gpuowl.cl (used for perf tuning).
-device <N>        : select a specific device:
 0 : Ellesmere-36@1266-28:00.0 Radeon (TM) RX 480 Graphics
 1 : gfx804-8@1203-03:00.0 Radeon 550 Series

FFT Configurations:
FFT    8K [  0.01M -    0.17M]  64-64
FFT   32K [  0.05M -    0.68M]  64-256 256-64
FFT   64K [  0.10M -    1.33M]  64-512 512-64
FFT  128K [  0.20M -    2.62M]  1K-64 64-1K 256-256
FFT  192K [  0.29M -    3.89M]  64-256-6
FFT  224K [  0.34M -    4.52M]  64-256-7
FFT  256K [  0.39M -    5.15M]  64-2K 256-512 512-256 2K-64
FFT  288K [  0.44M -    5.77M]  64-256-9
FFT  320K [  0.49M -    6.40M]  64-256-10
FFT  352K [  0.54M -    7.02M]  64-256-11
FFT  384K [  0.59M -    7.64M]  64-256-12 64-512-6
FFT  448K [  0.69M -    8.88M]  64-512-7
FFT  512K [  0.79M -   10.12M]  1K-256 256-1K 512-512 4K-64
FFT  576K [  0.88M -   11.35M]  64-512-9
FFT  640K [  0.98M -   12.58M]  64-512-10
FFT  704K [  1.08M -   13.81M]  64-512-11
FFT  768K [  1.18M -   15.03M]  64-512-12 64-1K-6 256-256-6
FFT  896K [  1.38M -   17.47M]  64-1K-7 256-256-7
FFT    1M [  1.57M -   19.89M]  1K-512 256-2K 512-1K 2K-256
FFT 1152K [  1.77M -   22.32M]  64-1K-9 256-256-9
FFT 1280K [  1.97M -   24.73M]  64-1K-10 256-256-10
FFT 1408K [  2.16M -   27.14M]  64-1K-11 256-256-11
FFT 1536K [  2.36M -   29.54M]  64-1K-12 64-2K-6 256-256-12 256-512-6 512-256-6
FFT 1792K [  2.75M -   34.33M]  64-2K-7 256-512-7 512-256-7
FFT    2M [  3.15M -   39.10M]  1K-1K 512-2K 2K-512 4K-256
FFT 2304K [  3.54M -   43.85M]  64-2K-9 256-512-9 512-256-9
FFT 2560K [  3.93M -   48.59M]  64-2K-10 256-512-10 512-256-10
FFT 2816K [  4.33M -   53.32M]  64-2K-11 256-512-11 512-256-11
FFT    3M [  4.72M -   58.04M]  1K-256-6 64-2K-12 256-512-12 256-1K-6 512-256-12 512-512-6
FFT 3584K [  5.51M -   67.44M]  1K-256-7 256-1K-7 512-512-7
FFT    4M [  6.29M -   76.81M]  1K-2K 2K-1K 4K-512
FFT 4608K [  7.08M -   86.15M]  1K-256-9 256-1K-9 512-512-9
FFT    5M [  7.86M -   95.46M]  1K-256-10 256-1K-10 512-512-10
FFT 5632K [  8.65M -  104.74M]  1K-256-11 256-1K-11 512-512-11
FFT    6M [  9.44M -  114.00M]  1K-256-12 1K-512-6 256-1K-12 256-2K-6 512-512-12 512-1K-6 2K-256-6
FFT    7M [ 11.01M -  132.46M]  1K-512-7 256-2K-7 512-1K-7 2K-256-7
FFT    8M [ 12.58M -  150.85M]  2K-2K 4K-1K
FFT    9M [ 14.16M -  169.18M]  1K-512-9 256-2K-9 512-1K-9 2K-256-9
FFT   10M [ 15.73M -  187.45M]  1K-512-10 256-2K-10 512-1K-10 2K-256-10
FFT   11M [ 17.30M -  205.67M]  1K-512-11 256-2K-11 512-1K-11 2K-256-11
FFT   12M [ 18.87M -  223.85M]  1K-512-12 1K-1K-6 256-2K-12 512-1K-12 512-2K-6 2K-256-12 2K-512-6 4K-256-6
FFT   14M [ 22.02M -  260.08M]  1K-1K-7 512-2K-7 2K-512-7 4K-256-7
FFT   16M [ 25.17M -  296.17M]  4K-2K
FFT   18M [ 28.31M -  332.13M]  1K-1K-9 512-2K-9 2K-512-9 4K-256-9
FFT   20M [ 31.46M -  367.98M]  1K-1K-10 512-2K-10 2K-512-10 4K-256-10
FFT   22M [ 34.60M -  403.74M]  1K-1K-11 512-2K-11 2K-512-11 4K-256-11
FFT   24M [ 37.75M -  439.40M]  1K-1K-12 1K-2K-6 512-2K-12 2K-512-12 2K-1K-6 4K-256-12 4K-512-6
FFT   28M [ 44.04M -  510.47M]  1K-2K-7 2K-1K-7 4K-512-7
FFT   36M [ 56.62M -  651.81M]  1K-2K-9 2K-1K-9 4K-512-9
FFT   40M [ 62.91M -  722.13M]  1K-2K-10 2K-1K-10 4K-512-10
FFT   44M [ 69.21M -  792.25M]  1K-2K-11 2K-1K-11 4K-512-11
FFT   48M [ 75.50M -  862.18M]  1K-2K-12 2K-1K-12 2K-2K-6 4K-512-12 4K-1K-6
FFT   56M [ 88.08M - 1001.57M]  2K-2K-7 4K-1K-7
FFT   72M [113.25M - 1278.70M]  2K-2K-9 4K-1K-9
FFT   80M [125.83M - 1416.57M]  2K-2K-10 4K-1K-10
FFT   88M [138.41M - 1554.04M]  2K-2K-11 4K-1K-11
FFT   96M [150.99M - 1691.15M]  2K-2K-12 4K-1K-12 4K-2K-6
FFT  112M [176.16M - 1964.39M]  4K-2K-7
FFT  144M [226.49M - 2507.57M]  4K-2K-9
FFT  160M [251.66M - 2777.78M]  4K-2K-10
FFT  176M [276.82M - 3047.18M]  4K-2K-11
FFT  192M [301.99M - 3315.86M]  4K-2K-12
2019-09-15 09:56:07 Exiting because "help"
2019-09-15 09:56:07 Bye

C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c>g610

C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c>gpuowl-win -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-15 10:06:04 gpuowl v6.10-1-gea7d51c
2019-09-15 10:06:04 Note: no config.txt file found
2019-09-15 10:06:04 config: -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-15 10:06:04 24000577 FFT 1280K: Width 8x8, Height 256x4, Middle 10; 18.31 bits/word
2019-09-15 10:06:04 using short carry kernels
2019-09-15 10:06:11 OpenCL args "-DEXP=24000577u -DWIDTH=64u -DSMALL_HEIGHT=1024u -DMIDDLE=10u -DWEIGHT_STEP=0xc.e5beac96a0b88p-3 -DIWEIGHT_STEP=0x9.eca8ba4660a
fp-4 -DWEIGHT_BIGSTEP=0xe.ac0c6e7dd2438p-3 -DIWEIGHT_BIGSTEP=0x8.b95c1e3ea8bd8p-4 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-15 10:06:15 OpenCL compilation in 3488 ms
2019-09-15 10:06:15 24000577 P1 B1=220000, B2=3960000, stage1 317550 bits
2019-09-15 10:06:44 24000577 P1    10000   3.15%; 2878 us/sq; ETA 0d 00:15; 7f995dc7dff7f8e0
2019-09-15 10:07:13 24000577 P1    20000   6.30%; 2878 us/sq; ETA 0d 00:14; f705474c0ac30c16
2019-09-15 10:07:41 24000577 P1    30000   9.45%; 2880 us/sq; ETA 0d 00:14; 3fc336b60ee971a2
2019-09-15 10:08:10 24000577 P1    40000  12.60%; 2873 us/sq; ETA 0d 00:13; 87c9fcec37cd0a71
2019-09-15 10:08:39 24000577 P1    50000  15.75%; 2884 us/sq; ETA 0d 00:13; f64948f68fb1d67b
2019-09-15 10:09:08 24000577 P1    60000  18.89%; 2876 us/sq; ETA 0d 00:12; c37c2d473cb0ea06
2019-09-15 10:09:37 24000577 P1    70000  22.04%; 2879 us/sq; ETA 0d 00:12; 5bd384b917eabb12
2019-09-15 10:10:05 24000577 P1    80000  25.19%; 2873 us/sq; ETA 0d 00:11; 91ea4d5d92dc1c29
2019-09-15 10:10:34 24000577 P1    90000  28.34%; 2874 us/sq; ETA 0d 00:11; 9c85386920ff8b45
2019-09-15 10:11:03 24000577 P1   100000  31.49%; 2870 us/sq; ETA 0d 00:10; 438848c849a426c8
2019-09-15 10:11:32 24000577 P1   110000  34.64%; 2882 us/sq; ETA 0d 00:10; 495bc594a2150ed6
2019-09-15 10:12:00 24000577 P1   120000  37.79%; 2867 us/sq; ETA 0d 00:09; 1bd1712dcb680f0d
2019-09-15 10:12:29 24000577 P1   130000  40.94%; 2882 us/sq; ETA 0d 00:09; d03e2db3fd19c843
2019-09-15 10:12:58 24000577 P1   140000  44.09%; 2891 us/sq; ETA 0d 00:09; 9fc5fa31b4959aed
2019-09-15 10:13:27 24000577 P1   150000  47.24%; 2874 us/sq; ETA 0d 00:08; ae6304c818c1f83e
2019-09-15 10:13:56 24000577 P1   160000  50.39%; 2872 us/sq; ETA 0d 00:08; fe8f0bada295328d
2019-09-15 10:14:25 24000577 P1   170000  53.53%; 2870 us/sq; ETA 0d 00:07; 3fd5a4ddb6841e9b
2019-09-15 10:14:53 24000577 P1   180000  56.68%; 2878 us/sq; ETA 0d 00:07; a6234de954685799
2019-09-15 10:15:22 24000577 P1   190000  59.83%; 2878 us/sq; ETA 0d 00:06; c873c91deeefba27
2019-09-15 10:15:51 24000577 P1   200000  62.98%; 2874 us/sq; ETA 0d 00:06; eb92d0b622962612
2019-09-15 10:16:20 24000577 P1   210000  66.13%; 2880 us/sq; ETA 0d 00:05; a64dbff6290ed34a
2019-09-15 10:16:49 24000577 P1   220000  69.28%; 2869 us/sq; ETA 0d 00:05; 7f49b2efd2a795fe
2019-09-15 10:17:18 24000577 P1   230000  72.43%; 2870 us/sq; ETA 0d 00:04; 9884971a1fc42886
2019-09-15 10:17:46 24000577 P1   240000  75.58%; 2878 us/sq; ETA 0d 00:04; ba30a7d0f33bde93
2019-09-15 10:18:15 24000577 P1   250000  78.73%; 2869 us/sq; ETA 0d 00:03; bb8984fecf1af62a
2019-09-15 10:18:44 24000577 P1   260000  81.88%; 2877 us/sq; ETA 0d 00:03; efb3c97f53545dbb
2019-09-15 10:19:13 24000577 P1   270000  85.03%; 2875 us/sq; ETA 0d 00:02; 405373760718e67c
2019-09-15 10:19:42 24000577 P1   280000  88.18%; 2880 us/sq; ETA 0d 00:02; a612ab69e780c283
2019-09-15 10:20:11 24000577 P1   290000  91.32%; 2891 us/sq; ETA 0d 00:01; 740645b16c6380fe
2019-09-15 10:20:39 24000577 P1   300000  94.47%; 2878 us/sq; ETA 0d 00:01; 50ed1a7837d59607
2019-09-15 10:21:08 24000577 P1   310000  97.62%; 2877 us/sq; ETA 0d 00:00; 21a38a3fd1fa6582
2019-09-15 10:21:30 24000577 P1   317550 100.00%; 2878 us/sq; ETA 0d 00:00; 7acca8667b4d2492
2019-09-15 10:21:30 P-1 (B1=220000, B2=3960000, D=30030): primes 260946, expanded 262000, doubles 47491 (left 166492), singles 165964, total 213455 (82%)
2019-09-15 10:21:30 24000577 P2 using blocks [7 - 132] to cover 213455 primes
2019-09-15 10:21:30 24000577 P2 using 770 buffers of 10.0 MB each
2019-09-15 10:24:34 24000577 P2  770/2880: setup 11824 ms; 3025 us/prime, 56682 primes
2019-09-15 10:24:34 24000577 P1 GCD: no factor
2019-09-15 10:27:38 24000577 P2 1540/2880: setup 11778 ms; 3025 us/prime, 57130 primes
2019-09-15 10:30:43 24000577 P2 2310/2880: setup 11793 ms; 3026 us/prime, 57186 primes
2019-09-15 10:33:01 24000577 P2 2880/2880: setup 8720 ms; 3032 us/prime, 42457 primes
2019-09-15 10:33:01 1257787 FFT 64K: Width 8x8, Height 64x8; 19.19 bits/word
2019-09-15 10:33:01 using short carry kernels
2019-09-15 10:33:01 OpenCL args "-DEXP=1257787u -DWIDTH=64u -DSMALL_HEIGHT=512u -DMIDDLE=1u -DWEIGHT_STEP=0xe.00d75658c47c8p-3 -DIWEIGHT_STEP=0x9.2405b0b5f2d88p
-4 -DWEIGHT_BIGSTEP=0xc.5672a115506d8p-3 -DIWEIGHT_BIGSTEP=0xa.5fed6a9b15138p-4 -DORIG_X2=1  -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-15 10:33:04 OpenCL compilation in 3712 ms
2019-09-15 10:33:04 C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787\1257787.owl not found
2019-09-15 10:33:04 C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787\1257787-old.owl not found
2019-09-15 10:33:04 starting from the beginning.
2019-09-15 10:33:05 1257787 OK     1000   0.08%;  202 us/sq; ETA 0d 00:04; 91d0e6e562cb2541 (check 0.11s)
2019-09-15 10:33:15 1257787       50000   3.97%;  213 us/sq; ETA 0d 00:04; d7ea0488d047e5e4
2019-09-15 10:33:18 24000577 P2 GCD: 13504596665207
2019-09-15 10:33:18 {"exponent":"24000577", "worktype":"PM1", "status":"F", "program":{"name":"gpuowl", "version":"v6.10-1-gea7d51c"}, "timestamp":"2019-09-15 1
5:33:18 UTC", "user":"kriesel", "computer":"condorella/rx480", "aid":"0", "fft-length":1310720, "B1":220000, "B2":3960000, "factors":["13504596665207"]}
2019-09-15 10:33:26 1257787      100000   7.95%;  214 us/sq; ETA 0d 00:04; 09f25999ff3326ca
2019-09-15 10:33:37 1257787      150000  11.92%;  213 us/sq; ETA 0d 00:04; 367d63ab9a7b46d5
2019-09-15 10:33:47 1257787      200000  15.90%;  212 us/sq; ETA 0d 00:04; 25ebe34e39ca647b
2019-09-15 10:33:58 1257787 OK   250000  19.87%;  212 us/sq; ETA 0d 00:04; 564fdae0bb5a37b1 (check 0.12s)
2019-09-15 10:34:09 1257787      300000  23.85%;  214 us/sq; ETA 0d 00:03; 79b4d6cb0169a9b0
2019-09-15 10:34:20 1257787      350000  27.82%;  213 us/sq; ETA 0d 00:03; 0b9b51c4f7638fd3
2019-09-15 10:34:30 1257787      400000  31.80%;  213 us/sq; ETA 0d 00:03; fe2bfeea5734dd7c
2019-09-15 10:34:41 1257787      450000  35.77%;  213 us/sq; ETA 0d 00:03; 16fa53053e566011
2019-09-15 10:34:52 1257787 OK   500000  39.75%;  213 us/sq; ETA 0d 00:03; 7838f365c8c78d0c (check 0.11s)
2019-09-15 10:34:52 Exception NSt10filesystem7__cxx1116filesystem_errorE: filesystem error: cannot rename: File exists [C:\msys64\home\ken\gpuowl-compile\v6.10-
1-gea7d51c\1257787\1257787-new.owl] [C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787\1257787.owl]
2019-09-15 10:34:52 Bye
C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c>dir 1257787
 Volume in drive C has no label.
 Volume Serial Number is 3E40-A384

 Directory of C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787

09/15/2019  10:34 AM    <DIR>          .
09/15/2019  10:34 AM    <DIR>          ..
09/15/2019  10:34 AM           157,270 1257787-new.owl
09/15/2019  10:33 AM           157,268 1257787-old.owl
09/15/2019  10:33 AM           157,270 1257787.owl
               3 File(s)        471,808 bytes
               2 Dir(s)  863,544,840,192 bytes free
Worktodo for anyone to try reproducing it:
Code:
B1=220000,B2=3960000;PFactor=0,1,2,24000577,-1,76,2
PRP=0,1,2,1257787,-1,70,0
The PFactor line is probably unnecessary. V6.10-0 did the same; 3 files, then boom. Also note that both V6.10-0 and 6.10-1 left a worktodo.bak behind. There may be a similar rename to existing filename issue with the worktodo file. I vaguely recall there being an issue like this rename fail due to existing target filenamewith gpuowl some months back.

Last fiddled with by kriesel on 2019-09-15 at 16:01
kriesel is online now   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
GPUOWL AMD Windows OpenCL issues xx005fs GpuOwl 0 2019-07-26 21:37
Testing an expression for primality 1260 Software 17 2015-08-28 01:35
Testing Mersenne cofactors for primality? CRGreathouse Computer Science & Computational Number Theory 18 2013-06-08 19:12
Primality-testing program with multiple types of moduli (PFGW-related) Unregistered Information & Answers 4 2006-10-04 22:38

All times are UTC. The time now is 20:30.


Sun Aug 1 20:30:51 UTC 2021 up 9 days, 14:59, 0 users, load averages: 2.67, 2.31, 1.95

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.