![]() |
|
|
#1365 |
|
Romulan Interpreter
Jun 2011
Thailand
72×197 Posts |
|
|
|
|
|
|
#1366 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
1D6F16 Posts |
|
|
|
|
|
|
#1367 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Quote:
For the same 87M exponent, starting from zero, each separate folder: Win7 Pro x64, RX480, GW variant: Code:
>gpuowl-win -h
2019-09-14 11:10:12 gpuowl
Command line options:
-dir <folder> : specify work directory (containing worktodo.txt, results.txt, config.txt, gpuowl.log)
-user <name> : specify the user name.
-cpu <name> : specify the hardware name.
-time : display kernel profiling information.
-fft <size> : specify FFT size, such as: 5000K, 4M, +2, -1.
-block <value> : PRP GEC block size. Default 1000. Smaller block is slower but detects errors sooner.
-log <step> : log every <step> iterations, default 20000. Multiple of 10000.
-carry long|short : force carry type. Short carry may be faster, but requires high bits/word.
-B1 : P-1 B1 bound, default 500000
-B2 : P-1 B2 bound, default B1 * 30
-rB2 : ratio of B2 to B1. Default 30, used only if B2 is not explicitly set
-prp <exponent> : run a single PRP test and exit, ignoring worktodo.txt
-pm1 <exponent> : run a single P-1 test and exit, ignoring worktodo.txt
-results <file> : name of results file, default 'results.txt'
-iters <N> : run next PRP test for <N> iterations and exit. Multiple of 10000.
-use NEW_FFT8,OLD_FFT5,NEW_FFT10: comma separated list of defines, see the #if tests in gpuowl.cl (used for perf tuning).
-device <N> : select a specific device:
0 : Ellesmere-36x1266-@28:0.0 Radeon (TM) RX 480 Graphics
1 : gfx804-8x1203-@3:0.0 Radeon 550 Series
FFT Configurations:
FFT 8K [ 0.01M - 0.18M] 64-64
FFT 32K [ 0.05M - 0.68M] 64-256 256-64
FFT 64K [ 0.10M - 1.34M] 64-512 512-64
FFT 128K [ 0.20M - 2.63M] 1K-64 64-1K 256-256
FFT 192K [ 0.29M - 3.91M] 64-256-6
FFT 224K [ 0.34M - 4.54M] 64-256-7
FFT 256K [ 0.39M - 5.18M] 64-2K 256-512 512-256 2K-64
FFT 288K [ 0.44M - 5.81M] 64-256-9
FFT 320K [ 0.49M - 6.44M] 64-256-10
FFT 352K [ 0.54M - 7.06M] 64-256-11
FFT 384K [ 0.59M - 7.69M] 64-256-12 64-512-6
FFT 448K [ 0.69M - 8.94M] 64-512-7
FFT 512K [ 0.79M - 10.18M] 1K-256 256-1K 512-512 4K-64
FFT 576K [ 0.88M - 11.42M] 64-512-9
FFT 640K [ 0.98M - 12.66M] 64-512-10
FFT 704K [ 1.08M - 13.89M] 64-512-11
FFT 768K [ 1.18M - 15.12M] 64-512-12 64-1K-6 256-256-6
FFT 896K [ 1.38M - 17.57M] 64-1K-7 256-256-7
FFT 1M [ 1.57M - 20.02M] 1K-512 256-2K 512-1K 2K-256
FFT 1152K [ 1.77M - 22.45M] 64-1K-9 256-256-9
FFT 1280K [ 1.97M - 24.88M] 64-1K-10 256-256-10
FFT 1408K [ 2.16M - 27.31M] 64-1K-11 256-256-11
FFT 1536K [ 2.36M - 29.72M] 64-1K-12 64-2K-6 256-256-12 256-512-6 512-256-6
FFT 1792K [ 2.75M - 34.54M] 64-2K-7 256-512-7 512-256-7
FFT 2M [ 3.15M - 39.34M] 1K-1K 512-2K 2K-512 4K-256
FFT 2304K [ 3.54M - 44.13M] 64-2K-9 256-512-9 512-256-9
FFT 2560K [ 3.93M - 48.90M] 64-2K-10 256-512-10 512-256-10
FFT 2816K [ 4.33M - 53.66M] 64-2K-11 256-512-11 512-256-11
FFT 3M [ 4.72M - 58.41M] 1K-256-6 64-2K-12 256-512-12 256-1K-6 512-256-12 512-512-6
FFT 3584K [ 5.51M - 67.87M] 1K-256-7 256-1K-7 512-512-7
FFT 4M [ 6.29M - 77.30M] 1K-2K 2K-1K 4K-512
FFT 4608K [ 7.08M - 86.70M] 1K-256-9 256-1K-9 512-512-9
FFT 5M [ 7.86M - 96.07M] 1K-256-10 256-1K-10 512-512-10
FFT 5632K [ 8.65M - 105.41M] 1K-256-11 256-1K-11 512-512-11
FFT 6M [ 9.44M - 114.74M] 1K-256-12 1K-512-6 256-1K-12 256-2K-6 512-512-12 512-1K-6 2K-256-6
FFT 7M [ 11.01M - 133.32M] 1K-512-7 256-2K-7 512-1K-7 2K-256-7
FFT 8M [ 12.58M - 151.83M] 2K-2K 4K-1K
FFT 9M [ 14.16M - 170.28M] 1K-512-9 256-2K-9 512-1K-9 2K-256-9
FFT 10M [ 15.73M - 188.68M] 1K-512-10 256-2K-10 512-1K-10 2K-256-10
FFT 11M [ 17.30M - 207.02M] 1K-512-11 256-2K-11 512-1K-11 2K-256-11
FFT 12M [ 18.87M - 225.32M] 1K-512-12 1K-1K-6 256-2K-12 512-1K-12 512-2K-6 2K-256-12 2K-512-6 4K-256-6
FFT 14M [ 22.02M - 261.80M] 1K-1K-7 512-2K-7 2K-512-7 4K-256-7
FFT 16M [ 25.17M - 298.13M] 4K-2K
FFT 18M [ 28.31M - 334.34M] 1K-1K-9 512-2K-9 2K-512-9 4K-256-9
FFT 20M [ 31.46M - 370.44M] 1K-1K-10 512-2K-10 2K-512-10 4K-256-10
FFT 22M [ 34.60M - 406.43M] 1K-1K-11 512-2K-11 2K-512-11 4K-256-11
FFT 24M [ 37.75M - 442.34M] 1K-1K-12 1K-2K-6 512-2K-12 2K-512-12 2K-1K-6 4K-256-12 4K-512-6
FFT 28M [ 44.04M - 513.91M] 1K-2K-7 2K-1K-7 4K-512-7
FFT 36M [ 56.62M - 656.22M] 1K-2K-9 2K-1K-9 4K-512-9
FFT 40M [ 62.91M - 727.03M] 1K-2K-10 2K-1K-10 4K-512-10
FFT 44M [ 69.21M - 797.64M] 1K-2K-11 2K-1K-11 4K-512-11
FFT 48M [ 75.50M - 868.07M] 1K-2K-12 2K-1K-12 2K-2K-6 4K-512-12 4K-1K-6
FFT 56M [ 88.08M - 1008.44M] 2K-2K-7 4K-1K-7
FFT 72M [113.25M - 1287.53M] 2K-2K-9 4K-1K-9
FFT 80M [125.83M - 1426.38M] 2K-2K-10 4K-1K-10
FFT 88M [138.41M - 1564.83M] 2K-2K-11 4K-1K-11
FFT 96M [150.99M - 1702.92M] 2K-2K-12 4K-1K-12 4K-2K-6
FFT 112M [176.16M - 1978.12M] 4K-2K-7
FFT 144M [226.49M - 2525.23M] 4K-2K-9
FFT 160M [251.66M - 2797.39M] 4K-2K-10
FFT 176M [276.82M - 3068.76M] 4K-2K-11
FFT 192M [301.99M - 3339.40M] 4K-2K-12
2019-09-14 11:10:17 Exiting because "help"
2019-09-14 11:10:17 Bye
C:\msys64\home\ken\gpuowl-compile\gw>gw
C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0
2019-09-14 11:19:48 gpuowl
2019-09-14 11:19:48 Note: no config.txt file found
2019-09-14 11:19:48 config: -device 0
2019-09-14 11:19:48 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:19:48 using short carry kernels
2019-09-14 11:19:55 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:19:55 OpenCL compilation error -11 (args -DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIG
HT_STEP=0xc.1551b6b1158dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -I. -cl-fast-relaxed-math -cl-std=CL2.0)
2019-09-14 11:19:55 C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: error: implicit declaration of function '__asm' is invalid in C99
X2(u[0], u[2]);
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:174:2: note: expanded from macro 'X2'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.x) : "v" (t.x), "v" (b.x)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: error: expected ')'
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:174:35: note: expanded from macro 'X2'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.x) : "v" (t.x), "v" (b.x)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: note: to match this '('
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:174:7: note: expanded from macro 'X2'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.x) : "v" (t.x), "v" (b.x)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: error: expected ')'
X2(u[0], u[2]);
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:175:35: note: expanded from macro 'X2'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.y) : "v" (t.y), "v" (b.y)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:197:3: note: to match this '('
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:175:7: note: expanded from macro 'X2'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (b.y) : "v" (t.y), "v" (b.y)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:198:3: error: expected ')'
X2_mul_t4(u[1], u[3]);
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:180:35: note: expanded from macro 'X2_mul_t4'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (t.x) : "v" (b.x), "v" (t.x)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:198:3: note: to match this '('
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:180:7: note: expanded from macro 'X2_mul_t4'
__asm( "v_add_f64 %0, %1, -%2\n" : "=v" (t.x) : "v" (b.x), "v" (t.x)); \
^
C:\Users\ken\AppData\Local\Temp\\OCL9192T1.cl:1982019-09-14 11:19:55 Exception 9gpu_error: BUILD_PROGRAM_FAILURE clBuildProgram at clwrap.cpp:215 build
2019-09-14 11:19:55 Bye
C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -use ORIG_X2
2019-09-14 11:20:30 gpuowl
2019-09-14 11:20:30 Note: no config.txt file found
2019-09-14 11:20:30 config: -device 0 -use ORIG_X2
2019-09-14 11:20:30 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:20:30 using short carry kernels
2019-09-14 11:20:35 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:20:39 OpenCL compilation in 3389 ms
2019-09-14 11:20:40 87005279.owl not found, starting from the beginning.
2019-09-14 11:21:08 87005279 OK 2000 0.00%; 6501 us/sq; ETA 6d 13:06; e944fcb41cb63c80 (check 6.71s)
2019-09-14 11:23:06 87005279 20000 0.02%; 6557 us/sq; ETA 6d 14:26; 77e12e401949f647
2019-09-14 11:25:17 87005279 40000 0.05%; 6549 us/sq; ETA 6d 14:13; 3ccb222b85a3780d
2019-09-14 11:26:42 Stopping, please wait..
2019-09-14 11:26:49 87005279 OK 53000 0.06%; 6579 us/sq; ETA 6d 14:54; 4a2c9b719dd7f2c1 (check 6.74s)
2019-09-14 11:26:49 Exiting because "stop requested"
2019-09-14 11:26:49 Bye
Terminate batch job (Y/N)? y
C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -use ORIG_X2 -time
2019-09-14 11:27:09 gpuowl
2019-09-14 11:27:09 Note: no config.txt file found
2019-09-14 11:27:09 config: -device 0 -use ORIG_X2 -time
2019-09-14 11:27:09 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:27:09 using short carry kernels
2019-09-14 11:27:16 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:27:19 OpenCL compilation in 3207 ms
2019-09-14 11:27:20 87005279.owl loaded: k 53000, block 1000, res64 4a2c9b719dd7f2c1
2019-09-14 11:28:27 87005279 OK 55000 0.06%; 13095 us/sq; ETA 13d 04:17; e5617f81e2a4387a (check 25.20s)
2019-09-14 11:28:27 32.25% fftMiddleIn : 5051 us/call x 4259 calls
2019-09-14 11:28:27 18.56% carryFused : 3201 us/call x 3869 calls
2019-09-14 11:28:27 17.15% tailFused : 2860 us/call x 3999 calls
2019-09-14 11:28:27 14.56% fftMiddleOut : 2352 us/call x 4129 calls
2019-09-14 11:28:27 13.64% transposeH : 2203 us/call x 4129 calls
2019-09-14 11:28:27 0.93% fftH : 1585 us/call x 390 calls
2019-09-14 11:28:27 0.88% fftP : 1503 us/call x 390 calls
2019-09-14 11:28:27 0.67% carryA : 1725 us/call x 258 calls
2019-09-14 11:28:27 0.61% fftW : 1569 us/call x 260 calls
2019-09-14 11:28:27 0.36% multiply : 1862 us/call x 130 calls
2019-09-14 11:28:27 0.36% carryB : 915 us/call x 260 calls
2019-09-14 11:28:27
2019-09-14 11:30:20 87005279 60000 0.07%; 22484 us/sq; ETA 22d 15:02; 6d81443958902b6b
2019-09-14 11:30:20 28.82% fftMiddleIn : 6456 us/call x 5010 calls
2019-09-14 11:30:20 19.99% carryFused : 4490 us/call x 4995 calls
2019-09-14 11:30:20 19.00% tailFused : 4263 us/call x 5000 calls
2019-09-14 11:30:20 16.22% fftMiddleOut : 3636 us/call x 5005 calls
2019-09-14 11:30:20 15.80% transposeH : 3542 us/call x 5005 calls
2019-09-14 11:30:20 0.05% fftH : 3400 us/call x 15 calls
2019-09-14 11:30:20 0.03% fftP : 2533 us/call x 15 calls
2019-09-14 11:30:20 0.03% carryB : 3800 us/call x 10 calls
2019-09-14 11:30:20 0.03% fftW : 3600 us/call x 10 calls
2019-09-14 11:30:20 0.02% carryA : 2200 us/call x 10 calls
2019-09-14 11:30:20 0.01% multiply : 3200 us/call x 5 calls
2019-09-14 11:30:20
2019-09-14 11:31:57 Stopping, please wait..
2019-09-14 11:32:17 87005279 OK 64000 0.07%; 24296 us/sq; ETA 24d 10:45; a5a4adb2509d792a (check 20.19s)
2019-09-14 11:32:17 29.29% fftMiddleIn : 6837 us/call x 5008 calls
2019-09-14 11:32:17 19.30% carryFused : 4517 us/call x 4995 calls
2019-09-14 11:32:17 18.58% tailFused : 4344 us/call x 5000 calls
2019-09-14 11:32:17 17.53% fftMiddleOut : 4096 us/call x 5004 calls
2019-09-14 11:32:17 15.22% transposeH : 3555 us/call x 5004 calls
2019-09-14 11:32:17 0.02% carryB : 2844 us/call x 9 calls
2019-09-14 11:32:17 0.02% multiply : 4650 us/call x 4 calls
2019-09-14 11:32:17 0.01% carryA : 1875 us/call x 8 calls
2019-09-14 11:32:17 0.01% fftP : 1000 us/call x 13 calls
2019-09-14 11:32:17 0.01% fftH : 1083 us/call x 12 calls
2019-09-14 11:32:17
2019-09-14 11:32:17 Exiting because "stop requested"
2019-09-14 11:32:17 Bye
Terminate batch job (Y/N)? y
C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -carry short -use ORIG_X2 -time
2019-09-14 11:45:24 gpuowl
2019-09-14 11:45:24 Note: no config.txt file found
2019-09-14 11:45:24 config: -device 0 -carry short -use ORIG_X2 -time
2019-09-14 11:45:24 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:45:24 using short carry kernels
2019-09-14 11:45:31 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:45:34 OpenCL compilation in 3229 ms
2019-09-14 11:45:35 87005279.owl loaded: k 64000, block 1000, res64 a5a4adb2509d792a
2019-09-14 11:47:05 87005279 OK 66000 0.08%; 19765 us/sq; ETA 19d 21:19; cd6fbb4ea4a33c97 (check 24.90s)
2019-09-14 11:47:05 28.97% fftMiddleIn : 6073 us/call x 4259 calls
2019-09-14 11:47:05 17.84% tailFused : 3983 us/call x 3999 calls
2019-09-14 11:47:05 17.09% carryFused : 3943 us/call x 3869 calls
2019-09-14 11:47:05 15.32% fftMiddleOut : 3313 us/call x 4129 calls
2019-09-14 11:47:05 15.15% transposeH : 3276 us/call x 4129 calls
2019-09-14 11:47:05 1.40% fftH : 3200 us/call x 390 calls
2019-09-14 11:47:05 1.26% fftP : 2880 us/call x 390 calls
2019-09-14 11:47:05 1.01% carryA : 3507 us/call x 258 calls
2019-09-14 11:47:05 0.87% fftW : 3000 us/call x 260 calls
2019-09-14 11:47:05 0.70% carryB : 2400 us/call x 260 calls
2019-09-14 11:47:05 0.37% multiply : 2520 us/call x 130 calls
2019-09-14 11:47:05 0.03% carryM : 15600 us/call x 2 calls
2019-09-14 11:47:05
2019-09-14 11:48:19 Stopping, please wait..
2019-09-14 11:48:31 87005279 OK 69000 0.08%; 24611 us/sq; ETA 24d 18:21; 80ce9777c6f885e9 (check 12.89s)
2019-09-14 11:48:31 31.37% fftMiddleIn : 6756 us/call x 4006 calls
2019-09-14 11:48:31 18.90% carryFused : 4080 us/call x 3996 calls
2019-09-14 11:48:31 17.00% tailFused : 3666 us/call x 4000 calls
2019-09-14 11:48:31 16.31% transposeH : 3515 us/call x 4003 calls
2019-09-14 11:48:31 16.20% fftMiddleOut : 3492 us/call x 4003 calls
2019-09-14 11:48:31 0.07% fftP : 6240 us/call x 10 calls
2019-09-14 11:48:31 0.05% fftW : 6686 us/call x 7 calls
2019-09-14 11:48:31 0.04% fftH : 3467 us/call x 9 calls
2019-09-14 11:48:32 0.02% carryB : 2229 us/call x 7 calls
2019-09-14 11:48:32 0.02% multiply : 5200 us/call x 3 calls
2019-09-14 11:48:32 0.02% isEqual : 15600 us/call x 1 calls
2019-09-14 11:48:32
2019-09-14 11:48:32 Exiting because "stop requested"
2019-09-14 11:48:32 Bye
Terminate batch job (Y/N)? n
C:\msys64\home\ken\gpuowl-compile\gw>gw
C:\msys64\home\ken\gpuowl-compile\gw>gpuowl-win -device 0 -carry short -use ORIG_X2
2019-09-14 11:48:40 gpuowl
2019-09-14 11:48:40 Note: no config.txt file found
2019-09-14 11:48:40 config: -device 0 -carry short -use ORIG_X2
2019-09-14 11:48:40 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word
2019-09-14 11:48:40 using short carry kernels
2019-09-14 11:48:48 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115
8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 11:48:51 OpenCL compilation in 3276 ms
2019-09-14 11:48:52 87005279.owl loaded: k 69000, block 1000, res64 80ce9777c6f885e9
2019-09-14 11:49:20 87005279 OK 71000 0.08%; 6497 us/sq; ETA 6d 12:54; cb6cb22058171054 (check 6.72s)
2019-09-14 11:50:19 87005279 80000 0.09%; 6501 us/sq; ETA 6d 12:59; e989bcf6f98d3c02
2019-09-14 11:52:30 87005279 100000 0.11%; 6550 us/sq; ETA 6d 14:07; 4ba1f423b8c71b64
2019-09-14 11:54:41 87005279 120000 0.14%; 6552 us/sq; ETA 6d 14:07; 74525140cca3e28c
2019-09-14 11:56:26 Stopping, please wait..
2019-09-14 11:56:33 87005279 OK 136000 0.16%; 6564 us/sq; ETA 6d 14:24; 32900173c562435a (check 6.75s)
2019-09-14 11:56:33 Exiting because "stop requested"
2019-09-14 11:56:33 Bye
Code:
>gpuowl-win -device 0 -carry short -fft +0 -use ORIG_X2 2019-09-14 11:33:44 gpuowl v6.5-76-g1ca08e2-dirty 2019-09-14 11:33:44 Note: no config.txt file found 2019-09-14 11:33:44 config: -device 0 -carry short -fft +0 -use ORIG_X2 2019-09-14 11:33:44 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word 2019-09-14 11:33:44 using short carry kernels 2019-09-14 11:33:46 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115 8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-09-14 11:33:49 OpenCL compilation in 3057 ms 2019-09-14 11:33:50 87005279.owl not found, starting from the beginning. 2019-09-14 11:34:08 87005279 OK 2000 0.00%; 4.108 ms/sq; ETA 4d 03:16; e944fcb41cb63c80 (check 4.35s) 2019-09-14 11:35:22 87005279 20000 0.02%; 4.157 ms/sq; ETA 4d 04:27; 77e12e401949f647 2019-09-14 11:36:45 87005279 40000 0.05%; 4.147 ms/sq; ETA 4d 04:10; 3ccb222b85a3780d 2019-09-14 11:37:31 Stopping, please wait.. 2019-09-14 11:37:36 87005279 OK 51000 0.06%; 4.124 ms/sq; ETA 4d 03:36; 7b72f5d50e454610 (check 4.88s) 2019-09-14 11:37:36 Exiting because "stop requested" 2019-09-14 11:37:36 Bye C:\msys64\home\ken\gpuowl-compile\v6.5-latest\gpuowl>gpuowl-win -device 0 -carry short -fft +0 -use ORIG_X2 -time 2019-09-14 11:38:07 gpuowl v6.5-76-g1ca08e2-dirty 2019-09-14 11:38:07 Note: no config.txt file found 2019-09-14 11:38:07 config: -device 0 -carry short -fft +0 -use ORIG_X2 -time 2019-09-14 11:38:07 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word 2019-09-14 11:38:07 using short carry kernels 2019-09-14 11:38:15 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc.1551b6b115 8dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-09-14 11:38:18 OpenCL compilation in 3151 ms 2019-09-14 11:38:19 87005279.owl loaded: k 51000, block 1000, res64 7b72f5d50e454610 2019-09-14 11:40:08 87005279 OK 53000 0.06%; 25.631 ms/sq; ETA 25d 19:04; 4a2c9b719dd7f2c1 (check 21.59s) 2019-09-14 11:40:08 16.79% carryFused : 4709 us/call x 3869 calls 2019-09-14 11:40:08 16.17% tailFused : 4389 us/call x 3999 calls 2019-09-14 11:40:08 15.41% fftMiddleIn : 3927 us/call x 4259 calls 2019-09-14 11:40:08 15.33% transposeW : 3908 us/call x 4259 calls 2019-09-14 11:40:08 15.31% transposeH : 4024 us/call x 4129 calls 2019-09-14 11:40:08 14.65% fftMiddleOut : 3850 us/call x 4129 calls 2019-09-14 11:40:08 1.58% fftH : 4400 us/call x 390 calls 2019-09-14 11:40:08 1.42% fftP : 3960 us/call x 390 calls 2019-09-14 11:40:08 1.06% carryB : 4440 us/call x 260 calls 2019-09-14 11:40:08 0.91% carryA : 3809 us/call x 258 calls 2019-09-14 11:40:08 0.85% fftW : 3540 us/call x 260 calls 2019-09-14 11:40:08 0.53% multiply : 4440 us/call x 130 calls 2019-09-14 11:40:08 2019-09-14 11:42:47 87005279 60000 0.07%; 22.751 ms/sq; ETA 22d 21:28; 6d81443958902b6b 2019-09-14 11:42:47 19.17% carryFused : 4359 us/call x 6993 calls 2019-09-14 11:42:48 17.20% tailFused : 3909 us/call x 7000 calls 2019-09-14 11:42:48 16.18% transposeH : 3673 us/call x 7007 calls 2019-09-14 11:42:48 16.15% transposeW : 3663 us/call x 7014 calls 2019-09-14 11:42:48 16.11% fftMiddleIn : 3652 us/call x 7014 calls 2019-09-14 11:42:48 14.98% fftMiddleOut : 3400 us/call x 7007 calls 2019-09-14 11:42:48 0.06% fftP : 4457 us/call x 21 calls 2019-09-14 11:42:48 0.04% fftW : 4457 us/call x 14 calls 2019-09-14 11:42:48 0.04% fftH : 2971 us/call x 21 calls 2019-09-14 11:42:48 0.04% carryA : 4457 us/call x 14 calls 2019-09-14 11:42:48 0.02% multiply : 4457 us/call x 7 calls 2019-09-14 11:42:48 2019-09-14 11:43:01 Stopping, please wait.. 2019-09-14 11:43:24 87005279 OK 61000 0.07%; 13.993 ms/sq; ETA 14d 01:57; be2af92c309064ef (check 22.32s) 2019-09-14 11:43:24 21.11% carryFused : 3795 us/call x 1998 calls 2019-09-14 11:43:24 17.03% tailFused : 3058 us/call x 2000 calls 2019-09-14 11:43:24 16.12% fftMiddleOut : 2892 us/call x 2001 calls 2019-09-14 11:43:24 15.86% fftMiddleIn : 2844 us/call x 2002 calls 2019-09-14 11:43:24 14.99% transposeW : 2688 us/call x 2002 calls 2019-09-14 11:43:24 14.51% transposeH : 2604 us/call x 2001 calls 2019-09-14 11:43:24 0.09% fftP : 7800 us/call x 4 calls 2019-09-14 11:43:24 0.09% fftH : 10400 us/call x 3 calls 2019-09-14 11:43:24 0.04% fftW : 5200 us/call x 3 calls 2019-09-14 11:43:24 0.04% carryM : 15600 us/call x 1 calls 2019-09-14 11:43:24 0.04% transposeIn : 15600 us/call x 1 calls 2019-09-14 11:43:24 0.04% readResidue : 15600 us/call x 1 calls 2019-09-14 11:43:24 0.04% isNotZero : 15600 us/call x 1 calls 2019-09-14 11:43:24 2019-09-14 11:43:24 Exiting because "stop requested" 2019-09-14 11:43:24 Bye Code:
>gpuowl-win -device 0 -carry short -use ORIG_X2 2019-09-14 11:47:06 gpuowl 2019-09-14 11:47:06 Note: no config.txt file found 2019-09-14 11:47:06 config: -device 0 -carry short -use ORIG_X2 2019-09-14 11:47:06 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word 2019-09-14 11:47:06 using short carry kernels 2019-09-14 11:47:06 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc. 1551b6b1158dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-st d=CL2.0" 2019-09-14 11:47:10 2019-09-14 11:47:10 OpenCL compilation in 3474 ms 2019-09-14 11:47:11 87005279.owl not found, starting from the beginning. 2019-09-14 11:47:27 87005279 OK 2000 0.00%; 3483 us/sq; ETA 3d 12:10; e944fcb41cb63c80 (check 3.89s) 2019-09-14 11:48:30 87005279 20000 0.02%; 3522 us/sq; ETA 3d 13:06; 77e12e401949f647 2019-09-14 11:49:41 87005279 40000 0.05%; 3557 us/sq; ETA 3d 13:55; 3ccb222b85a3780d 2019-09-14 11:50:10 Stopping, please wait.. 2019-09-14 11:50:14 87005279 OK 48000 0.06%; 3573 us/sq; ETA 3d 14:18; a316078024d009b0 (check 3.97s) 2019-09-14 11:50:14 Exiting because "stop requested" 2019-09-14 11:50:14 Bye Code:
>gpuowl-win -device 0 -use ORIG_X2 -maxAlloc 10240 -user kriesel -cpu dodo-gtx1080ti 2019-09-14 11:51:36 gpuowl v6.7-4-g278407a 2019-09-14 11:51:36 Note: no config.txt file found 2019-09-14 11:51:36 config: -device 0 -use ORIG_X2 -maxAlloc 10240 -user kriesel -cpu dodo-gtx1080ti 2019-09-14 11:51:36 87005279 FFT 5120K: Width 256x4, Height 64x4, Middle 10; 16.59 bits/word 2019-09-14 11:51:36 using short carry kernels 2019-09-14 11:51:36 OpenCL args "-DEXP=87005279u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DWEIGHT_STEP=0xa.97d8cd06772f8p-3 -DIWEIGHT_STEP=0xc. 1551b6b1158dp-4 -DWEIGHT_BIGSTEP=0x9.837f0518db8a8p-3 -DIWEIGHT_BIGSTEP=0xd.744fccad69d68p-4 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-09-14 11:51:40 2019-09-14 11:51:40 OpenCL compilation in 3650 ms 2019-09-14 11:51:41 87005279.owl not found, starting from the beginning. 2019-09-14 11:51:49 87005279 OK 1000 0.00%; 3400 us/sq; ETA 3d 10:10; 00fdfddc9aeaa71f (check 2.09s) 2019-09-14 11:54:38 87005279 50000 0.06%; 3438 us/sq; ETA 3d 11:03; d3c2d8af5e987770 2019-09-14 11:57:32 87005279 100000 0.11%; 3478 us/sq; ETA 3d 11:58; 4ba1f423b8c71b64 2019-09-14 12:00:27 87005279 150000 0.17%; 3503 us/sq; ETA 3d 12:30; 229fc24f15398a56 2019-09-14 12:03:22 87005279 200000 0.23%; 3507 us/sq; ETA 3d 12:34; 75fc31e283600e79 2019-09-14 12:06:20 87005279 OK 250000 0.29%; 3506 us/sq; ETA 3d 12:30; 2d95d14b64b3f424 (check 2.11s) 2019-09-14 12:09:15 87005279 300000 0.34%; 3509 us/sq; ETA 3d 12:30; 543c72d2989ffcac 2019-09-14 12:12:11 87005279 350000 0.40%; 3510 us/sq; ETA 3d 12:30; 0e1f3273842b2f55 |
|
|
|
|
|
|
#1368 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
11101011011112 Posts |
So it is slower. Thanks for the data.
Oddly the -time option shows my variant spending less time in fftMiddleIn than the production version spends in TransposeW + fftMiddleIn. So -time says it should be faster but the wall clock shows it isn't. Back to the drawing board. |
|
|
|
|
|
#1369 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
|
|
|
|
|
|
#1370 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Be careful what you ask for?
Code:
>gpuowl-win -h
2019-09-14 13:24:58 gpuowl v6.10-0-gc1d0025
Command line options:
-dir <folder> : specify work directory (containing worktodo.txt, results.txt, config.txt, gpuowl.log)
-user <name> : specify the user name.
-cpu <name> : specify the hardware name.
-time : display kernel profiling information.
-fft <size> : specify FFT size, such as: 5000K, 4M, +2, -1.
-block <value> : PRP GEC block size. Default 500. Smaller block is slower but detects errors sooner.
-log <step> : log every <step> iterations, default 50000. Multiple of 10000.
-carry long|short : force carry type. Short carry may be faster, but requires high bits/word.
-B1 : P-1 B1 bound, default 500000
-B2 : P-1 B2 bound, default B1 * 30
-rB2 : ratio of B2 to B1. Default 30, used only if B2 is not explicitly set
-prp <exponent> : run a single PRP test and exit, ignoring worktodo.txt
-pm1 <exponent> : run a single P-1 test and exit, ignoring worktodo.txt
-results <file> : name of results file, default 'results.txt'
-iters <N> : run next PRP test for <N> iterations and exit. Multiple of 10000.
-maxAlloc : limit GPU memory usage to this value in MB
-use NEW_FFT8,OLD_FFT5,NEW_FFT10: comma separated list of defines, see the #if tests in gpuowl.cl (used for perf tuning).
-device <N> : select a specific device:
0 : Ellesmere-36@1266-28:00.0 Radeon (TM) RX 480 Graphics
1 : gfx804-8@1203-03:00.0 Radeon 550 Series
FFT Configurations:
FFT 8K [ 0.01M - 0.17M] 64-64
FFT 32K [ 0.05M - 0.68M] 64-256 256-64
FFT 64K [ 0.10M - 1.33M] 64-512 512-64
FFT 128K [ 0.20M - 2.62M] 1K-64 64-1K 256-256
FFT 192K [ 0.29M - 3.89M] 64-256-6
FFT 224K [ 0.34M - 4.52M] 64-256-7
FFT 256K [ 0.39M - 5.15M] 64-2K 256-512 512-256 2K-64
FFT 288K [ 0.44M - 5.77M] 64-256-9
FFT 320K [ 0.49M - 6.40M] 64-256-10
FFT 352K [ 0.54M - 7.02M] 64-256-11
FFT 384K [ 0.59M - 7.64M] 64-256-12 64-512-6
FFT 448K [ 0.69M - 8.88M] 64-512-7
FFT 512K [ 0.79M - 10.12M] 1K-256 256-1K 512-512 4K-64
FFT 576K [ 0.88M - 11.35M] 64-512-9
FFT 640K [ 0.98M - 12.58M] 64-512-10
FFT 704K [ 1.08M - 13.81M] 64-512-11
FFT 768K [ 1.18M - 15.03M] 64-512-12 64-1K-6 256-256-6
FFT 896K [ 1.38M - 17.47M] 64-1K-7 256-256-7
FFT 1M [ 1.57M - 19.89M] 1K-512 256-2K 512-1K 2K-256
FFT 1152K [ 1.77M - 22.32M] 64-1K-9 256-256-9
FFT 1280K [ 1.97M - 24.73M] 64-1K-10 256-256-10
FFT 1408K [ 2.16M - 27.14M] 64-1K-11 256-256-11
FFT 1536K [ 2.36M - 29.54M] 64-1K-12 64-2K-6 256-256-12 256-512-6 512-256-6
FFT 1792K [ 2.75M - 34.33M] 64-2K-7 256-512-7 512-256-7
FFT 2M [ 3.15M - 39.10M] 1K-1K 512-2K 2K-512 4K-256
FFT 2304K [ 3.54M - 43.85M] 64-2K-9 256-512-9 512-256-9
FFT 2560K [ 3.93M - 48.59M] 64-2K-10 256-512-10 512-256-10
FFT 2816K [ 4.33M - 53.32M] 64-2K-11 256-512-11 512-256-11
FFT 3M [ 4.72M - 58.04M] 1K-256-6 64-2K-12 256-512-12 256-1K-6 512-256-12 512-512-6
FFT 3584K [ 5.51M - 67.44M] 1K-256-7 256-1K-7 512-512-7
FFT 4M [ 6.29M - 76.81M] 1K-2K 2K-1K 4K-512
FFT 4608K [ 7.08M - 86.15M] 1K-256-9 256-1K-9 512-512-9
FFT 5M [ 7.86M - 95.46M] 1K-256-10 256-1K-10 512-512-10
FFT 5632K [ 8.65M - 104.74M] 1K-256-11 256-1K-11 512-512-11
FFT 6M [ 9.44M - 114.00M] 1K-256-12 1K-512-6 256-1K-12 256-2K-6 512-512-12 512-1K-6 2K-256-6
FFT 7M [ 11.01M - 132.46M] 1K-512-7 256-2K-7 512-1K-7 2K-256-7
FFT 8M [ 12.58M - 150.85M] 2K-2K 4K-1K
FFT 9M [ 14.16M - 169.18M] 1K-512-9 256-2K-9 512-1K-9 2K-256-9
FFT 10M [ 15.73M - 187.45M] 1K-512-10 256-2K-10 512-1K-10 2K-256-10
FFT 11M [ 17.30M - 205.67M] 1K-512-11 256-2K-11 512-1K-11 2K-256-11
FFT 12M [ 18.87M - 223.85M] 1K-512-12 1K-1K-6 256-2K-12 512-1K-12 512-2K-6 2K-256-12 2K-512-6 4K-256-6
FFT 14M [ 22.02M - 260.08M] 1K-1K-7 512-2K-7 2K-512-7 4K-256-7
FFT 16M [ 25.17M - 296.17M] 4K-2K
FFT 18M [ 28.31M - 332.13M] 1K-1K-9 512-2K-9 2K-512-9 4K-256-9
FFT 20M [ 31.46M - 367.98M] 1K-1K-10 512-2K-10 2K-512-10 4K-256-10
FFT 22M [ 34.60M - 403.74M] 1K-1K-11 512-2K-11 2K-512-11 4K-256-11
FFT 24M [ 37.75M - 439.40M] 1K-1K-12 1K-2K-6 512-2K-12 2K-512-12 2K-1K-6 4K-256-12 4K-512-6
FFT 28M [ 44.04M - 510.47M] 1K-2K-7 2K-1K-7 4K-512-7
FFT 36M [ 56.62M - 651.81M] 1K-2K-9 2K-1K-9 4K-512-9
FFT 40M [ 62.91M - 722.13M] 1K-2K-10 2K-1K-10 4K-512-10
FFT 44M [ 69.21M - 792.25M] 1K-2K-11 2K-1K-11 4K-512-11
FFT 48M [ 75.50M - 862.18M] 1K-2K-12 2K-1K-12 2K-2K-6 4K-512-12 4K-1K-6
FFT 56M [ 88.08M - 1001.57M] 2K-2K-7 4K-1K-7
FFT 72M [113.25M - 1278.70M] 2K-2K-9 4K-1K-9
FFT 80M [125.83M - 1416.57M] 2K-2K-10 4K-1K-10
FFT 88M [138.41M - 1554.04M] 2K-2K-11 4K-1K-11
FFT 96M [150.99M - 1691.15M] 2K-2K-12 4K-1K-12 4K-2K-6
FFT 112M [176.16M - 1964.39M] 4K-2K-7
FFT 144M [226.49M - 2507.57M] 4K-2K-9
FFT 160M [251.66M - 2777.78M] 4K-2K-10
FFT 176M [276.82M - 3047.18M] 4K-2K-11
FFT 192M [301.99M - 3315.86M] 4K-2K-12
2019-09-14 13:25:02 Exiting because "help"
2019-09-14 13:25:02 Bye
C:\msys64\home\ken\gpuowl-compile\v6.10-0-gc1d0025>gpuowl-win -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-14 13:38:16 gpuowl v6.10-0-gc1d0025
2019-09-14 13:38:16 Note: no config.txt file found
2019-09-14 13:38:16 config: -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-14 13:38:16 24000577 FFT 1280K: Width 8x8, Height 256x4, Middle 10; 18.31 bits/word
2019-09-14 13:38:16 using short carry kernels
2019-09-14 13:38:21 OpenCL args "-DEXP=24000577u -DWIDTH=64u -DSMALL_HEIGHT=1024u -DMIDDLE=10u -DWEIGHT_STEP=0xc.e5beac96a0b88p-3 -DIWEIGHT_STEP=0x9.eca8ba4660a
fp-4 -DWEIGHT_BIGSTEP=0xe.ac0c6e7dd2438p-3 -DIWEIGHT_BIGSTEP=0x8.b95c1e3ea8bd8p-4 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 13:38:24 OpenCL compilation in 3712 ms
2019-09-14 13:38:25 24000577 P1 B1=220000, B2=3960000, stage1 317550 bits
2019-09-14 13:38:54 24000577 P1 10000 3.15%; 2886 us/sq; ETA 0d 00:15; 7f995dc7dff7f8e0
2019-09-14 13:39:23 24000577 P1 20000 6.30%; 2885 us/sq; ETA 0d 00:14; f705474c0ac30c16
2019-09-14 13:39:52 24000577 P1 30000 9.45%; 2910 us/sq; ETA 0d 00:14; 3fc336b60ee971a2
2019-09-14 13:40:21 24000577 P1 40000 12.60%; 2890 us/sq; ETA 0d 00:13; 87c9fcec37cd0a71
2019-09-14 13:40:50 24000577 P1 50000 15.75%; 2885 us/sq; ETA 0d 00:13; f64948f68fb1d67b
2019-09-14 13:41:19 24000577 P1 60000 18.89%; 2894 us/sq; ETA 0d 00:12; c37c2d473cb0ea06
2019-09-14 13:41:48 24000577 P1 70000 22.04%; 2885 us/sq; ETA 0d 00:12; 5bd384b917eabb12
2019-09-14 13:42:17 24000577 P1 80000 25.19%; 2899 us/sq; ETA 0d 00:11; 91ea4d5d92dc1c29
2019-09-14 13:42:46 24000577 P1 90000 28.34%; 2904 us/sq; ETA 0d 00:11; 9c85386920ff8b45
2019-09-14 13:43:15 24000577 P1 100000 31.49%; 2898 us/sq; ETA 0d 00:11; 438848c849a426c8
2019-09-14 13:43:44 24000577 P1 110000 34.64%; 2898 us/sq; ETA 0d 00:10; 495bc594a2150ed6
2019-09-14 13:44:13 24000577 P1 120000 37.79%; 2885 us/sq; ETA 0d 00:09; 1bd1712dcb680f0d
2019-09-14 13:44:42 24000577 P1 130000 40.94%; 2898 us/sq; ETA 0d 00:09; d03e2db3fd19c843
2019-09-14 13:45:11 24000577 P1 140000 44.09%; 2891 us/sq; ETA 0d 00:09; 9fc5fa31b4959aed
2019-09-14 13:45:40 24000577 P1 150000 47.24%; 2891 us/sq; ETA 0d 00:08; ae6304c818c1f83e
2019-09-14 13:46:08 24000577 P1 160000 50.39%; 2883 us/sq; ETA 0d 00:08; fe8f0bada295328d
2019-09-14 13:46:37 24000577 P1 170000 53.53%; 2890 us/sq; ETA 0d 00:07; 3fd5a4ddb6841e9b
2019-09-14 13:47:07 24000577 P1 180000 56.68%; 2899 us/sq; ETA 0d 00:07; a6234de954685799
2019-09-14 13:47:35 24000577 P1 190000 59.83%; 2894 us/sq; ETA 0d 00:06; c873c91deeefba27
2019-09-14 13:48:04 24000577 P1 200000 62.98%; 2893 us/sq; ETA 0d 00:06; eb92d0b622962612
2019-09-14 13:48:34 24000577 P1 210000 66.13%; 2901 us/sq; ETA 0d 00:05; a64dbff6290ed34a
2019-09-14 13:49:03 24000577 P1 220000 69.28%; 2891 us/sq; ETA 0d 00:05; 7f49b2efd2a795fe
2019-09-14 13:49:32 24000577 P1 230000 72.43%; 2893 us/sq; ETA 0d 00:04; 9884971a1fc42886
2019-09-14 13:50:00 24000577 P1 240000 75.58%; 2893 us/sq; ETA 0d 00:04; ba30a7d0f33bde93
2019-09-14 13:50:30 24000577 P1 250000 78.73%; 2898 us/sq; ETA 0d 00:03; bb8984fecf1af62a
2019-09-14 13:50:58 24000577 P1 260000 81.88%; 2891 us/sq; ETA 0d 00:03; efb3c97f53545dbb
2019-09-14 13:51:28 24000577 P1 270000 85.03%; 2901 us/sq; ETA 0d 00:02; 405373760718e67c
2019-09-14 13:51:57 24000577 P1 280000 88.18%; 2894 us/sq; ETA 0d 00:02; a612ab69e780c283
2019-09-14 13:52:25 24000577 P1 290000 91.32%; 2890 us/sq; ETA 0d 00:01; 740645b16c6380fe
2019-09-14 13:52:55 24000577 P1 300000 94.47%; 2894 us/sq; ETA 0d 00:01; 50ed1a7837d59607
2019-09-14 13:53:24 24000577 P1 310000 97.62%; 2910 us/sq; ETA 0d 00:00; 21a38a3fd1fa6582
2019-09-14 13:53:46 24000577 P1 317550 100.00%; 2893 us/sq; ETA 0d 00:00; 7acca8667b4d2492
2019-09-14 13:53:46 P-1 (B1=220000, B2=3960000, D=30030): primes 260946, expanded 262000, doubles 47491 (left 166492), singles 165964, total 213455 (82%)
2019-09-14 13:53:46 24000577 P2 using blocks [7 - 132] to cover 213455 primes
2019-09-14 13:53:46 24000577 P2 using 770 buffers of 10.0 MB each
2019-09-14 13:56:49 24000577 P2 770/2880: setup 11809 ms; 3029 us/prime, 56682 primes
2019-09-14 13:56:49 24000577 P1 GCD: no factor
2019-09-14 13:59:54 24000577 P2 1540/2880: setup 11793 ms; 3028 us/prime, 57130 primes
2019-09-14 14:02:59 24000577 P2 2310/2880: setup 11856 ms; 3030 us/prime, 57186 primes
2019-09-14 14:05:17 24000577 P2 2880/2880: setup 8720 ms; 3036 us/prime, 42457 primes
2019-09-14 14:05:17 1257787 FFT 64K: Width 8x8, Height 64x8; 19.19 bits/word
2019-09-14 14:05:17 using short carry kernels
2019-09-14 14:05:17 OpenCL args "-DEXP=1257787u -DWIDTH=64u -DSMALL_HEIGHT=512u -DMIDDLE=1u -DWEIGHT_STEP=0xe.00d75658c47c8p-3 -DIWEIGHT_STEP=0x9.2405b0b5f2d88p
-4 -DWEIGHT_BIGSTEP=0xc.5672a115506d8p-3 -DIWEIGHT_BIGSTEP=0xa.5fed6a9b15138p-4 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-14 14:05:21 OpenCL compilation in 3634 ms
2019-09-14 14:05:21 C:\msys64\home\ken\gpuowl-compile\v6.10-0-gc1d0025\1257787\1257787.owl not found
2019-09-14 14:05:21 C:\msys64\home\ken\gpuowl-compile\v6.10-0-gc1d0025\1257787\1257787-old.owl not found
2019-09-14 14:05:21 starting from the beginning.
2019-09-14 14:05:21 1257787 OK 1000 0.08%; 202 us/sq; ETA 0d 00:04; 91d0e6e562cb2541 (check 0.11s)
2019-09-14 14:05:32 1257787 50000 3.97%; 212 us/sq; ETA 0d 00:04; d7ea0488d047e5e4
2019-09-14 14:05:34 24000577 P2 GCD: 13504596665207
2019-09-14 14:05:34 {"exponent":"24000577", "worktype":"PM1", "status":"F", "program":{"name":"gpuowl", "version":"v6.10-0-gc1d0025"}, "timestamp":"2019-09-14 1
9:05:34 UTC", "user":"kriesel", "computer":"condorella/rx480", "aid":"0", "fft-length":1310720, "B1":220000, "B2":3960000, "factors":["13504596665207"]}
2019-09-14 14:05:42 1257787 100000 7.95%; 216 us/sq; ETA 0d 00:04; 09f25999ff3326ca
2019-09-14 14:05:53 1257787 150000 11.92%; 214 us/sq; ETA 0d 00:04; 367d63ab9a7b46d5
2019-09-14 14:06:04 1257787 200000 15.90%; 215 us/sq; ETA 0d 00:04; 25ebe34e39ca647b
2019-09-14 14:06:15 1257787 OK 250000 19.87%; 215 us/sq; ETA 0d 00:04; 564fdae0bb5a37b1 (check 0.12s)
2019-09-14 14:06:26 1257787 300000 23.85%; 215 us/sq; ETA 0d 00:03; 79b4d6cb0169a9b0
2019-09-14 14:06:36 1257787 350000 27.82%; 217 us/sq; ETA 0d 00:03; 0b9b51c4f7638fd3
2019-09-14 14:06:47 1257787 400000 31.80%; 216 us/sq; ETA 0d 00:03; fe2bfeea5734dd7c
2019-09-14 14:06:58 1257787 450000 35.77%; 216 us/sq; ETA 0d 00:03; 16fa53053e566011
2019-09-14 14:07:09 1257787 OK 500000 39.75%; 215 us/sq; ETA 0d 00:03; 7838f365c8c78d0c (check 0.14s)
terminate called after throwing an instance of 'std::invalid_argument'
what(): stoi
This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.
Code:
Problem signature: Problem Event Name: APPCRASH Application Name: gpuowl-win.exe Application Version: 0.0.0.0 Application Timestamp: 00000000 Fault Module Name: gpuowl-win.exe Fault Module Version: 0.0.0.0 Fault Module Timestamp: 00000000 Exception Code: 40000015 Exception Offset: 000000000005e386 OS Version: 6.1.7601.2.1.0.256.48 Locale ID: 1033 Additional Information 1: 91c7 Additional Information 2: 91c775c91db222fe910a2744dc0825a6 Additional Information 3: de11 Additional Information 4: de11727f51f0f73173ea2f6b995e9dc2 Read our privacy statement online: http://go.microsoft.com/fwlink/?linkid=104288&clcid=0x0409 If the online privacy statement is not available, please read our privacy statement offline: C:\Windows\system32\en-US\erofflps.txt |
|
|
|
|
|
#1371 |
|
"Mihai Preda"
Apr 2015
101010110112 Posts |
|
|
|
|
|
|
#1372 | |
|
"Mihai Preda"
Apr 2015
3·457 Posts |
Quote:
My dislike of "error-count" is caused by the fact that, with GEC and roll-backs, the error-count of the result is always 0, as there is no error *included* in the chain of computation begin-to-result. Let me give an example: Let's say the user start a PRP test of some exponent N. At 50% in the test, a GEC error is detected. The user now starts a whole new PRP(N) test again, from the beginning (e.g. by deleting all the savefiles for N). This second test runs to completion without incident. What should the error-count reported in the result of the second test be? (I suppose 0?) But what if the user, when a GEC error is detected at 50%, instead of starting from the beginnig (0%) starts from a savefile at 10%, and the computation runs without incident to completion, what should the error-count be? (the savefile at 10% is GEC verified good) Again I suppose 0, beause there was no error in this test result -- no error from beginning to 10%, and no error from 10% to end. But what the software does automatically on a GEC error is similar to that user restarting from 10% -- it loads a good savefile, verified, with 0 errors in it, and runs from there to completion without incident (or cancels the test and starts another in the case of another GEC error, etc). [Another way to see it, is that the state of a test should be contained fully in the savefile. Loading a savefile, manually or on a rollback, should re-instate the state from the savefile. In addition, GpuOwl never creates a savefile that didn't pass GEC. Reasoning this way, an "error-count" that is stored in the savefile can never be different from 0] I suppose it would not be useful if GpuOwl added invariably an information "error-count":"0" to every PRP result? Another problem is that GEC errors can also originate from a too-small FFT size (in GpuOwl's case), but that is no indication on the health of the hardware. Is the goal to put a bearing on the "health" of a particular GPU? -- but that would still not affect the validity of the PRP result. And the health of a GPU is not limited to a single test -- e.g. a GPU that often produces GEC errors may still have full runs without errors from time to time, how does that affect the reliability of the result? So, is in fact what is needed a bool indicating whether the GEC was performed or not? |
|
|
|
|
|
|
#1373 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Quote:
|
|
|
|
|
|
|
#1374 | |
|
P90 years forever!
Aug 2002
Yeehaw, FL
5×11×137 Posts |
Quote:
1) It lets the user monitor hardware health. This is especially nice for headless operations. Rather than ssh into each GPU machine and grepping the log files, I can program the server to email a user whenever a non-zero error count is reported (this feature exists now for prime95 LL tests). 2) It lets us spot double check these PRP results someday. The first prime95 implementation had some windows of vulnerability. If there any vulnerabilities remaining, these machines would be the most likely to find them. |
|
|
|
|
|
|
#1375 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
5,419 Posts |
Quote:
Code:
>gpuowl-win -h
2019-09-15 09:55:59 gpuowl v6.10-1-gea7d51c
Command line options:
-dir <folder> : specify work directory (containing worktodo.txt, results.txt, config.txt, gpuowl.log)
-user <name> : specify the user name.
-cpu <name> : specify the hardware name.
-time : display kernel profiling information.
-fft <size> : specify FFT size, such as: 5000K, 4M, +2, -1.
-block <value> : PRP GEC block size. Default 500. Smaller block is slower but detects errors sooner.
-log <step> : log every <step> iterations, default 50000. Multiple of 10000.
-carry long|short : force carry type. Short carry may be faster, but requires high bits/word.
-B1 : P-1 B1 bound, default 500000
-B2 : P-1 B2 bound, default B1 * 30
-rB2 : ratio of B2 to B1. Default 30, used only if B2 is not explicitly set
-prp <exponent> : run a single PRP test and exit, ignoring worktodo.txt
-pm1 <exponent> : run a single P-1 test and exit, ignoring worktodo.txt
-results <file> : name of results file, default 'results.txt'
-iters <N> : run next PRP test for <N> iterations and exit. Multiple of 10000.
-maxAlloc : limit GPU memory usage to this value in MB
-use NEW_FFT8,OLD_FFT5,NEW_FFT10: comma separated list of defines, see the #if tests in gpuowl.cl (used for perf tuning).
-device <N> : select a specific device:
0 : Ellesmere-36@1266-28:00.0 Radeon (TM) RX 480 Graphics
1 : gfx804-8@1203-03:00.0 Radeon 550 Series
FFT Configurations:
FFT 8K [ 0.01M - 0.17M] 64-64
FFT 32K [ 0.05M - 0.68M] 64-256 256-64
FFT 64K [ 0.10M - 1.33M] 64-512 512-64
FFT 128K [ 0.20M - 2.62M] 1K-64 64-1K 256-256
FFT 192K [ 0.29M - 3.89M] 64-256-6
FFT 224K [ 0.34M - 4.52M] 64-256-7
FFT 256K [ 0.39M - 5.15M] 64-2K 256-512 512-256 2K-64
FFT 288K [ 0.44M - 5.77M] 64-256-9
FFT 320K [ 0.49M - 6.40M] 64-256-10
FFT 352K [ 0.54M - 7.02M] 64-256-11
FFT 384K [ 0.59M - 7.64M] 64-256-12 64-512-6
FFT 448K [ 0.69M - 8.88M] 64-512-7
FFT 512K [ 0.79M - 10.12M] 1K-256 256-1K 512-512 4K-64
FFT 576K [ 0.88M - 11.35M] 64-512-9
FFT 640K [ 0.98M - 12.58M] 64-512-10
FFT 704K [ 1.08M - 13.81M] 64-512-11
FFT 768K [ 1.18M - 15.03M] 64-512-12 64-1K-6 256-256-6
FFT 896K [ 1.38M - 17.47M] 64-1K-7 256-256-7
FFT 1M [ 1.57M - 19.89M] 1K-512 256-2K 512-1K 2K-256
FFT 1152K [ 1.77M - 22.32M] 64-1K-9 256-256-9
FFT 1280K [ 1.97M - 24.73M] 64-1K-10 256-256-10
FFT 1408K [ 2.16M - 27.14M] 64-1K-11 256-256-11
FFT 1536K [ 2.36M - 29.54M] 64-1K-12 64-2K-6 256-256-12 256-512-6 512-256-6
FFT 1792K [ 2.75M - 34.33M] 64-2K-7 256-512-7 512-256-7
FFT 2M [ 3.15M - 39.10M] 1K-1K 512-2K 2K-512 4K-256
FFT 2304K [ 3.54M - 43.85M] 64-2K-9 256-512-9 512-256-9
FFT 2560K [ 3.93M - 48.59M] 64-2K-10 256-512-10 512-256-10
FFT 2816K [ 4.33M - 53.32M] 64-2K-11 256-512-11 512-256-11
FFT 3M [ 4.72M - 58.04M] 1K-256-6 64-2K-12 256-512-12 256-1K-6 512-256-12 512-512-6
FFT 3584K [ 5.51M - 67.44M] 1K-256-7 256-1K-7 512-512-7
FFT 4M [ 6.29M - 76.81M] 1K-2K 2K-1K 4K-512
FFT 4608K [ 7.08M - 86.15M] 1K-256-9 256-1K-9 512-512-9
FFT 5M [ 7.86M - 95.46M] 1K-256-10 256-1K-10 512-512-10
FFT 5632K [ 8.65M - 104.74M] 1K-256-11 256-1K-11 512-512-11
FFT 6M [ 9.44M - 114.00M] 1K-256-12 1K-512-6 256-1K-12 256-2K-6 512-512-12 512-1K-6 2K-256-6
FFT 7M [ 11.01M - 132.46M] 1K-512-7 256-2K-7 512-1K-7 2K-256-7
FFT 8M [ 12.58M - 150.85M] 2K-2K 4K-1K
FFT 9M [ 14.16M - 169.18M] 1K-512-9 256-2K-9 512-1K-9 2K-256-9
FFT 10M [ 15.73M - 187.45M] 1K-512-10 256-2K-10 512-1K-10 2K-256-10
FFT 11M [ 17.30M - 205.67M] 1K-512-11 256-2K-11 512-1K-11 2K-256-11
FFT 12M [ 18.87M - 223.85M] 1K-512-12 1K-1K-6 256-2K-12 512-1K-12 512-2K-6 2K-256-12 2K-512-6 4K-256-6
FFT 14M [ 22.02M - 260.08M] 1K-1K-7 512-2K-7 2K-512-7 4K-256-7
FFT 16M [ 25.17M - 296.17M] 4K-2K
FFT 18M [ 28.31M - 332.13M] 1K-1K-9 512-2K-9 2K-512-9 4K-256-9
FFT 20M [ 31.46M - 367.98M] 1K-1K-10 512-2K-10 2K-512-10 4K-256-10
FFT 22M [ 34.60M - 403.74M] 1K-1K-11 512-2K-11 2K-512-11 4K-256-11
FFT 24M [ 37.75M - 439.40M] 1K-1K-12 1K-2K-6 512-2K-12 2K-512-12 2K-1K-6 4K-256-12 4K-512-6
FFT 28M [ 44.04M - 510.47M] 1K-2K-7 2K-1K-7 4K-512-7
FFT 36M [ 56.62M - 651.81M] 1K-2K-9 2K-1K-9 4K-512-9
FFT 40M [ 62.91M - 722.13M] 1K-2K-10 2K-1K-10 4K-512-10
FFT 44M [ 69.21M - 792.25M] 1K-2K-11 2K-1K-11 4K-512-11
FFT 48M [ 75.50M - 862.18M] 1K-2K-12 2K-1K-12 2K-2K-6 4K-512-12 4K-1K-6
FFT 56M [ 88.08M - 1001.57M] 2K-2K-7 4K-1K-7
FFT 72M [113.25M - 1278.70M] 2K-2K-9 4K-1K-9
FFT 80M [125.83M - 1416.57M] 2K-2K-10 4K-1K-10
FFT 88M [138.41M - 1554.04M] 2K-2K-11 4K-1K-11
FFT 96M [150.99M - 1691.15M] 2K-2K-12 4K-1K-12 4K-2K-6
FFT 112M [176.16M - 1964.39M] 4K-2K-7
FFT 144M [226.49M - 2507.57M] 4K-2K-9
FFT 160M [251.66M - 2777.78M] 4K-2K-10
FFT 176M [276.82M - 3047.18M] 4K-2K-11
FFT 192M [301.99M - 3315.86M] 4K-2K-12
2019-09-15 09:56:07 Exiting because "help"
2019-09-15 09:56:07 Bye
C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c>g610
C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c>gpuowl-win -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-15 10:06:04 gpuowl v6.10-1-gea7d51c
2019-09-15 10:06:04 Note: no config.txt file found
2019-09-15 10:06:04 config: -device 0 -use ORIG_X2 -user kriesel -cpu condorella/rx480
2019-09-15 10:06:04 24000577 FFT 1280K: Width 8x8, Height 256x4, Middle 10; 18.31 bits/word
2019-09-15 10:06:04 using short carry kernels
2019-09-15 10:06:11 OpenCL args "-DEXP=24000577u -DWIDTH=64u -DSMALL_HEIGHT=1024u -DMIDDLE=10u -DWEIGHT_STEP=0xc.e5beac96a0b88p-3 -DIWEIGHT_STEP=0x9.eca8ba4660a
fp-4 -DWEIGHT_BIGSTEP=0xe.ac0c6e7dd2438p-3 -DIWEIGHT_BIGSTEP=0x8.b95c1e3ea8bd8p-4 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-15 10:06:15 OpenCL compilation in 3488 ms
2019-09-15 10:06:15 24000577 P1 B1=220000, B2=3960000, stage1 317550 bits
2019-09-15 10:06:44 24000577 P1 10000 3.15%; 2878 us/sq; ETA 0d 00:15; 7f995dc7dff7f8e0
2019-09-15 10:07:13 24000577 P1 20000 6.30%; 2878 us/sq; ETA 0d 00:14; f705474c0ac30c16
2019-09-15 10:07:41 24000577 P1 30000 9.45%; 2880 us/sq; ETA 0d 00:14; 3fc336b60ee971a2
2019-09-15 10:08:10 24000577 P1 40000 12.60%; 2873 us/sq; ETA 0d 00:13; 87c9fcec37cd0a71
2019-09-15 10:08:39 24000577 P1 50000 15.75%; 2884 us/sq; ETA 0d 00:13; f64948f68fb1d67b
2019-09-15 10:09:08 24000577 P1 60000 18.89%; 2876 us/sq; ETA 0d 00:12; c37c2d473cb0ea06
2019-09-15 10:09:37 24000577 P1 70000 22.04%; 2879 us/sq; ETA 0d 00:12; 5bd384b917eabb12
2019-09-15 10:10:05 24000577 P1 80000 25.19%; 2873 us/sq; ETA 0d 00:11; 91ea4d5d92dc1c29
2019-09-15 10:10:34 24000577 P1 90000 28.34%; 2874 us/sq; ETA 0d 00:11; 9c85386920ff8b45
2019-09-15 10:11:03 24000577 P1 100000 31.49%; 2870 us/sq; ETA 0d 00:10; 438848c849a426c8
2019-09-15 10:11:32 24000577 P1 110000 34.64%; 2882 us/sq; ETA 0d 00:10; 495bc594a2150ed6
2019-09-15 10:12:00 24000577 P1 120000 37.79%; 2867 us/sq; ETA 0d 00:09; 1bd1712dcb680f0d
2019-09-15 10:12:29 24000577 P1 130000 40.94%; 2882 us/sq; ETA 0d 00:09; d03e2db3fd19c843
2019-09-15 10:12:58 24000577 P1 140000 44.09%; 2891 us/sq; ETA 0d 00:09; 9fc5fa31b4959aed
2019-09-15 10:13:27 24000577 P1 150000 47.24%; 2874 us/sq; ETA 0d 00:08; ae6304c818c1f83e
2019-09-15 10:13:56 24000577 P1 160000 50.39%; 2872 us/sq; ETA 0d 00:08; fe8f0bada295328d
2019-09-15 10:14:25 24000577 P1 170000 53.53%; 2870 us/sq; ETA 0d 00:07; 3fd5a4ddb6841e9b
2019-09-15 10:14:53 24000577 P1 180000 56.68%; 2878 us/sq; ETA 0d 00:07; a6234de954685799
2019-09-15 10:15:22 24000577 P1 190000 59.83%; 2878 us/sq; ETA 0d 00:06; c873c91deeefba27
2019-09-15 10:15:51 24000577 P1 200000 62.98%; 2874 us/sq; ETA 0d 00:06; eb92d0b622962612
2019-09-15 10:16:20 24000577 P1 210000 66.13%; 2880 us/sq; ETA 0d 00:05; a64dbff6290ed34a
2019-09-15 10:16:49 24000577 P1 220000 69.28%; 2869 us/sq; ETA 0d 00:05; 7f49b2efd2a795fe
2019-09-15 10:17:18 24000577 P1 230000 72.43%; 2870 us/sq; ETA 0d 00:04; 9884971a1fc42886
2019-09-15 10:17:46 24000577 P1 240000 75.58%; 2878 us/sq; ETA 0d 00:04; ba30a7d0f33bde93
2019-09-15 10:18:15 24000577 P1 250000 78.73%; 2869 us/sq; ETA 0d 00:03; bb8984fecf1af62a
2019-09-15 10:18:44 24000577 P1 260000 81.88%; 2877 us/sq; ETA 0d 00:03; efb3c97f53545dbb
2019-09-15 10:19:13 24000577 P1 270000 85.03%; 2875 us/sq; ETA 0d 00:02; 405373760718e67c
2019-09-15 10:19:42 24000577 P1 280000 88.18%; 2880 us/sq; ETA 0d 00:02; a612ab69e780c283
2019-09-15 10:20:11 24000577 P1 290000 91.32%; 2891 us/sq; ETA 0d 00:01; 740645b16c6380fe
2019-09-15 10:20:39 24000577 P1 300000 94.47%; 2878 us/sq; ETA 0d 00:01; 50ed1a7837d59607
2019-09-15 10:21:08 24000577 P1 310000 97.62%; 2877 us/sq; ETA 0d 00:00; 21a38a3fd1fa6582
2019-09-15 10:21:30 24000577 P1 317550 100.00%; 2878 us/sq; ETA 0d 00:00; 7acca8667b4d2492
2019-09-15 10:21:30 P-1 (B1=220000, B2=3960000, D=30030): primes 260946, expanded 262000, doubles 47491 (left 166492), singles 165964, total 213455 (82%)
2019-09-15 10:21:30 24000577 P2 using blocks [7 - 132] to cover 213455 primes
2019-09-15 10:21:30 24000577 P2 using 770 buffers of 10.0 MB each
2019-09-15 10:24:34 24000577 P2 770/2880: setup 11824 ms; 3025 us/prime, 56682 primes
2019-09-15 10:24:34 24000577 P1 GCD: no factor
2019-09-15 10:27:38 24000577 P2 1540/2880: setup 11778 ms; 3025 us/prime, 57130 primes
2019-09-15 10:30:43 24000577 P2 2310/2880: setup 11793 ms; 3026 us/prime, 57186 primes
2019-09-15 10:33:01 24000577 P2 2880/2880: setup 8720 ms; 3032 us/prime, 42457 primes
2019-09-15 10:33:01 1257787 FFT 64K: Width 8x8, Height 64x8; 19.19 bits/word
2019-09-15 10:33:01 using short carry kernels
2019-09-15 10:33:01 OpenCL args "-DEXP=1257787u -DWIDTH=64u -DSMALL_HEIGHT=512u -DMIDDLE=1u -DWEIGHT_STEP=0xe.00d75658c47c8p-3 -DIWEIGHT_STEP=0x9.2405b0b5f2d88p
-4 -DWEIGHT_BIGSTEP=0xc.5672a115506d8p-3 -DIWEIGHT_BIGSTEP=0xa.5fed6a9b15138p-4 -DORIG_X2=1 -I. -cl-fast-relaxed-math -cl-std=CL2.0"
2019-09-15 10:33:04 OpenCL compilation in 3712 ms
2019-09-15 10:33:04 C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787\1257787.owl not found
2019-09-15 10:33:04 C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787\1257787-old.owl not found
2019-09-15 10:33:04 starting from the beginning.
2019-09-15 10:33:05 1257787 OK 1000 0.08%; 202 us/sq; ETA 0d 00:04; 91d0e6e562cb2541 (check 0.11s)
2019-09-15 10:33:15 1257787 50000 3.97%; 213 us/sq; ETA 0d 00:04; d7ea0488d047e5e4
2019-09-15 10:33:18 24000577 P2 GCD: 13504596665207
2019-09-15 10:33:18 {"exponent":"24000577", "worktype":"PM1", "status":"F", "program":{"name":"gpuowl", "version":"v6.10-1-gea7d51c"}, "timestamp":"2019-09-15 1
5:33:18 UTC", "user":"kriesel", "computer":"condorella/rx480", "aid":"0", "fft-length":1310720, "B1":220000, "B2":3960000, "factors":["13504596665207"]}
2019-09-15 10:33:26 1257787 100000 7.95%; 214 us/sq; ETA 0d 00:04; 09f25999ff3326ca
2019-09-15 10:33:37 1257787 150000 11.92%; 213 us/sq; ETA 0d 00:04; 367d63ab9a7b46d5
2019-09-15 10:33:47 1257787 200000 15.90%; 212 us/sq; ETA 0d 00:04; 25ebe34e39ca647b
2019-09-15 10:33:58 1257787 OK 250000 19.87%; 212 us/sq; ETA 0d 00:04; 564fdae0bb5a37b1 (check 0.12s)
2019-09-15 10:34:09 1257787 300000 23.85%; 214 us/sq; ETA 0d 00:03; 79b4d6cb0169a9b0
2019-09-15 10:34:20 1257787 350000 27.82%; 213 us/sq; ETA 0d 00:03; 0b9b51c4f7638fd3
2019-09-15 10:34:30 1257787 400000 31.80%; 213 us/sq; ETA 0d 00:03; fe2bfeea5734dd7c
2019-09-15 10:34:41 1257787 450000 35.77%; 213 us/sq; ETA 0d 00:03; 16fa53053e566011
2019-09-15 10:34:52 1257787 OK 500000 39.75%; 213 us/sq; ETA 0d 00:03; 7838f365c8c78d0c (check 0.11s)
2019-09-15 10:34:52 Exception NSt10filesystem7__cxx1116filesystem_errorE: filesystem error: cannot rename: File exists [C:\msys64\home\ken\gpuowl-compile\v6.10-
1-gea7d51c\1257787\1257787-new.owl] [C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787\1257787.owl]
2019-09-15 10:34:52 Bye
C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c>dir 1257787
Volume in drive C has no label.
Volume Serial Number is 3E40-A384
Directory of C:\msys64\home\ken\gpuowl-compile\v6.10-1-gea7d51c\1257787
09/15/2019 10:34 AM <DIR> .
09/15/2019 10:34 AM <DIR> ..
09/15/2019 10:34 AM 157,270 1257787-new.owl
09/15/2019 10:33 AM 157,268 1257787-old.owl
09/15/2019 10:33 AM 157,270 1257787.owl
3 File(s) 471,808 bytes
2 Dir(s) 863,544,840,192 bytes free
Code:
B1=220000,B2=3960000;PFactor=0,1,2,24000577,-1,76,2 PRP=0,1,2,1257787,-1,70,0 Last fiddled with by kriesel on 2019-09-15 at 16:01 |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| GPUOWL AMD Windows OpenCL issues | xx005fs | GpuOwl | 0 | 2019-07-26 21:37 |
| Testing an expression for primality | 1260 | Software | 17 | 2015-08-28 01:35 |
| Testing Mersenne cofactors for primality? | CRGreathouse | Computer Science & Computational Number Theory | 18 | 2013-06-08 19:12 |
| Primality-testing program with multiple types of moduli (PFGW-related) | Unregistered | Information & Answers | 4 | 2006-10-04 22:38 |