![]() |
[QUOTE=SELROC;507686]That is an old version. Now at version 6.2 ...[/QUOTE]Yes. I'm playing catchup.
|
gpuowl v6.1 Windows build and first takes
1 Attachment(s)
A few warnings during the make (which also occurred for v6.0):[CODE]$ make openowl-win
g++ -std=c++17 -O2 -DREV=\"569e6ef\" -Wall Pm1Plan.cpp GmpUtil.cpp Worktodo.cpp common.cpp gpuowl.cpp Gpu.cpp clwrap.cpp Task.cpp checkpoint.cpp timeutil.cpp Args.cpp Primes.cpp state.cpp Signal.cpp FFTConfig.cpp -o openowl-win-569e6ef -lOpenCL -lgmp -pthread -L/opt/rocm/opencl/lib/x86_64 -L/opt/amdgpu-pro/lib/x86_64-linux-gnu -L/c/Windows/System32 -L. -static Gpu.cpp: In member function 'PRPState Gpu::loadPRP(u32, u32, Buffer&, Buffer&, Buffer&)': Gpu.cpp:470:9: warning: unknown conversion type character 'l' in format [-Wformat=] log("%u EE loaded: %d, blockSize %d, %016llx (expected %016llx)\n", ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Gpu.cpp:470:9: warning: unknown conversion type character 'l' in format [-Wformat=] Gpu.cpp:470:9: warning: too many arguments for format [-Wformat-extra-args] Gpu.cpp: In member function 'std::pair<bool, long long unsigned int> Gpu::isPrimePRP(u32, const Args&)': Gpu.cpp:517:11: warning: unknown conversion type character 'l' in format [-Wformat=] log("%s %8d / %d, %016llx\n", isPrime ? "PP" : "CC", kEnd, E, finalRes64); ^~~~~~~~~~~~~~~~~~~~~~~~ Gpu.cpp:517:11: warning: too many arguments for format [-Wformat-extra-args] checkpoint.cpp: In member function 'void PRPState::loadInt(u32, u32)': checkpoint.cpp:81:9: warning: unknown conversion type character 'l' in format [-Wformat=] log("%s loaded: k %u, block %u, res64 %016llx\n", name.c_str(), k, blockSize, res64); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ checkpoint.cpp:81:9: warning: too many arguments for format [-Wformat-extra-args] [/CODE]Showing devices with preceding device numbers, and combining list of available fft configurations into -h are good changes. [CODE]C:\msys64\home\ken\gpuowl-compile\v6.1-569e6ef>openowl-win-v61-569e6ef -h 2019-02-05 08:48:52 gpuowl 6.1-569e6ef Command line options: -user <name> : specify the user name. -cpu <name> : specify the hardware name. -time : display kernel profiling information. -fft <size> : specify FFT size, such as: 5000K, 4M, +2, -1. -block <value> : PRP GEC block size. Default 400. Smaller block is slower but detects errors sooner. -carry long|short : force carry type. Short carry may be faster, but requires high bits/word. -D <value> : P-1 second-stage D block size; multiple of 210; default auto based on GPU available memory. -device <N> : select a specific device: 0 : Ellesmere-36x1266-@28:0.0 Radeon (TM) RX 480 Graphics 1 : gfx804-8x1203-@3:0.0 Radeon 550 Series FFT Configurations: FFT 8K [ 0.01M - 0.18M] 64-64 FFT 24K [ 0.04M - 0.51M] 64-64-3 FFT 32K [ 0.05M - 0.68M] 64-256 256-64 FFT 40K [ 0.06M - 0.85M] 64-64-5 FFT 64K [ 0.10M - 1.34M] 64-512 512-64 FFT 72K [ 0.11M - 1.50M] 64-64-9 FFT 96K [ 0.15M - 1.99M] 64-256-3 256-64-3 FFT 128K [ 0.20M - 2.63M] 1K-64 64-1K 256-256 FFT 160K [ 0.25M - 3.27M] 64-256-5 256-64-5 FFT 192K [ 0.29M - 3.91M] 64-512-3 512-64-3 FFT 256K [ 0.39M - 5.18M] 64-2K 256-512 512-256 2K-64 FFT 288K [ 0.44M - 5.81M] 64-256-9 256-64-9 FFT 320K [ 0.49M - 6.44M] 64-512-5 512-64-5 FFT 384K [ 0.59M - 7.69M] 1K-64-3 64-1K-3 256-256-3 FFT 512K [ 0.79M - 10.18M] 1K-256 256-1K 512-512 4K-64 FFT 576K [ 0.88M - 11.42M] 64-512-9 512-64-9 FFT 640K [ 0.98M - 12.66M] 1K-64-5 64-1K-5 256-256-5 FFT 768K [ 1.18M - 15.12M] 64-2K-3 256-512-3 512-256-3 2K-64-3 FFT 1M [ 1.57M - 20.02M] 1K-512 256-2K 512-1K 2K-256 FFT 1152K [ 1.77M - 22.45M] 1K-64-9 64-1K-9 256-256-9 FFT 1280K [ 1.97M - 24.88M] 64-2K-5 256-512-5 512-256-5 2K-64-5 FFT 1536K [ 2.36M - 29.72M] 1K-256-3 256-1K-3 512-512-3 4K-64-3 FFT 2M [ 3.15M - 39.34M] 1K-1K 512-2K 2K-512 4K-256 FFT 2304K [ 3.54M - 44.13M] 64-2K-9 256-512-9 512-256-9 2K-64-9 FFT 2560K [ 3.93M - 48.90M] 1K-256-5 256-1K-5 512-512-5 4K-64-5 FFT 3M [ 4.72M - 58.41M] 1K-512-3 256-2K-3 512-1K-3 2K-256-3 FFT 4M [ 6.29M - 77.30M] 1K-2K 2K-1K 4K-512 FFT 4608K [ 7.08M - 86.70M] 1K-256-9 256-1K-9 512-512-9 4K-64-9 FFT 5M [ 7.86M - 96.07M] 1K-512-5 256-2K-5 512-1K-5 2K-256-5 FFT 6M [ 9.44M - 114.74M] 1K-1K-3 512-2K-3 2K-512-3 4K-256-3 FFT 8M [ 12.58M - 151.83M] 2K-2K 4K-1K FFT 9M [ 14.16M - 170.28M] 1K-512-9 256-2K-9 512-1K-9 2K-256-9 FFT 10M [ 15.73M - 188.68M] 1K-1K-5 512-2K-5 2K-512-5 4K-256-5 FFT 12M [ 18.87M - 225.32M] 1K-2K-3 2K-1K-3 4K-512-3 FFT 16M [ 25.17M - 298.13M] 4K-2K FFT 18M [ 28.31M - 334.34M] 1K-1K-9 512-2K-9 2K-512-9 4K-256-9 FFT 20M [ 31.46M - 370.44M] 1K-2K-5 2K-1K-5 4K-512-5 FFT 24M [ 37.75M - 442.34M] 2K-2K-3 4K-1K-3 FFT 36M [ 56.62M - 656.22M] 1K-2K-9 2K-1K-9 4K-512-9 FFT 40M [ 62.91M - 727.03M] 2K-2K-5 4K-1K-5 FFT 48M [ 75.50M - 868.07M] 4K-2K-3 FFT 72M [113.25M - 1287.53M] 2K-2K-9 4K-1K-9 FFT 80M [125.83M - 1426.38M] 4K-2K-5 FFT 144M [226.49M - 2525.23M] 4K-2K-9 [/CODE]216091 and 756839 ran into trouble like in V6.0. Not tested then was 1398269, which in V6.1 also has the error on load. [CODE] 2019-02-05 08:45:49 condorella/rx-480 1398269 FFT 72K: Width 8x8, Height 8x8, Middle 9; 18.97 bits/word 2019-02-05 08:45:49 condorella/rx-480 using short carry kernels 2019-02-05 08:45:53 condorella/rx-480 OpenCL compilation in 3968 ms, with "-DEXP=1398269u -DWIDTH=64u -DSMALL_HEIGHT=64u -DMIDDLE=9u -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-02-05 08:45:53 condorella/rx-480 1398269.owl not found, starting from the beginning. 2019-02-05 08:45:54 condorella/rx-480 1398269 EE loaded: 0, blockSize 400, 0000000000000000 (expected 0000000000000003) 2019-02-05 08:45:54 condorella/rx-480 Exiting because "error on load" 2019-02-05 08:45:54 condorella/rx-480 Bye[/CODE][CODE]2019-02-05 09:12:18 condorella/rx-480 6972593 FFT 384K: Width 256x4, Height 8x8, Middle 3; 17.73 bits/word 2019-02-05 09:12:18 condorella/rx-480 using short carry kernels 2019-02-05 09:12:25 condorella/rx-480 OpenCL compilation in 4100 ms, with "-DEXP=6972593u -DWIDTH=1024u -DSMALL_HEIGHT=64u -DMIDDLE=3u -I. -cl-fast-relaxed-mat h -cl-std=CL2.0" 2019-02-05 09:12:25 condorella/rx-480 6972593.owl not found, starting from the beginning. 2019-02-05 09:12:25 condorella/rx-480 6972593 EE loaded: 0, blockSize 400, 0000000000000000 (expected 0000000000000003) 2019-02-05 09:12:25 condorella/rx-480 Exiting because "error on load" 2019-02-05 09:12:25 condorella/rx-480 Bye[/CODE][CODE]2019-02-05 09:14:39 condorella/rx-480 20996011 FFT 1152K: Width 256x4, Height 8x8, Middle 9; 17.80 bits/word 2019-02-05 09:14:39 condorella/rx-480 using short carry kernels 2019-02-05 09:14:46 condorella/rx-480 OpenCL compilation in 4130 ms, with "-DEXP=20996011u -DWIDTH=1024u -DSMALL_HEIGHT=64u -DMIDDLE=9u -I. -cl-fast-relaxed-ma th -cl-std=CL2.0" 2019-02-05 09:14:46 condorella/rx-480 20996011.owl not found, starting from the beginning. 2019-02-05 09:14:47 condorella/rx-480 20996011 EE loaded: 0, blockSize 400, ca26e8b69c18204c (expected 0000000000000003) 2019-02-05 09:14:47 condorella/rx-480 Exiting because "error on load" 2019-02-05 09:14:47 condorella/rx-480 Bye[/CODE]Timing for 1280K fft at 2.95ms/iter was anomalously higher than for 1536k (1.33 ms/iter) Similarly for 2304K at 5.15 ms/iter vs. 3072K at 2.75 ms/iter I only tried the fft lengths the program chose for known Mp exponents. So there may be more cases of low speed by default selection. There were cases where the same fft length was a few percent faster or slower in V6.1 than the best previously observed in V3.5 to v5.0. The executable in the attached zip file is considerably smaller than the one for V6.0, because I omitted doing "strip openowl" at V6.0. |
Openowl V6.2 on Windows and first takes (trouble in 6.2 and 6.1)
V6.2 openowl build for Win64 and first takes
No executable posted yet. Skip to the end for why. Same 3 warnings again during the build process as listed for v6.1. -h worked with or without a valid worktodo file, good. openowl -h output:[CODE]2019-02-05 17:39:12 gpuowl 6.2-4a213af Command line options: -user <name> : specify the user name. -cpu <name> : specify the hardware name. -time : display kernel profiling information. -fft <size> : specify FFT size, such as: 5000K, 4M, +2, -1. -block <value> : PRP GEC block size. Default 400. Smaller block is slower but detects errors sooner. -carry long|short : force carry type. Short carry may be faster, but requires high bits/word. -D <value> : P-1 second-stage D block size; multiple of 210; default auto based on GPU available memory. -device <N> : select a specific device: 0 : Ellesmere-36x1266-@28:0.0 Radeon (TM) RX 480 Graphics 1 : gfx804-8x1203-@3:0.0 Radeon 550 Series FFT Configurations: FFT 8K [ 0.01M - 0.18M] 64-64 FFT 32K [ 0.05M - 0.68M] 64-256 256-64 FFT 48K [ 0.07M - 1.01M] 64-64-6 FFT 64K [ 0.10M - 1.34M] 64-512 512-64 FFT 72K [ 0.11M - 1.50M] 64-64-9 FFT 80K [ 0.12M - 1.66M] 64-64-10 FFT 128K [ 0.20M - 2.63M] 1K-64 64-1K 256-256 FFT 192K [ 0.29M - 3.91M] 64-256-6 256-64-6 FFT 256K [ 0.39M - 5.18M] 64-2K 256-512 512-256 2K-64 FFT 288K [ 0.44M - 5.81M] 64-256-9 256-64-9 FFT 320K [ 0.49M - 6.44M] 64-256-10 256-64-10 FFT 384K [ 0.59M - 7.69M] 64-512-6 512-64-6 FFT 512K [ 0.79M - 10.18M] 1K-256 256-1K 512-512 4K-64 FFT 576K [ 0.88M - 11.42M] 64-512-9 512-64-9 FFT 640K [ 0.98M - 12.66M] 64-512-10 512-64-10 FFT 768K [ 1.18M - 15.12M] 1K-64-6 64-1K-6 256-256-6 FFT 1M [ 1.57M - 20.02M] 1K-512 256-2K 512-1K 2K-256 FFT 1152K [ 1.77M - 22.45M] 1K-64-9 64-1K-9 256-256-9 FFT 1280K [ 1.97M - 24.88M] 1K-64-10 64-1K-10 256-256-10 FFT 1536K [ 2.36M - 29.72M] 64-2K-6 256-512-6 512-256-6 2K-64-6 FFT 2M [ 3.15M - 39.34M] 1K-1K 512-2K 2K-512 4K-256 FFT 2304K [ 3.54M - 44.13M] 64-2K-9 256-512-9 512-256-9 2K-64-9 FFT 2560K [ 3.93M - 48.90M] 64-2K-10 256-512-10 512-256-10 2K-64-10 FFT 3M [ 4.72M - 58.41M] 1K-256-6 256-1K-6 512-512-6 4K-64-6 FFT 4M [ 6.29M - 77.30M] 1K-2K 2K-1K 4K-512 FFT 4608K [ 7.08M - 86.70M] 1K-256-9 256-1K-9 512-512-9 4K-64-9 FFT 5M [ 7.86M - 96.07M] 1K-256-10 256-1K-10 512-512-10 4K-64-10 FFT 6M [ 9.44M - 114.74M] 1K-512-6 256-2K-6 512-1K-6 2K-256-6 FFT 8M [ 12.58M - 151.83M] 2K-2K 4K-1K FFT 9M [ 14.16M - 170.28M] 1K-512-9 256-2K-9 512-1K-9 2K-256-9 FFT 10M [ 15.73M - 188.68M] 1K-512-10 256-2K-10 512-1K-10 2K-256-10 FFT 12M [ 18.87M - 225.32M] 1K-1K-6 512-2K-6 2K-512-6 4K-256-6 FFT 16M [ 25.17M - 298.13M] 4K-2K FFT 18M [ 28.31M - 334.34M] 1K-1K-9 512-2K-9 2K-512-9 4K-256-9 FFT 20M [ 31.46M - 370.44M] 1K-1K-10 512-2K-10 2K-512-10 4K-256-10 FFT 24M [ 37.75M - 442.34M] 1K-2K-6 2K-1K-6 4K-512-6 FFT 36M [ 56.62M - 656.22M] 1K-2K-9 2K-1K-9 4K-512-9 FFT 40M [ 62.91M - 727.03M] 1K-2K-10 2K-1K-10 4K-512-10 FFT 48M [ 75.50M - 868.07M] 2K-2K-6 4K-1K-6 FFT 72M [113.25M - 1287.53M] 2K-2K-9 4K-1K-9 FFT 80M [125.83M - 1426.38M] 2K-2K-10 4K-1K-10 FFT 96M [150.99M - 1702.92M] 4K-2K-6 FFT 144M [226.49M - 2525.23M] 4K-2K-9 FFT 160M [251.66M - 2797.39M] 4K-2K-10[/CODE]p=756839 error on load, program terminated. p=859433 error on load, program terminated. p=1398269 error on load, program terminated. p=13466917 error on load, program terminated. p=20996011 error on load, program terminated. p=24036583 error on load, program terminated. 1536k fft length anomalously long timing 3.51ms/iter 2304k similarly, 5.15ms/iter, while 3072k is 2.55 The big item though, is no [B]known Mersenne prime[/B] that I ran to completion in V6.2 produced a prime indication at console or results file. [B]All[/B] from 19937 to 1257787 [B]indicated composite[/B] or a load error. So, I'm not posting a Windows executable for gpuowl v6.2, for now. [B]The same problem occurred with V6.1. Don't use that posted executable.[/B] The interim 64-bit residues at 10000, 100000, and 1000000 that I checked all matched expected values. [CODE]2019-02-05 08:35:12 gpuowl 6.1-569e6ef 2019-02-05 08:35:12 condorella/rx-480 -device 0 -user kriesel -cpu condorella/rx-480 2019-02-05 08:35:12 condorella/rx-480 19937 FFT 8K: Width 8x8, Height 8x8; 2.43 bits/word 2019-02-05 08:35:12 condorella/rx-480 using long carry kernels 2019-02-05 08:35:18 condorella/rx-480 OpenCL compilation in 4156 ms, with "-DEXP=19937u -DWIDTH=64u -DSMALL_HEIGHT=64u -DMIDDLE=1u -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-02-05 08:35:18 condorella/rx-480 19937.owl not found, starting from the beginning. 2019-02-05 08:35:18 condorella/rx-480 19937 OK 800 4.00%; 0.13 ms/sq; ETA 0d 00:00; 7aa6c3340ce46bab (check 0.06s) 2019-02-05 08:35:20 condorella/rx-480 19937 10000 50.00%; 0.14 ms/sq; ETA 0d 00:00; 6248f957ba3ee3c5 2019-02-05 08:35:21 condorella/rx-480 [COLOR=Red][B]CC[/B][/COLOR] 19936 / 19937, [B][COLOR=SeaGreen]fffffffffffffffc[/COLOR][/B] 2019-02-05 08:35:21 condorella/rx-480 19937 OK 20000 100.00%; 0.14 ms/sq; ETA 0d 00:00; f5eb5782c7855ffd (check 0.06s) 2019-02-05 08:35:21 condorella/rx-480 {"exponent":"19937", "worktype":"PRP-3", "status":"[COLOR=red][B]C[/B][/COLOR]", "program":{"name":"gpuowl", "version":"6.1-569e6ef"}, "timestamp":"2019-02-05 14:35:21 UTC", "user":"kriesel", "computer":"condorella/rx-480", "aid":"0", "fft-length":8192, "res64":"[B][COLOR=SeaGreen]fffffffffffffffc[/COLOR][/B]", "residue-type":4} [/CODE][CODE]2019-02-05 17:49:24 condorella/rx-480 44497 FFT 8K: Width 8x8, Height 8x8; 5.43 bits/word 2019-02-05 17:49:24 condorella/rx-480 using long carry kernels 2019-02-05 17:49:28 condorella/rx-480 OpenCL compilation in 3948 ms, with "-DEXP=44497u -DWIDTH=64u -DSMALL_HEIGHT=64u -DMIDDLE=1u -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-02-05 17:49:28 condorella/rx-480 44497.owl not found, starting from the beginning. 2019-02-05 17:49:29 condorella/rx-480 44497 OK 800 1.79%; 0.42 ms/sq; ETA 0d 00:00; c89e6116066cebba (check 0.31s) 2019-02-05 17:49:33 condorella/rx-480 44497 10000 22.32%; 0.46 ms/sq; ETA 0d 00:00; d45f9720e7aa56ae 2019-02-05 17:49:38 condorella/rx-480 44497 20000 44.64%; 0.48 ms/sq; ETA 0d 00:00; e0fc41c8eadc4e96 2019-02-05 17:49:43 condorella/rx-480 44497 30000 66.96%; 0.53 ms/sq; ETA 0d 00:00; 8792168dfe598ffa 2019-02-05 17:49:48 condorella/rx-480 44497 40000 89.29%; 0.47 ms/sq; ETA 0d 00:00; 9b4920985d079c24 2019-02-05 17:49:50 condorella/rx-480 [B][COLOR=red]CC[/COLOR][/B] 44496 / 44497, [B][COLOR=seagreen]fffffffffffffffc[/COLOR][/B] 2019-02-05 17:49:51 condorella/rx-480 44497 OK 44800 100.00%; 0.54 ms/sq; ETA 0d 00:00; e92a77e2d568e367 (check 0.45s) 2019-02-05 17:49:51 condorella/rx-480 {"exponent":"44497", "worktype":"PRP-3", "status":"[B][COLOR=red]C[/COLOR][/B]", "program":{"name":"gpuowl", "version":"6.2-4a213af"}, "timestamp":"2019-02-05 23:49:51 UTC", "user":"kriesel", "computer":"condorella/rx-480", "aid":"0", "fft-length":8192, "res64":"[B][COLOR=seagreen]fffffffffffffffc[/COLOR][/B]", "residue-type":4} [/CODE] |
Something is wrong with gpuowl !!!
[QUOTE=kriesel;507780]V6.2 openowl build for Win64 and first takes
No executable posted yet. Skip to the end for why. Same 3 warnings again during the build process as listed for v6.1. -h worked with or without a valid worktodo file, good. openowl -h output:[CODE]2019-02-05 17:39:12 gpuowl 6.2-4a213af Command line options: -user <name> : specify the user name. -cpu <name> : specify the hardware name. -time : display kernel profiling information. -fft <size> : specify FFT size, such as: 5000K, 4M, +2, -1. -block <value> : PRP GEC block size. Default 400. Smaller block is slower but detects errors sooner. -carry long|short : force carry type. Short carry may be faster, but requires high bits/word. -D <value> : P-1 second-stage D block size; multiple of 210; default auto based on GPU available memory. -device <N> : select a specific device: 0 : Ellesmere-36x1266-@28:0.0 Radeon (TM) RX 480 Graphics 1 : gfx804-8x1203-@3:0.0 Radeon 550 Series FFT Configurations: FFT 8K [ 0.01M - 0.18M] 64-64 FFT 32K [ 0.05M - 0.68M] 64-256 256-64 FFT 48K [ 0.07M - 1.01M] 64-64-6 FFT 64K [ 0.10M - 1.34M] 64-512 512-64 FFT 72K [ 0.11M - 1.50M] 64-64-9 FFT 80K [ 0.12M - 1.66M] 64-64-10 FFT 128K [ 0.20M - 2.63M] 1K-64 64-1K 256-256 FFT 192K [ 0.29M - 3.91M] 64-256-6 256-64-6 FFT 256K [ 0.39M - 5.18M] 64-2K 256-512 512-256 2K-64 FFT 288K [ 0.44M - 5.81M] 64-256-9 256-64-9 FFT 320K [ 0.49M - 6.44M] 64-256-10 256-64-10 FFT 384K [ 0.59M - 7.69M] 64-512-6 512-64-6 FFT 512K [ 0.79M - 10.18M] 1K-256 256-1K 512-512 4K-64 FFT 576K [ 0.88M - 11.42M] 64-512-9 512-64-9 FFT 640K [ 0.98M - 12.66M] 64-512-10 512-64-10 FFT 768K [ 1.18M - 15.12M] 1K-64-6 64-1K-6 256-256-6 FFT 1M [ 1.57M - 20.02M] 1K-512 256-2K 512-1K 2K-256 FFT 1152K [ 1.77M - 22.45M] 1K-64-9 64-1K-9 256-256-9 FFT 1280K [ 1.97M - 24.88M] 1K-64-10 64-1K-10 256-256-10 FFT 1536K [ 2.36M - 29.72M] 64-2K-6 256-512-6 512-256-6 2K-64-6 FFT 2M [ 3.15M - 39.34M] 1K-1K 512-2K 2K-512 4K-256 FFT 2304K [ 3.54M - 44.13M] 64-2K-9 256-512-9 512-256-9 2K-64-9 FFT 2560K [ 3.93M - 48.90M] 64-2K-10 256-512-10 512-256-10 2K-64-10 FFT 3M [ 4.72M - 58.41M] 1K-256-6 256-1K-6 512-512-6 4K-64-6 FFT 4M [ 6.29M - 77.30M] 1K-2K 2K-1K 4K-512 FFT 4608K [ 7.08M - 86.70M] 1K-256-9 256-1K-9 512-512-9 4K-64-9 FFT 5M [ 7.86M - 96.07M] 1K-256-10 256-1K-10 512-512-10 4K-64-10 FFT 6M [ 9.44M - 114.74M] 1K-512-6 256-2K-6 512-1K-6 2K-256-6 FFT 8M [ 12.58M - 151.83M] 2K-2K 4K-1K FFT 9M [ 14.16M - 170.28M] 1K-512-9 256-2K-9 512-1K-9 2K-256-9 FFT 10M [ 15.73M - 188.68M] 1K-512-10 256-2K-10 512-1K-10 2K-256-10 FFT 12M [ 18.87M - 225.32M] 1K-1K-6 512-2K-6 2K-512-6 4K-256-6 FFT 16M [ 25.17M - 298.13M] 4K-2K FFT 18M [ 28.31M - 334.34M] 1K-1K-9 512-2K-9 2K-512-9 4K-256-9 FFT 20M [ 31.46M - 370.44M] 1K-1K-10 512-2K-10 2K-512-10 4K-256-10 FFT 24M [ 37.75M - 442.34M] 1K-2K-6 2K-1K-6 4K-512-6 FFT 36M [ 56.62M - 656.22M] 1K-2K-9 2K-1K-9 4K-512-9 FFT 40M [ 62.91M - 727.03M] 1K-2K-10 2K-1K-10 4K-512-10 FFT 48M [ 75.50M - 868.07M] 2K-2K-6 4K-1K-6 FFT 72M [113.25M - 1287.53M] 2K-2K-9 4K-1K-9 FFT 80M [125.83M - 1426.38M] 2K-2K-10 4K-1K-10 FFT 96M [150.99M - 1702.92M] 4K-2K-6 FFT 144M [226.49M - 2525.23M] 4K-2K-9 FFT 160M [251.66M - 2797.39M] 4K-2K-10[/CODE]p=756839 error on load, program terminated. p=859433 error on load, program terminated. p=1398269 error on load, program terminated. p=13466917 error on load, program terminated. p=20996011 error on load, program terminated. p=24036583 error on load, program terminated. 1536k fft length anomalously long timing 3.51ms/iter 2304k similarly, 5.15ms/iter, while 3072k is 2.55 The big item though, is no [B]known Mersenne prime[/B] that I ran to completion in V6.2 produced a prime indication at console or results file. [B]All[/B] from 19937 to 1257787 [B]indicated composite[/B] or a load error. So, I'm not posting a Windows executable for gpuowl v6.2, for now. [B]The same problem occurred with V6.1. Don't use that posted executable.[/B] The interim 64-bit residues at 10000, 100000, and 1000000 that I checked all matched expected values. [CODE]2019-02-05 08:35:12 gpuowl 6.1-569e6ef 2019-02-05 08:35:12 condorella/rx-480 -device 0 -user kriesel -cpu condorella/rx-480 2019-02-05 08:35:12 condorella/rx-480 19937 FFT 8K: Width 8x8, Height 8x8; 2.43 bits/word 2019-02-05 08:35:12 condorella/rx-480 using long carry kernels 2019-02-05 08:35:18 condorella/rx-480 OpenCL compilation in 4156 ms, with "-DEXP=19937u -DWIDTH=64u -DSMALL_HEIGHT=64u -DMIDDLE=1u -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-02-05 08:35:18 condorella/rx-480 19937.owl not found, starting from the beginning. 2019-02-05 08:35:18 condorella/rx-480 19937 OK 800 4.00%; 0.13 ms/sq; ETA 0d 00:00; 7aa6c3340ce46bab (check 0.06s) 2019-02-05 08:35:20 condorella/rx-480 19937 10000 50.00%; 0.14 ms/sq; ETA 0d 00:00; 6248f957ba3ee3c5 2019-02-05 08:35:21 condorella/rx-480 [COLOR=Red][B]CC[/B][/COLOR] 19936 / 19937, [B][COLOR=SeaGreen]fffffffffffffffc[/COLOR][/B] 2019-02-05 08:35:21 condorella/rx-480 19937 OK 20000 100.00%; 0.14 ms/sq; ETA 0d 00:00; f5eb5782c7855ffd (check 0.06s) 2019-02-05 08:35:21 condorella/rx-480 {"exponent":"19937", "worktype":"PRP-3", "status":"[COLOR=red][B]C[/B][/COLOR]", "program":{"name":"gpuowl", "version":"6.1-569e6ef"}, "timestamp":"2019-02-05 14:35:21 UTC", "user":"kriesel", "computer":"condorella/rx-480", "aid":"0", "fft-length":8192, "res64":"[B][COLOR=SeaGreen]fffffffffffffffc[/COLOR][/B]", "residue-type":4} [/CODE][CODE]2019-02-05 17:49:24 condorella/rx-480 44497 FFT 8K: Width 8x8, Height 8x8; 5.43 bits/word 2019-02-05 17:49:24 condorella/rx-480 using long carry kernels 2019-02-05 17:49:28 condorella/rx-480 OpenCL compilation in 3948 ms, with "-DEXP=44497u -DWIDTH=64u -DSMALL_HEIGHT=64u -DMIDDLE=1u -I. -cl-fast-relaxed-math -cl-std=CL2.0" 2019-02-05 17:49:28 condorella/rx-480 44497.owl not found, starting from the beginning. 2019-02-05 17:49:29 condorella/rx-480 44497 OK 800 1.79%; 0.42 ms/sq; ETA 0d 00:00; c89e6116066cebba (check 0.31s) 2019-02-05 17:49:33 condorella/rx-480 44497 10000 22.32%; 0.46 ms/sq; ETA 0d 00:00; d45f9720e7aa56ae 2019-02-05 17:49:38 condorella/rx-480 44497 20000 44.64%; 0.48 ms/sq; ETA 0d 00:00; e0fc41c8eadc4e96 2019-02-05 17:49:43 condorella/rx-480 44497 30000 66.96%; 0.53 ms/sq; ETA 0d 00:00; 8792168dfe598ffa 2019-02-05 17:49:48 condorella/rx-480 44497 40000 89.29%; 0.47 ms/sq; ETA 0d 00:00; 9b4920985d079c24 2019-02-05 17:49:50 condorella/rx-480 [B][COLOR=red]CC[/COLOR][/B] 44496 / 44497, [B][COLOR=seagreen]fffffffffffffffc[/COLOR][/B] 2019-02-05 17:49:51 condorella/rx-480 44497 OK 44800 100.00%; 0.54 ms/sq; ETA 0d 00:00; e92a77e2d568e367 (check 0.45s) 2019-02-05 17:49:51 condorella/rx-480 {"exponent":"44497", "worktype":"PRP-3", "status":"[B][COLOR=red]C[/COLOR][/B]", "program":{"name":"gpuowl", "version":"6.2-4a213af"}, "timestamp":"2019-02-05 23:49:51 UTC", "user":"kriesel", "computer":"condorella/rx-480", "aid":"0", "fft-length":8192, "res64":"[B][COLOR=seagreen]fffffffffffffffc[/COLOR][/B]", "residue-type":4} [/CODE][/QUOTE] I have just tested prime 44497 and the result is "C". Please double-check the program...we are computing also large numbers... |
[QUOTE=SELROC;507792]I have just tested prime 44497 and the result is "C".
Please double-check the program...we are computing also large numbers...[/QUOTE] Same with prime 86243... |
Thanks -- I've been careless, the same bug strikes again!
Anybody can check if he found a misreported prime by searching for fffffffffffffffc in gpuowl.log, e.g. with grep under linux. The fix is underway. Fixed now, please re-checkout. It's fine to upgrade in the middle of an ongoing PRP test, the result will be good. |
[QUOTE=preda;507794]Thanks -- I've been careless, the same bug strikes again!
Anybody can check if he found a misreported prime by searching for fffffffffffffffc in gpuowl.log, e.g. with grep under linux. The fix is underway. Fixed now, please re-checkout. It's fine to upgrade in the middle of an ongoing PRP test, the result will be good.[/QUOTE] Wow thanks for the quick response. I will upgrade ASAP :-) |
[QUOTE=preda;507794]Thanks -- I've been careless, the same bug strikes again!
Anybody can check if he found a misreported prime by searching for fffffffffffffffc in gpuowl.log, e.g. with grep under linux. The fix is underway. Fixed now, please re-checkout. It's fine to upgrade in the middle of an ongoing PRP test, the result will be good.[/QUOTE] The problem is that some exponents have already been computed to completion. I would double or triple check them... |
[QUOTE=SELROC;507798]The problem is that some exponents have already been computed to completion. I would double or triple check them...[/QUOTE]
No need. If the final residue was not 0xfffffffffffffffc (i.e. -3), there is no risk of being prime. Just do this: grep fffffffffffffffc gpuowl.log and only re-check the exponents that produced that as final residue (most probably there are none). |
readymade qa input for worktodo
I strongly recommend running some known-mersenne-prime low exponents before committing to github for the author, or before using for production for the end users.
[url]https://www.mersenneforum.org/showpost.php?p=506082&postcount=3[/url] (And look closely at the results, more closely than I did for v6.1 the first time yesterday!) |
[QUOTE=preda;507806]No need. If the final residue was not 0xfffffffffffffffc (i.e. -3), there is no risk of being prime. Just do this:
grep fffffffffffffffc gpuowl.log and only re-check the exponents that produced that as final residue (most probably there are none).[/QUOTE] And only from the last saved interim iteration count before the previous completion, not the whole computation duration, and which the program will do automatically. (Unless you deleted the interim files yourself: p.owl and p-prev.owl) The interim residues checked out ok, as did the final, in the cases I checked. It's only the very last bit that went wrong. |
| All times are UTC. The time now is 23:12. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.