![]() |
[QUOTE=Prime95;557702]And another proof failure [url]http://mersenne.org/M108979987[/url][/QUOTE]
I'll look into hardening the proof generation over the following days. |
Why does it ignore
-log 50000 in config.txt.? I still get progress updats every 10000. [CODE]-user petrw1 -cpu colab -device 0 -log 50000 -maxAlloc 4000[/CODE] |
[QUOTE=petrw1;557810]Why does it ignore
-log 50000 in config.txt.? I still get progress updats every 10000. [CODE]-user petrw1 -cpu colab -device 0 -log 50000 -maxAlloc 4000[/CODE][/QUOTE] For what task? P-1 is special, and is being reworked anyway. |
[QUOTE=preda;557815]For what task? P-1 is special, and is being reworked anyway.[/QUOTE]
yes P1 thx |
1 Attachment(s)
[QUOTE=Xyzzy][CODE]2020-06-05 17:13:16 gfx1012-0 OpenCL compilation in 3.10 s
2020-06-05 17:13:17 Radeon Pro W5500-0 77936867 OK 0 loaded: blockSize 400, 0000000000000003 2020-06-05 17:13:21 Radeon Pro W5500-0 77936867 OK 800 0.00%; 2982 us/it; ETA 2d 16:34; 1579c241dc63eca6 (check 1.27s) 2020-06-05 17:23:18 Radeon Pro W5500-0 77936867 OK 200000 0.26%; 2991 us/it; ETA 2d 16:35; f0b04b45b0855bd2 (check 1.28s) 2020-06-05 17:33:15 Radeon Pro W5500-0 77936867 OK 400000 0.51%; 2979 us/it; ETA 2d 16:10; c03f94396a5aa29e (check 1.27s) 2020-06-05 17:43:17 Radeon Pro W5500-0 77936867 OK 600000 0.77%; 3004 us/it; ETA 2d 16:32; b9decd65ca71b629 (check 1.28s) 2020-09-04 13:24:28 GeForce GTX 1080 Ti-0 OpenCL compilation in 2.02 s 2020-09-04 13:24:29 GeForce GTX 1080 Ti-0 77936867 OK 0 loaded: blockSize 400, 0000000000000003 2020-09-04 13:24:32 GeForce GTX 1080 Ti-0 77936867 OK 800 0.00%; 2481 us/it; ETA 2d 05:43; 1579c241dc63eca6 (check 1.04s) 2020-09-04 13:32:54 GeForce GTX 1080 Ti-0 77936867 OK 200000 0.26%; 2514 us/it; ETA 2d 06:18; f0b04b45b0855bd2 (check 1.04s) 2020-09-04 13:41:12 GeForce GTX 1080 Ti-0 77936867 OK 400000 0.51%; 2483 us/it; ETA 2d 05:29; c03f94396a5aa29e (check 1.05s) 2020-09-04 13:49:27 GeForce GTX 1080 Ti-0 77936867 OK 600000 0.77%; 2473 us/it; ETA 2d 05:07; b9decd65ca71b629 (check 1.06s) 2020-09-04 17:42:56 GeForce GTX 980 Ti-0 OpenCL compilation in 1.83 s 2020-09-04 17:42:58 GeForce GTX 980 Ti-0 77936867 OK 0 loaded: blockSize 400, 0000000000000003 2020-09-04 17:43:04 GeForce GTX 980 Ti-0 77936867 OK 800 0.00%; 4221 us/it; ETA 3d 19:23; 1579c241dc63eca6 (check 1.73s) 2020-09-04 17:57:13 GeForce GTX 980 Ti-0 77936867 OK 200000 0.26%; 4258 us/it; ETA 3d 19:56; f0b04b45b0855bd2 (check 1.75s) 2020-09-04 18:11:28 GeForce GTX 980 Ti-0 77936867 OK 400000 0.51%; 4263 us/it; ETA 3d 19:49; c03f94396a5aa29e (check 1.75s) 2020-09-04 18:25:42 GeForce GTX 980 Ti-0 77936867 OK 600000 0.77%; 4262 us/it; ETA 3d 19:34; b9decd65ca71b629 (check 1.75s)[/CODE][/QUOTE][URL="https://pcpartpicker.com/product/FYrYcf/nvidia-geforce-rtx-2060-super-8-gb-founders-edition-video-card-900-1g160-2565-000"]2060 Super[/URL] [CODE]2020-09-25 01:11:43 GeForce RTX 2060 SUPER-0 OpenCL compilation in 1.35 s 2020-09-25 01:11:45 GeForce RTX 2060 SUPER-0 77936867 OK 0 loaded: blockSize 400, 0000000000000003 2020-09-25 01:11:50 GeForce RTX 2060 SUPER-0 77936867 OK 800 0.00%; 3966 us/it; ETA 3d 13:51; 1579c241dc63eca6 (check 1.65s) 2020-09-25 01:34:08 GeForce RTX 2060 SUPER-0 77936867 OK 200000 0.26%; 5188 us/it; ETA 4d 16:01; f0b04b45b0855bd2 (check 2.18s) 2020-09-25 01:51:15 GeForce RTX 2060 SUPER-0 77936867 OK 400000 0.51%; 5123 us/it; ETA 4d 14:21; c03f94396a5aa29e (check 2.12s) 2020-09-25 02:08:30 GeForce RTX 2060 SUPER-0 77936867 OK 600000 0.77%; 5164 us/it; ETA 4d 14:56; b9decd65ca71b629 (check 2.13s)[/CODE]With a heavy load like FurMark this GPU draws 175W, but with gpuowl it uses only 70W. We tried running two instances and it didn't change. Are we doing something wrong? |
[QUOTE=Prime95;557230]The second proof failure is here: [URL="https://www.mersenne.org/report_exponent/?exp_lo=108979853&exp_hi=&full=1"]108979853[/URL]
I do not know if Bruno Victal is a forum member and can comment on what may have happened.[/QUOTE] Residue matched. The certificate is not yet certified, so my guess is it was either an error in CERT-generation process in original run (most likely), or if my CERT turns out to be not good either, it could be some weird bug in both programs (highly unlikely). |
[QUOTE=Xyzzy;557835][URL="https://pcpartpicker.com/product/FYrYcf/nvidia-geforce-rtx-2060-super-8-gb-founders-edition-video-card-900-1g160-2565-000"]2060 Super[/URL]
[CODE]2020-09-25 01:11:43 GeForce RTX 2060 SUPER-0 OpenCL compilation in 1.35 s 2020-09-25 01:11:45 GeForce RTX 2060 SUPER-0 77936867 OK 0 loaded: blockSize 400, 0000000000000003 2020-09-25 01:11:50 GeForce RTX 2060 SUPER-0 77936867 OK 800 0.00%; 3966 us/it; ETA 3d 13:51; 1579c241dc63eca6 (check 1.65s) 2020-09-25 01:34:08 GeForce RTX 2060 SUPER-0 77936867 OK 200000 0.26%; 5188 us/it; ETA 4d 16:01; f0b04b45b0855bd2 (check 2.18s) 2020-09-25 01:51:15 GeForce RTX 2060 SUPER-0 77936867 OK 400000 0.51%; 5123 us/it; ETA 4d 14:21; c03f94396a5aa29e (check 2.12s) 2020-09-25 02:08:30 GeForce RTX 2060 SUPER-0 77936867 OK 600000 0.77%; 5164 us/it; ETA 4d 14:56; b9decd65ca71b629 (check 2.13s)[/CODE]With a heavy load like FurMark this GPU draws 175W, but with gpuowl it uses only 70W. We tried running two instances and it didn't change. Are we doing something wrong?[/QUOTE] RTX20xx is much more productively used in TF than in any DP related computation (LL, PRP, P-1) because of its extremely high SP/DP ratio. 77936867 is a composite exponent [URL="https://www.alpertron.com.ar/ECM.HTM"]77 936867 = 1447 × 53861[/URL], so has known factors, which means there's little point in making that PRP run. I haven't tried much DP on RTX20xx, because the extreme SP/DP ratio would make it sort of a waste of the gpu's capability, but do run DP often on GTX10xx, and see power differences in GTX10xx depending on whether the run is TF (high power, in one case more than the system can handle) or DP dominant (less power, runs on a system that can't handle the TF power load of a GTX1080). GTX10xx SP/DP ratio is large but not as extreme as for RTX20xx. So it seems plausible the SP,DP power difference could be more significant on the RTX20xx gpus. |
[QUOTE=preda;471318]I recently understood how to implement a "Fast Galois Transform" (FGT) which is simply complex arithmetic with integers modulo some number M.
I had hope that this integer-only transform may be faster on the GPU because it does not use double-precision floating point (which is slow on commodity GPUs). So I had fun and implemented FGT modulo M(31)=2^31-1 and modulo M(61)=2^61-1. Unfortunately the hoped performance gain was not there, but it was a very cool exercise nevertheless. Anyway, now it's possible to select among these 4 transforms: -fft DP : the old double precision floating point -fft SP : simple precision FP -fft M61 : FGT(M61) -fft M31 : FGT(M31) Of these, SP is very fast but also useless at 2M FFT-size and up (it may prove useful for something at lower FFT sizes). M31 has about 5 bits-per-word usable at 4M FFT size. It's not much use by itself, but can be tested. M61 has deeper word bits than DP. So it can be used for real work. Unfortunately it's also slower than DP. Part of the slowness may be from poor compiler optimizations and that aspect may improve in the future, hopefully. I updated the savefile format to save "compacted" bits now. That means that it's possible to change the FFT size (among 2M, 4M, 8M) or the FFT kind (e.g. switching between DP and M61) in the middle of a test, and everything should work fine (assuming enough usable bits are available; otherwise the "Gerbicz check" which is used with all the transforms will catch it). It's nice that adding the FGTs was done with very little additional code compared to DP-FFT-only -- most of the code is common. Also, a dynamic step of the Gerbicz verification is implemented, which starts with a very small step of 2K iterations at program start (allowing the user to see that the program functions correctly) and ramps up towards 500K if no errors are encountered, or back down if errors are detected. Anyway, if anybody wants to play with pure-integer convolutions on the GPU (for the limited FFT sizes of 2M/4M/8M), the code is here: [URL]https://github.com/preda/gpuowl[/URL][/QUOTE] Is it possible to test SP in gpuowl now? I will soon bought nvidia RTX 3090 and want to test whether 3090 generate good results I want to test dp,sp and int32(maybe using M31), is it possible using gpuowl to perform such test? anyway, thanks for your great program! |
[QUOTE=kriesel;557845]77936867 is a composite exponent [URL="https://www.alpertron.com.ar/ECM.HTM"]77 936867 = 1447 × 53861[/URL], so has known factors, which means there's little point in making that PRP run.[/QUOTE]We just use that exponent for benchmarking purposes. As you mentioned, it does much better with trial factoring.[CODE]Sep 25 13:32 | 3664 79.6% | 0.381 1m15s | 2150.15 75061 n.a.%
Sep 25 13:32 | 3669 79.7% | 0.375 1m13s | 2184.55 75061 n.a.% Sep 25 13:32 | 3676 79.8% | 0.376 1m13s | 2178.74 75061 n.a.% Sep 25 13:32 | 3681 79.9% | 0.374 1m12s | 2190.39 75061 n.a.% Sep 25 13:32 | 3684 80.0% | 0.376 1m12s | 2178.74 75061 n.a.% Sep 25 13:32 | 3685 80.1% | 0.376 1m12s | 2178.74 75061 n.a.% Sep 25 13:32 | 3696 80.2% | 0.380 1m12s | 2155.81 75061 n.a.% Sep 25 13:32 | 3697 80.3% | 0.378 1m11s | 2167.21 75061 n.a.% Sep 25 13:32 | 3709 80.4% | 0.375 1m11s | 2184.55 75061 n.a.% Sep 25 13:32 | 3712 80.5% | 0.376 1m10s | 2178.74 75061 n.a.%[/CODE] |
[QUOTE=kriesel;544438]Congratulations, you can apparently run mfakto on the hd630 because the Intel OpenCL is working. (But it does not have DP and OpenCL2.0, which gpuowl requires.)[/QUOTE]Somehow we got gpuowl to work on our integrated graphics. (It runs so slow that it isn't worth doing, but it is neat that it works at all!)[CODE]2020-09-25 13:26:07 Intel(R) UHD Graphics 630-1 OpenCL compilation in 6.71 s
2020-09-25 13:26:21 Intel(R) UHD Graphics 630-1 77936867 OK 0 loaded: blockSize 400, 0000000000000003 2020-09-25 13:26:56 Intel(R) UHD Graphics 630-1 77936867 OK 800 0.00%; 29530 us/it; ETA 26d 15:17; 1579c241dc63eca6 (check 11.88s) 2020-09-25 15:07:23 Intel(R) UHD Graphics 630-1 77936867 OK 200000 0.26%; 30198 us/it; ETA 27d 04:05; f0b04b45b0855bd2 (check 11.97s)[/CODE] |
[QUOTE=Neutron3529;557865]Is it possible to test SP in gpuowl now?
I will soon bought nvidia RTX 3090 and want to test whether 3090 generate good results I want to test dp,sp and int32(maybe using M31), is it possible using gpuowl to perform such test? anyway, thanks for your great program![/QUOTE] The RTX 3090 is not much better than the 3080. Unless you want the absolute best performance in a single card, the 3080 is a vastly better currency/performance card. |
| All times are UTC. The time now is 22:38. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.