#3400 | |
Aug 2020
100101₂ Posts |
Quote:
processing: TF no-factor for M333930227 (2^78-2^79)
CPU credit is 183.3218 GHz-days.
processing: TF factor 656156919987798312067063 for M333930227 (2^79-2^80) [range fully factored]
CPU credit is 366.6436 GHz-days.

That seems to match now!
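For anyone wanting to sanity-check credit figures like these by hand, here is a minimal sketch. It assumes the credit for one bit level above 2^64 is 0.016968 × 2^(bit-48) × 1680 / exponent GHz-days. That constant and formula are an assumption recalled from the usual TF credit calculation, not something stated in this thread, though they do reproduce the numbers quoted above. The function name is made up for illustration.

```python
def tf_credit_ghz_days(exponent, bit_from, bit_to):
    """Estimated TF credit for taking M(exponent) from 2^bit_from to 2^bit_to.

    ASSUMPTION: constant 0.016968 and the 1680/exponent scaling are recalled,
    not taken from this thread, and are only meant for bit levels of 65+.
    """
    total = 0.0
    for bit in range(bit_from + 1, bit_to + 1):
        # One bit level; each level costs twice as much as the previous one.
        total += 0.016968 * 2 ** (bit - 48) * 1680 / exponent
    return total


if __name__ == "__main__":
    # The two results quoted in the post (183.3218 and 366.6436 GHz-days).
    print(round(tf_credit_ghz_days(333930227, 78, 79), 4))
    print(round(tf_credit_ghz_days(333930227, 79, 80), 4))
```

Each extra bit level doubles the work, which is why the 2^79-2^80 credit is exactly twice the 2^78-2^79 credit.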
#3401 | |
Sep 2011
Germany
2²×3×223 Posts |
Some of our users are running into an error on Windows 7 (NVIDIA GeForce GTX 1060 6GB (4095MB), driver: 436.15, OpenCL: 1.2):
Quote:
Code:
got assignment: exp=140615327 bit_min=72 bit_max=73 (6.80 GHz-days)
Starting trial factoring M140615327 from 2^72 to 2^73 (6.80 GHz-days)
k_min = 16791791417940
k_max = 33583582839939
Using GPU kernel "barrett76_mul32_gs"
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Oct 23 20:27 | 0 0.1% | 7.842 2h05m | 78.07 82485 n.a.%
M140615327 has a factor: 38814612911305349835664385407
ERROR: cudaGetLastError() returned 702: the launch timed out and was terminated

Last fiddled with by rebirther on 2020-10-25 at 07:53
#3402 |
"Viliam Furík"
Jul 2018
Martin, Slovakia
7×47 Posts |
The driver seems to be old. I have 456.71 installed. It may be a problem. That is only a guess.
BTW, I checked the factor, it's not a factor. |
#3403 |
"James Heinrich"
May 2004
ex-Northern Ontario
CAB₁₆ Posts |
Interestingly, it is a factor, but not of M140,615,327. It's actually a composite factor: the product of the two smallest factors of M3,321,928,619.
#3404 | |
"Viliam Furík"
Jul 2018
Martin, Slovakia
7·47 Posts |
![]() Quote:
#3405 | |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
1001011101101₂ Posts |
![]() Quote:
If a factor is reported found that starts 388 and ends 407, check the factor, the logs, the hardware, driver, temperatures, GPU memory reliability, etc.
Since any factor f of a Mersenne number M_p must be of the form f = 2kp + 1, this value can only be a factor of Mersenne numbers whose exponent is one of the prime factors of kp = (f - 1)/2. Put (38814612911305349835664385407-1)/2 in https://www.alpertron.com.ar/ECM.HTM and it yields
19407 306455 652674 917832 192703 = 3^6 × 31081 × 65381 × 3 943673 × 3321 928619.
Code:
########## testcase 1557/2867 ##########
Starting trial factoring M3321928619 from 2^94 to 2^95 (1207701.03 GHz-days)
Using GPU kernel "95bit_mul32_gs"
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Dec 04 09:54 | 237 1.0% | 0.048 n.a. | n.a. 82485 n.a.%
M3321928619 has a factor: 38814612911305349835664385407
found 1 factor for M3321928619 from 2^94 to 2^95 [mfaktc 0.21 95bit_mul32_gs]
selftest for M3321928619 passed!
tf(): total time spent: 0.048s

Starting trial factoring M3321928619 from 2^94 to 2^95 (1207701.03 GHz-days)
Using GPU kernel "95bit_mul32"
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Dec 04 09:54 | 237 1.0% | 0.099 n.a. | n.a. 5000 1.74%
M3321928619 has a factor: 38814612911305349835664385407
found 1 factor for M3321928619 from 2^94 to 2^95 [mfaktc 0.21 95bit_mul32]
selftest for M3321928619 passed!
tf(): total time spent: 0.101s
Code:
batch wrapper reports mfaktc-win-64.exe (re)launch at Mon 05/28/2018 1:16:53.17 count 3 on model gtx480 dev 0
mfaktc v0.20 (64bit built)

Compiletime options
THREADS_PER_BLOCK 256
SIEVE_SIZE_LIMIT 32kiB
SIEVE_SIZE 193154bits
SIEVE_SPLIT 250
MORE_CLASSES enabled

Runtime options
SievePrimes 25000
SievePrimesAdjust 1
SievePrimesMin 5000
SievePrimesMax 100000
NumStreams 3
CPUStreams 3
GridSize 3
GPUSievePrimes 82486
GPUSieveSize 64Mi bits
GPUSieveProcessSize 16Ki bits
WorkFile worktodo.txt
Checkpoints enabled
CheckpointDelay 900s
Stages enabled
StopAfterFactor bitlevel
PrintMode full
V5UserID Kriesel
ComputerID dodo-gtx480-0
ProgressHeader "Date Time | class Pct | time ETA | GHz-d/day Sieve Wait"
ProgressFormat "%d %T | %C %p%% | %t %e | %g %s %W%%"
AllowSleep no
TimeStampInResults yes

CUDA version info
binary compiled for CUDA 6.50
CUDA runtime version 6.50
CUDA driver version 9.10

CUDA device info
name GeForce GTX 480
compute capability 2.0
maximum threads per block 1024
number of multiprocessors 15 (480 shader cores)
clock rate 1451MHz

Automatic parameters
threads per grid 983040

running a simple selftest...
Selftest statistics
number of tests 92
successfull tests 92

selftest PASSED!

got assignment: exp=329000033 bit_min=80 bit_max=81 (744.28 GHz-days)
Starting trial factoring M329000033 from 2^80 to 2^81 (744.28 GHz-days)
k_min = 1837273097800140
k_max = 3674546195606701
Using GPU kernel "barrett87_mul32_gs"

found a valid checkpoint file!
last finished class was: 2391
found 0 factor(s) already

Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
May 28 01:18 | 2392 52.2% | 95.831 12h13m | 698.99 82485 n.a.%
M329000033 has a factor: 38814612911305349835664385407
ERROR: cudaGetLastError() returned 77: an illegal memory access was encountered
batch wrapper reports mfaktc-win-64.exe exited at Mon 05/28/2018 1:18:34.45

Last fiddled with by kriesel on 2020-10-25 at 19:12
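The f = 2kp + 1 reasoning in this post can be checked in a few lines. This is only a sketch: the helper names are made up here, and the list of primes is simply the factorisation of (f - 1)/2 quoted from the alpertron output. A number f divides M_p = 2^p - 1 exactly when 2^p mod f equals 1, which Python's three-argument pow() computes quickly even for exponents this large.

```python
F = 38814612911305349835664385407  # the "388...407" value from the logs

# Distinct primes in the factorisation of (F - 1) / 2 quoted in the post
# (the prime 3 occurs six times there).
PRIMES_OF_KP = [3, 31081, 65381, 3943673, 3321928619]


def has_mersenne_factor_form(f, p):
    """True if f can be written as 2*k*p + 1 for some integer k."""
    return f % (2 * p) == 1


def divides_mersenne(f, p):
    """True if f divides 2^p - 1, tested by modular exponentiation."""
    return pow(2, p, f) == 1


# Exponents (taken from the primes above) for which F really divides M_p.
candidates = [p for p in PRIMES_OF_KP if divides_mersenne(F, p)]

if __name__ == "__main__":
    print("candidate exponents:", candidates)
    print("form ok for M140615327:", has_mersenne_factor_form(F, 140615327))
    print("divides M140615327:", divides_mersenne(F, 140615327))
```

The cheap form check already rules out M140615327, since that exponent is not among the prime factors of (f - 1)/2; the modular exponentiation then confirms which exponent the value actually belongs to.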
#3406 |
"James Heinrich"
May 2004
ex-Northern Ontario
3·23·47 Posts |
Ah, that at least explains how it could come up with a Mersenne factor, albeit for the wrong exponent. I guess the code really should (but doesn't) reset the factor variable when starting a new exponent. So when the "found a factor" code block gets incorrectly triggered by a hardware/driver error, it uses the last value, left over from the quick self-test.
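To make that guess concrete, here is a toy model of the suspected stale-value behaviour. It is purely illustrative and is not mfaktc's actual code: the `ResultBuffer` class, its fields and `run_job` are all hypothetical stand-ins for whatever the program uses to pass results back from the GPU.

```python
from dataclasses import dataclass, field

SELFTEST_FACTOR = 38814612911305349835664385407


@dataclass
class ResultBuffer:
    """Hypothetical stand-in for a result buffer reused between jobs."""
    count: int = 0
    factors: list = field(default_factory=list)


def run_job(buf, found, reset_first):
    """Simulate one job and return the factors that would be reported.

    `found` is what a healthy kernel would write into the buffer;
    None simulates a failed launch that writes nothing at all.
    """
    if reset_first:
        # The fix being suggested: clear leftovers before each exponent.
        buf.count = 0
        buf.factors.clear()
    if found is not None:
        buf.count = len(found)
        buf.factors[:] = found
    # Reporting step: trusts whatever is currently in the buffer.
    return list(buf.factors[:buf.count])


if __name__ == "__main__":
    buf = ResultBuffer()
    run_job(buf, [SELFTEST_FACTOR], reset_first=True)   # quick self-test
    print(run_job(buf, None, reset_first=False))        # failed launch, no reset
    print(run_job(buf, None, reset_first=True))         # failed launch, with reset
```

Without the reset, the failed job reports the self-test's leftover value; with it, nothing is reported. Resetting would only hide the bogus factor line, of course; the underlying launch error would still need attention.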
#3407 |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
4845₁₀ Posts |
The known, usually bad factor 388...407 has come up in discussion about once a year since at least 2017. From my recently updated mfaktc thread notes file:
posts 2787 - 2806 2017-12-31 https://www.mersenneforum.org/showpo...postcount=2787 false factor report 38814612911305349835664385407 and ensuing discussion including ways to attempt eliminating its appearance
posts 2824-2827 July 2018 reproducible false factor; bad gpu ram https://www.mersenneforum.org/showpo...postcount=2824
post 3167 2019-07-02 the usual false factor is (also) seen to correlate with Windows TDRs https://www.mersenneforum.org/showpo...postcount=3167 (Check Windows system event log for Windows TDR events)
post 3177 2019-07-21 TheJudger able to reproduce the issue https://www.mersenneforum.org/showpo...postcount=3177
And now, also posts 3401-3406+, Oct 2020.
In a nutshell, the usual suspects are: cooling/temperatures; bad gpu ram; other hardware problems; gpu too slow; default or inadequate values in Windows TDR related registry entries.

Last fiddled with by kriesel on 2020-10-25 at 20:50
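Since this keeps recurring, a small log screen may save some head-scratching. The following is a rough sketch, not an existing tool: it assumes factor reports appear in the output as "M<exponent> has a factor: <factor>" (the format seen in the logs in this thread), and the function and pattern names are invented for illustration.

```python
import re

# Matches lines such as "M140615327 has a factor: 388...407".
FACTOR_LINE = re.compile(r"M(\d+) has a factor: (\d+)")


def check_reported_factors(log_text):
    """Return a list of (exponent, factor, is_valid) for every factor line.

    is_valid is True only if the factor really divides 2^exponent - 1.
    """
    results = []
    for match in FACTOR_LINE.finditer(log_text):
        p, f = int(match.group(1)), int(match.group(2))
        results.append((p, f, pow(2, p, f) == 1))
    return results


if __name__ == "__main__":
    sample = (
        "M140615327 has a factor: 38814612911305349835664385407\n"
        "ERROR: cudaGetLastError() returned 702: the launch timed out\n"
        "M3321928619 has a factor: 38814612911305349835664385407\n"
    )
    for p, f, ok in check_reported_factors(sample):
        print(f"M{p}: {f} -> {'valid' if ok else 'BOGUS, check hardware/driver/TDR'}")
```

Any line flagged as bogus is a cue to go through the list above before submitting results.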
#3408 |
Aug 2020
2·3·19 Posts |
Is there somewhere I can find a compiled mfaktc binary for CUDA 11 (or a convenient way to compile it)? I'm using Google colab and recently it started giving me runtimes with CUDA 11.
Code:
CUDA version info
binary compiled for CUDA 10.10
CUDA runtime version 0.75
CUDA driver version 11.10
#3409 |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
3×5×17×19 Posts |
Would someone please post CUDA11 mfaktc (2047 limit or higher GpuSieveSize; at least more-classes) compiled packages for Ubuntu Linux or Windows 7-10? I haven't seen any of either.
TheJudger explains how to compile for CUDA10, and how to set up for that, in post 2910. That seems a likely good starting point for tackling CUDA11. There's also:
https://www.mersenneforum.org/showpo...postcount=3086 to 3088, nomead getting his system to compile mfaktc in Windows
https://www.mersenneforum.org/showpo...postcount=3088 updated build process by nomead
Once posted, James Heinrich may add them to the mersenne.ca download mirror.
#3410 |
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
3×5×17×19 Posts |
This slow gpu came with one of the used systems I bought. I check out any model I get access to, and this one was interesting in what it revealed about a possible source of the known false factor occurrence.
The ancient slow NVIDIA NVS295 with Windows default TDR settings (note also the bogus 200 GHz-d/day indication):
Code:
batch wrapper logs (re)launch of EAGLET mfaktc quadro nvs295 at Tue 07/02/2019 20:45:32.15
mfaktc v0.21 (64bit built)

Compiletime options
THREADS_PER_BLOCK 256
SIEVE_SIZE_LIMIT 32kiB
SIEVE_SIZE 193154bits
SIEVE_SPLIT 250
MORE_CLASSES enabled

Runtime options
SievePrimes 25000
SievePrimesAdjust 1
SievePrimesMin 5000
SievePrimesMax 100000
NumStreams 3
CPUStreams 3
GridSize 3
GPU Sieving enabled
GPUSievePrimes 82486
GPUSieveSize 64Mi bits
GPUSieveProcessSize 16Ki bits
Checkpoints enabled
CheckpointDelay 900s
WorkFileAddDelay 3600s
Stages enabled
StopAfterFactor bitlevel
PrintMode full
V5UserID kriesel
ComputerID eaglet-nvs295
AllowSleep no
TimeStampInResults yes

CUDA version info
binary compiled for CUDA 6.50
CUDA runtime version 6.50
CUDA driver version 6.50

CUDA device info
name Quadro NVS 295
compute capability 1.1
max threads per block 512
max shared memory per MP 16384 byte
number of multiprocessors 1
CUDA cores per MP 8
CUDA cores - total 8
clock rate (CUDA cores) 1300MHz
memory clock rate: 695MHz
memory bus width: 64 bit

Automatic parameters
threads per grid 1048576
GPUSievePrimes (adjusted) 82486
GPUsieve minimum exponent 1055144

running a simple selftest...
Selftest statistics
number of tests 107
successfull tests 107

selftest PASSED!

got assignment: exp=119998999 bit_min=72 bit_max=73 (7.97 GHz-days)
Starting trial factoring M119998999 from 2^72 to 2^73 (7.97 GHz-days)
k_min = 19676691147960
k_max = 39353382296711
Using GPU kernel "barrett76_mul32_gs"
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Jul 02 20:46 | 0 0.1% | 3.548 56m43s | 202.20 82485 n.a.%
M119998999 has a factor: 38814612911305349835664385407
ERROR: cudaGetLastError() returned 30: unknown error
at Tue 07/02/2019 20:46:14.24mfaktc quadro nvs295 exit logged by batch wrapper
Code:
selftest PASSED!

got assignment: exp=119998999 bit_min=72 bit_max=73 (7.97 GHz-days)
Starting trial factoring M119998999 from 2^72 to 2^73 (7.97 GHz-days)
k_min = 19676691147960
k_max = 39353382296711
Using GPU kernel "barrett76_mul32_gs"
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Jul 02 21:05 | 0 0.1% | 411.97 4d13h | 1.74 82485 n.a.%
Jul 02 21:11 | 5 0.2% | 411.85 4d13h | 1.74 82485 n.a.%
Jul 02 21:18 | 9 0.3% | 411.49 4d13h | 1.74 82485 n.a.%
Jul 02 21:25 | 12 0.4% | 411.39 4d13h | 1.74 82485 n.a.%
Jul 02 21:32 | 20 0.5% | 411.78 4d13h | 1.74 82485 n.a.%
Jul 02 21:39 | 21 0.6% | 412.10 4d13h | 1.74 82485 n.a.%
Jul 02 21:46 | 29 0.7% | 411.38 4d12h | 1.74 82485 n.a.%
Jul 02 21:53 | 32 0.8% | 410.54 4d12h | 1.75 82485 n.a.%
Jul 02 21:59 | 36 0.9% | 410.11 4d12h | 1.75 82485 n.a.%
Jul 02 22:06 | 41 1.0% | 410.09 4d12h | 1.75 82485 n.a.%
Jul 02 22:13 | 44 1.1% | 410.10 4d12h | 1.75 82485 n.a.%
Jul 02 22:20 | 56 1.3% | 411.00 4d12h | 1.75 82485 n.a.%
Jul 02 22:27 | 57 1.4% | 412.03 4d12h | 1.74 82485 n.a.%
Jul 02 22:34 | 60 1.5% | 411.38 4d12h | 1.74 82485 n.a.%
Jul 02 22:41 | 65 1.6% | 410.46 4d11h | 1.75 82485 n.a.%
Jul 02 22:47 | 69 1.7% | 410.11 4d11h | 1.75 82485 n.a.%
Jul 02 22:54 | 77 1.8% | 410.10 4d11h | 1.75 82485 n.a.%
Jul 02 23:01 | 81 1.9% | 410.11 4d11h | 1.75 82485 n.a.%
Jul 02 23:08 | 84 2.0% | 410.12 4d11h | 1.75 82485 n.a.%
Jul 02 23:15 | 89 2.1% | 410.07 4d11h | 1.75 82485 n.a.%
Date Time | class Pct | time ETA | GHz-d/day Sieve Wait
Jul 02 23:22 | 92 2.2% | 410.44 4d11h | 1.75 82485 n.a.%
Jul 02 23:28 | 96 2.3% | 411.38 4d11h | 1.74 82485 n.a.%
Jul 02 23:35 | 104 2.4% | 410.48 4d10h | 1.75 82485 n.a.%
Jul 02 23:42 | 117 2.5% | 410.10 4d10h | 1.75 82485 n.a.%
Jul 02 23:49 | 120 2.6% | 410.09 4d10h | 1.75 82485 n.a.%
Jul 02 23:56 | 125 2.7% | 410.10 4d10h | 1.75 82485 n.a.%
...
Jul 07 10:07 | 4601 99.8% | 410.11 13m40s | 1.75 82485 n.a.%
Jul 07 10:14 | 4605 99.9% | 410.09 6m50s | 1.75 82485 n.a.%
Jul 07 10:21 | 4617 100.0% | 411.44 0m00s | 1.74 82485 n.a.%
no factor for M119998999 from 2^72 to 2^73 [mfaktc 0.21 barrett76_mul32_gs]
tf(): total time spent: 4d 13h 23m 8.488s

For more info on TDR, see https://docs.nvidia.com/gameworks/co...n_recovery.htm and https://www.mersenneforum.org/showpo...3&postcount=10
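To see which TDR values a Windows box is actually running with, something like the following can be used. It is a rough sketch, assuming the timeout settings live under HKLM\SYSTEM\CurrentControlSet\Control\GraphicsDrivers as TdrLevel, TdrDelay and TdrDdiDelay (the value names documented by Microsoft); the function name is invented here. It only reads the registry and changes nothing.

```python
import sys

TDR_KEY = r"SYSTEM\CurrentControlSet\Control\GraphicsDrivers"
TDR_VALUES = ("TdrLevel", "TdrDelay", "TdrDdiDelay")


def read_tdr_settings(names=TDR_VALUES):
    """Return {name: value or None} for the TDR registry values.

    None means the value is not set (so the Windows default applies),
    the key could not be opened, or this is not a Windows system.
    """
    settings = {name: None for name in names}
    if sys.platform != "win32":
        return settings
    import winreg  # standard library, Windows only
    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, TDR_KEY) as key:
            for name in names:
                try:
                    value, _value_type = winreg.QueryValueEx(key, name)
                    settings[name] = value
                except FileNotFoundError:
                    pass  # value absent: default in effect
    except OSError:
        pass  # key missing or not readable
    return settings


if __name__ == "__main__":
    for name, value in read_tdr_settings().items():
        print(f"{name}: {'(not set, default applies)' if value is None else value}")
```

If TdrDelay comes back unset, the default of 2 seconds is in effect, which a GPU as slow as the NVS 295 can easily exceed.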