![]() |
|
|
#2828 |
|
Sep 2003
5×11×47 Posts |
If anyone here has used mfaktc to search for factors of Wagstaff numbers (2^p + 1)/3 and if you have conserved log files or lists of factors that you're willing to share, let me know.
I've already asked in Wagstaff-related threads, and tried contacting some of the folks who were active when earlier searches were being done around 2013. I just thought I'd inquire here too. |
|
|
|
|
|
#2829 | |
|
Sep 2003
5×11×47 Posts |
Quote:
Same specs as you listed for the Tesla V100-PCIE-16GB except a slightly faster clock rate: Code:
clock rate (CUDA cores) 1530MHz mfaktc passes all the Mersenne self tests. However, when I compile an alternate version with -DWAGSTAFF added to CFLAGS, it fails all the Wagstaff self tests. Did you try the Wagstaff self tests on your V100 and do they work for you? Is anything more needed to create a Wagstaff version, other than adding the -DWAGSTAFF flag in CFLAGS? The compilation uses gcc 4.8 |
|
|
|
|
|
|
#2830 | ||
|
"Oliver"
Mar 2005
Germany
11×101 Posts |
Hello,
Quote:
Quote:
Oliver |
||
|
|
|
|
|
#2831 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
22·5·271 Posts |
While looking for something else, I stumbled across this:
The source of parse.c for CUDAPm1 indicates # or \\ or / are comment characters marking the rest of a worktodo line as a comment I've confirmed by test in mfaktc that \\ worked; # or / did not work in my test, which placed them mostly at the beginnings of records. I could tell by the line number in any warning messages which did or did not work. The capability is not present in the readme.txt (yet) that I recall. Last fiddled with by kriesel on 2018-08-02 at 20:38 |
|
|
|
|
|
#2832 |
|
"Mihai Preda"
Apr 2015
3×457 Posts |
I have a question, maybe somebody knows, how do the mfaktc and mfakto codebases compare?
I think at some point in history, mfakto was inspired by mfaktc. But in the interleaving years, how did they diverge? do they have now any different capabilities? or different self-test data sets? (aside from targeting different platforms, CUDA vs. OpenCL). |
|
|
|
|
|
#2833 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
22·5·271 Posts |
Quote:
Per http://www.mersenneforum.org/showpos...91&postcount=2 Mfaktc max bit depth 95, mfakto 92. Minimum exponent may vary. Comparing their respective readme files and bug and wish lists may show some other differences. Mfaktc bug and wish list http://www.mersenneforum.org/showpos...21&postcount=3 Mfakto bug and wish list http://www.mersenneforum.org/showpos...37&postcount=3 Some client management software supports mfaktc or mfakto, typically not both. http://www.mersenneforum.org/showpos...92&postcount=3 (All the above, and more, are periodically updated in place, as part of the mersenne-gpu-computing-oriented reference material I've been accumulating at http://www.mersenneforum.org/forumdisplay.php?f=154) And of course, there's comparing the source code in the portions that are not CUDA or OpenCl specific. Mfaktc self-test:tests multiple kernels per testcase Code:
########## testcase 1/2867 ########## ... Selftest statistics number of tests 26192 successfull tests 26192 kernel | success | fail -------------------+---------+------- UNKNOWN kernel | 0 | 0 71bit_mul24 | 2586 | 0 75bit_mul32 | 2682 | 0 95bit_mul32 | 2867 | 0 barrett76_mul32 | 1096 | 0 barrett77_mul32 | 1114 | 0 barrett79_mul32 | 1153 | 0 barrett87_mul32 | 1066 | 0 barrett88_mul32 | 1069 | 0 barrett92_mul32 | 1084 | 0 75bit_mul32_gs | 2420 | 0 95bit_mul32_gs | 2597 | 0 barrett76_mul32_gs | 1079 | 0 barrett77_mul32_gs | 1096 | 0 barrett79_mul32_gs | 1130 | 0 barrett87_mul32_gs | 1044 | 0 barrett88_mul32_gs | 1047 | 0 barrett92_mul32_gs | 1062 | 0 selftest PASSED! Code:
Started a simple selftest ... ######### testcase 1/30 (M1031831[63-64]) ######### ######### testcase 2/30 (M51332417[68-69]) ######### ######### testcase 3/30 (M50896831[69-70]) ######### ######### testcase 4/30 (M50979079[70-71]) ######### ######### testcase 5/30 (M51232133[71-72]) ######### ######### testcase 6/30 (M50830523[71-72]) ######### ######### testcase 7/30 (M50752613[72-73]) ######### ######### testcase 8/30 (M51507913[72-73]) ######### ######### testcase 9/30 (M51916901[73-74]) ######### ######### testcase 10/30 (M51157933[74-75]) ######### ######### testcase 11/30 (M51308501[75-76]) ######### ######### testcase 12/30 (M51671491[75-76]) ######### ######### testcase 13/30 (M50805581[77-78]) ######### ######### testcase 14/30 (M51157429[78-79]) ######### ######### testcase 15/30 (M51406151[78-79]) ######### ######### testcase 16/30 (M51478381[79-80]) ######### ######### testcase 17/30 (M51350527[80-81]) ######### ######### testcase 18/30 (M53061139[80-81]) ######### ######### testcase 19/30 (M48629519[81-82]) ######### ######### testcase 20/30 (M51752893[83-84]) ######### ######### testcase 21/30 (M51760133[83-84]) ######### ######### testcase 22/30 (M51090757[84-85]) ######### ######### testcase 23/30 (M51050171[84-85]) ######### ######### testcase 24/30 (M50989481[86-87]) ######### ######### testcase 25/30 (M50856937[86-87]) ######### ######### testcase 26/30 (M53065231[88-89]) ######### ######### testcase 27/30 (M3321929777[63-64]) ######### ######### testcase 28/30 (M3321930841[63-64]) ######### ######### testcase 29/30 (M55069117[64-65]) ######### ######### testcase 30/30 (M45448679[81-82]) ######### Selftest statistics number of tests 30 successful tests 30 Code:
######### testcase 1/34071 (M67094119[81-82]) ######### ... ######### testcase 34071/34071 (M112404491[91-92]) ######### Starting trial factoring M112404491 from 2^91 to 2^92 (4461450.54GHz-days) Date Time | class Pct | time ETA | GHz-d/day Sieve Wait Jan 16 20:34 | 1848 0.1% | 0.124 n.a. | n.a. 81206 0.00% M112404491 has a factor: 3941616367695054034124905537 (91.670846 bits, 2992945.937358 GHz-d) found 1 factor for M112404491 from 2^91 to 2^92 [mfakto 0.15pre6-Win cl_barrett32_92_gs_2] selftest for M112404491 passed (cl_barrett32_92_gs)! tf(): total time spent: 0.124s Selftest statistics number of tests 34026 successful tests 34026 selftest PASSED! Last fiddled with by kriesel on 2018-08-06 at 15:06 |
|
|
|
|
|
|
#2834 |
|
"Mihai Preda"
Apr 2015
3·457 Posts |
I have been playing with some GPU sieving code, similar to the GPU sieve used by mfaktc and mafkto.
The sieve works in the usual way: for each prime P from a set of primes, compute the initial "bit-to-clear" for a given exponent E and K (q = 2*E*K+1), and then mark off bits at every P step starting with the bit-to-clear. Is there some (mathematical) reason for the number of survivors of this kind of sieve to slightly decrease as K grows? In other words, are there slightly fewer candidates surviving the sieve when the bit-level of K grows? (I know that the number of actual primes does decrease as K grows, but is this fact reflected at all when sieving with the technique above?) |
|
|
|
|
|
#2835 | |
|
Jun 2003
2·3·7·112 Posts |
Quote:
This is assuming you're sieving with fewer primes than sqrt(candidate). |
|
|
|
|
|
|
#2836 |
|
"Mihai Preda"
Apr 2015
3·457 Posts |
That's what I thought. Need to find the bug that generates the observed behavior then..
|
|
|
|
|
|
#2837 | |
|
"Mihai Preda"
Apr 2015
55B16 Posts |
Quote:
If it's sqrt(k), this may be it. If I sieve with primes up to 2^23, exponent 2^28, then TF under 75bits would have slightly reduced filtering. |
|
|
|
|
|
|
#2838 | |
|
Jun 2003
2×3×7×112 Posts |
Quote:
Can you provide some stats as to the pattern you're observing (of the fraction of survivors)? |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |