mersenneforum.org gr-mfaktc: a CUDA program for generalized repunits prefactoring
 Register FAQ Search Today's Posts Mark Forums Read

2019-11-03, 14:54   #12
kriesel

"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

1D7C16 Posts

Quote:
 Originally Posted by storm5510 Using the version of mfaktc I had, the 1080 would run around 1050 GHz-d/day. This one was 730 GHz-d/day., more or less. This was with base 2.
So, the ~30% performance loss in gr-mfaktc relative to mfaktc means Mersenne factorers should stick with the mainstream mfaktc.

2019-11-03, 15:02   #13
MrRepunit

Mar 2011
Germany

97 Posts

Quote:
 Originally Posted by kriesel So, the ~30% performance loss in gr-mfaktc relative to mfaktc means Mersenne factorers should stick with the mainstream mfaktc.
For now yes. At a later point I might create a version that uses the original code path for Mersenne primes, probably once I have included negative bases...

 2019-11-05, 13:32 #14 storm5510 Random Account     Aug 2009 Not U. + S.A. 262610 Posts Something I was thinking about late yesterday evening: Below is a line from your work example file in the archive. Code: Factor=base=10,1055167,1,64 Question: Are the start and end bits of the same power, as in 101 to 1064?
2019-11-05, 21:22   #15
MrRepunit

Mar 2011
Germany

11000012 Posts

Quote:
 Question: Are the start and end bits of the same power, as in 101 to 1064?

The search boundaries are always a power of 2.

Last fiddled with by MrRepunit on 2019-11-05 at 21:22

 2019-11-13, 18:49 #16 lalera     Jul 2003 24·3·13 Posts hi, i tried out gr-mfaktc-0.21 with base=6, n=600k to 1000k on a gtx1050ti and the performance is good
2019-11-13, 20:44   #17
MrRepunit

Mar 2011
Germany

97 Posts

Quote:
 Originally Posted by lalera hi, i tried out gr-mfaktc-0.21 with base=6, n=600k to 1000k on a gtx1050ti and the performance is good
Up to 2^64 the performance should be really good because there is a special 64 bit GPU kernel. To get optimal performance you should disable the stages in mfaktc.ini:
Code:
Stages=0
If the desktop should get too unresponsive, then also lower the GPUSieveSize:
Code:
GPUSieveSize=8

Last fiddled with by MrRepunit on 2019-11-13 at 20:44

 2019-11-13, 20:57 #18 lalera     Jul 2003 24·3·13 Posts hi, i think that i need stages=1 because i do use StopAfterFactor=2 in mfaktc.ini
2019-11-16, 15:31   #19
storm5510
Random Account

Aug 2009
Not U. + S.A.

262610 Posts

Quote:
 Originally Posted by lalera hi, i think that i need stages=1 because i do use StopAfterFactor=2 in mfaktc.ini
I have used "Stages" set to zero with "StopAfterFactor" set to 2 on some tests, like the example below.

Code:
Factor=N/A,96751147,73,75
There is no separation, so if it finds a factor, it is done regardless of where it is in the process.

2019-12-13, 14:48   #20
storm5510
Random Account

Aug 2009
Not U. + S.A.

51028 Posts

Quote:
 Originally Posted by MrRepunit ...finally I completed the generalized repunit version of mfaktc. Changes compared to mfaktc-0.21: - implemented factoring of generalized repunits - Removed Barrett and 72 bit kernels - Removed Wagstaff related stuff - Added 64 bit kernels - Compiling with more-classes flag seem to be slightly faster, thus it is switched on - allowed are all bases >= 2, program might crash if base is larger than roughly 100,000 - implemented special cases for bases 2, 3, 5, 6, 7, 8, 10, 11, 12 - dropped lower limit for exponents from 100,000 to 50,000

Question: Would you be willing to do a custom build for a single individual?

2019-12-15, 00:15   #21
MrRepunit

Mar 2011
Germany

6116 Posts

Quote:
 Originally Posted by storm5510 Question: Would you be willing to do a custom build for a single individual?
Hi. Yes, I can do it if the wished-for changes are doable in some shorter time. My guess is that you want to lower the minimal exponent limit. Can do it, but than I cannot promise it still works in all cases. Also the program will waste more time because the presieving depth is affected.
But anyway, let me know what you need...

2019-12-15, 01:18   #22
storm5510
Random Account

Aug 2009
Not U. + S.A.

262610 Posts

Quote:
 Originally Posted by MrRepunit Hi. Yes, I can do it if the wished-for changes are doable in some shorter time. My guess is that you want to lower the minimal exponent limit. Can do it, but than I cannot promise it still works in all cases. Also the program will waste more time because the presieving depth is affected. But anyway, let me know what you need...
Thank you for the reply. This was just a passing thought. As time has gone by, the bottom end seems to have crept up on everything. Many are 100,000, leaving the smaller exponents with ECM and not much more. Then, on only a few programs. The downside is the time required to run the small ones to higher bit sizes.

An example: I have a very old factoring program called Factor5. It uses the CPU only. A while back, I gave it a small exponent, M1619 I believe it was. Start and end bits in the mid 60's. It stayed at 0.000% for 20 minutes or so before it changed. I did the math to 100%. Hundreds of years, or maybe thousands. So, it would not be anywhere near practical to do anything like this, even with a GPU.

 Similar Threads Thread Thread Starter Forum Replies Last Post TheJudger GPU Computing 3625 2023-03-30 00:08 Bdot GPU Computing 1720 2023-02-27 03:10 firejuggler GPU Computing 753 2020-12-12 18:07 fivemack Programming 112 2015-02-12 22:51 xilman Programming 1 2009-11-16 10:26

All times are UTC. The time now is 23:03.

Thu Mar 30 23:03:39 UTC 2023 up 224 days, 20:32, 0 users, load averages: 1.17, 0.85, 0.72