![]() |
|
|
#3268 |
|
"James Heinrich"
May 2004
ex-Northern Ontario
342110 Posts |
While there are quite a few examples of two factors in the same bitlevel+class, I have found zero examples of even 3 with matching class/bitlevel, so the chance of 10 is not, I think, something we need to worry much about.
|
|
|
|
|
|
#3269 |
|
Romulan Interpreter
Jun 2011
Thailand
72×197 Posts |
When you find 10 with mfaktc tell me and I will search the rest of the class "by q".
![]() Just to make sure none escaped
|
|
|
|
|
|
#3270 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
10101001010102 Posts |
Quote:
There are a few things that could make the server database content lower: 1. I messed up the calculations and overestimated the probabilities. A distinct possibility. 2. Early version software only coped with a small number of factors/bitlevel/class/exponent, such as 2 or 1. I think that was the case, around mfaktc v0.04 or v0.05, per https://mersenneforum.org/showpost.p...&postcount=131, but I don't know when it began to handle more. Possibly as soon as V0.05 mfaktc; https://mersenneforum.org/showpost.p...&postcount=151 Not sure about mfakto or prime95/mprime or whatever else was used before gpu factoring. User TJAOI probably does not change the picture much. 3. Factoring stops after the current bit level or class, after a factor is found for an exponent, so does not find coincident factors that occur in a higher class or bitlevel for the same exponent. This effect would be minor. 4. Whatever I'm not thinking of now. |
|
|
|
|
|
|
#3271 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
Quote:
Stopping or continuing is controlled in mfaktc.ini. # possible values for StopAfterFactor: # 0: Do not stop the current assignment after a factor was found. # 1: When a factor was found for the current assignment stop after the # current bitlevel. This makes only sense when Stages is enabled. # 2: When a factor was found for the current assignment stop after the # current class. # # Default: StopAfterFactor=1 StopAfterFactor=1 |
|
|
|
|
|
|
#3272 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
2×32×7×43 Posts |
Quote:
|
|
|
|
|
|
|
#3273 |
|
Jan 2013
22×17 Posts |
Just got a notebook with a 2060 in it. What version should I grab and from where to get going again?
|
|
|
|
|
|
#3274 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
541810 Posts |
Quote:
|
|
|
|
|
|
|
#3275 |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
2·32·7·43 Posts |
Programmers, please consider modifying mfaktc to allow larger maximum GpuSieveSize than the current maximum 2047 (megabits), and sharing the enhancement.
Numerous gpu models show increased throughput going from GpuSieveSize 1024 to 2047. Even as old and slow as GTX1060. The increase for faster models such as RTX 2080 Super is significant. It appears there is a little more to be gained if the code is modified again to support yet larger values. RTX2080x produce more throughput with multiiple instances, indicating there's more yet to be gained with larger GpuSieveSize. Charts of performance of a single instance versus GpuSieveSize settings show a positive slope near the current maximum, also. Future faster gpus will likely continue the trend of the large impact of larger GpuSieveSize on faster gpus, that we have seen in the gpu models released in the past few years. (RTX3080 is coming...) 2048 x 2^20 =2^31 bit position computation requires computation with unsigned 32 bit integer or larger, but the actual code is signed 32 bit integer, maximum 2^31-1. The RTX2080x would benefit from more GpuSieveSize than the program currently supports. Unsigned 32 bit would be good at 4095 max. But there's code that uses a negative value in the same variable, that would need to be changed. Jumping to 64-bit signed is a possibility but I wonder how much that would impact overall performance. There's no need to support many more bits of GpuSieveSize than are present in gpu vram, typically 4GB = 32Gbits =32768Mbits to 16GB = 128Gbits = 131072Mbits these days. Allow another factor of four to cover the next several years. There may be other smaller limits, such as what the gpu model's OpenCl support is capable of. (I've already seen that on old gpu models at ~1023 or 511 depending on model or perhaps driver version.) For some background, see https://mersenneforum.org/showpost.p...postcount=3202 and related posts. |
|
|
|
|
|
#3276 |
|
"Eric"
Jan 2018
USA
22×53 Posts |
Any builds that support the newest CUDA 11? Hopefully there's something about CUDA 11 that could potentially bring some sort of speed up?
|
|
|
|
|
|
#3277 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
10101001010102 Posts |
Quote:
|
|
|
|
|
|
|
#3278 | |
|
Random Account
Aug 2009
22·3·163 Posts |
Quote:
|
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |