mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2020-05-18, 19:55   #3268
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

55 Posts
Default

Quote:
Originally Posted by kriesel View Post
I compute the odds of 6 factors per bit level and exponent to be rather small
While there are quite a few examples of two factors in the same bitlevel+class, I have found zero examples of even 3 with matching class/bitlevel, so the chance of 10 is not, I think, something we need to worry much about.
James Heinrich is offline   Reply With Quote
Old 2020-05-19, 14:00   #3269
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

100010101001112 Posts
Default

When you find 10 with mfaktc tell me and I will search the rest of the class "by q".
Just to make sure none escaped
LaurV is offline   Reply With Quote
Old 2020-05-19, 21:56   #3270
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

120B16 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
While there are quite a few examples of two factors in the same bitlevel+class, I have found zero examples of even 3 with matching class/bitlevel, so the chance of 10 is not, I think, something we need to worry much about.
In a spreadsheet I estimated odds for n per bitlevel/class/exponent, for n from 0 to 6, without regard to whether a fewer number might be found at a lower bitlevel or class, and concluded more than 5 is improbable, based on 40 bits to 92 bits sums. https://www.mersenneforum.org/showpo...82&postcount=5

There are a few things that could make the server database content lower:

1. I messed up the calculations and overestimated the probabilities. A distinct possibility.

2. Early version software only coped with a small number of factors/bitlevel/class/exponent, such as 2 or 1. I think that was the case, around mfaktc v0.04 or v0.05, per https://mersenneforum.org/showpost.p...&postcount=131, but I don't know when it began to handle more. Possibly as soon as V0.05 mfaktc; https://mersenneforum.org/showpost.p...&postcount=151
Not sure about mfakto or prime95/mprime or whatever else was used before gpu factoring. User TJAOI probably does not change the picture much.

3. Factoring stops after the current bit level or class, after a factor is found for an exponent, so does not find coincident factors that occur in a higher class or bitlevel for the same exponent. This effect would be minor.

4. Whatever I'm not thinking of now.
kriesel is offline   Reply With Quote
Old 2020-05-19, 22:09   #3271
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

97·103 Posts
Default

Quote:
Originally Posted by kriesel View Post
In a spreadsheet I estimated odds for n per bitlevel/class/exponent, for n from 0 to 6, without regard to whether a fewer number might be found at a lower bitlevel or class, and concluded more than 5 is improbable, based on 40 bits to 92 bits sums. https://www.mersenneforum.org/showpo...82&postcount=5


3. Factoring stops after the current bit level or class, after a factor is found for an exponent, so does not find coincident factors that occur in a higher class or bitlevel for the same exponent. This effect would be minor.

4. Whatever I'm not thinking of now.

Stopping or continuing is controlled in mfaktc.ini.

# possible values for StopAfterFactor:
# 0: Do not stop the current assignment after a factor was found.
# 1: When a factor was found for the current assignment stop after the
# current bitlevel. This makes only sense when Stages is enabled.
# 2: When a factor was found for the current assignment stop after the
# current class.
#
# Default: StopAfterFactor=1

StopAfterFactor=1
kladner is offline   Reply With Quote
Old 2020-05-19, 22:42   #3272
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

31×149 Posts
Default

Quote:
Originally Posted by kladner View Post
Stopping or continuing is controlled in mfaktc.ini.

# possible values for StopAfterFactor:
# 0: Do not stop the current assignment after a factor was found.
# 1: When a factor was found for the current assignment stop after the
# current bitlevel. This makes only sense when Stages is enabled.
# 2: When a factor was found for the current assignment stop after the
# current class.
#
# Default: StopAfterFactor=1

StopAfterFactor=1
Exactly my point. Also applies to mfakto. I have a vague recollection of something similar in prime95 but can not confirm that in the documentation. (And note that TF credit in the case of a factor found is given as if StopAfterFactor=2, even if what was run is StopAfterFactor=1 or 0.)
kriesel is offline   Reply With Quote
Old 2020-06-13, 04:17   #3273
tului
 
Jan 2013

22·17 Posts
Default

Just got a notebook with a 2060 in it. What version should I grab and from where to get going again?
tului is offline   Reply With Quote
Old 2020-06-13, 16:22   #3274
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

31×149 Posts
Default

Quote:
Originally Posted by tului View Post
Just got a notebook with a 2060 in it. What version should I grab and from where to get going again?
RTX2060? Probably cuda10 capable v0.21 from https://download.mersenne.ca/mfaktc/mfaktc-0.21
kriesel is offline   Reply With Quote
Old 2020-06-29, 17:23   #3275
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

461910 Posts
Default

Programmers, please consider modifying mfaktc to allow larger maximum GpuSieveSize than the current maximum 2047 (megabits), and sharing the enhancement.

Numerous gpu models show increased throughput going from GpuSieveSize 1024 to 2047. Even as old and slow as GTX1060. The increase for faster models such as RTX 2080 Super is significant. It appears there is a little more to be gained if the code is modified again to support yet larger values. RTX2080x produce more throughput with multiiple instances, indicating there's more yet to be gained with larger GpuSieveSize. Charts of performance of a single instance versus GpuSieveSize settings show a positive slope near the current maximum, also. Future faster gpus will likely continue the trend of the large impact of larger GpuSieveSize on faster gpus, that we have seen in the gpu models released in the past few years. (RTX3080 is coming...)

2048 x 2^20 =2^31 bit position computation requires computation with unsigned 32 bit integer or larger, but the actual code is signed 32 bit integer, maximum 2^31-1. The RTX2080x would benefit from more GpuSieveSize than the program currently supports. Unsigned 32 bit would be good at 4095 max. But there's code that uses a negative value in the same variable, that would need to be changed. Jumping to 64-bit signed is a possibility but I wonder how much that would impact overall performance.
There's no need to support many more bits of GpuSieveSize than are present in gpu vram, typically 4GB = 32Gbits =32768Mbits to 16GB = 128Gbits = 131072Mbits these days. Allow another factor of four to cover the next several years.

There may be other smaller limits, such as what the gpu model's OpenCl support is capable of. (I've already seen that on old gpu models at ~1023 or 511 depending on model or perhaps driver version.)
For some background, see https://mersenneforum.org/showpost.p...postcount=3202 and related posts.
kriesel is offline   Reply With Quote
Old 2020-06-29, 18:38   #3276
xx005fs
 
"Eric"
Jan 2018
USA

24·13 Posts
Default CUDA 11?

Any builds that support the newest CUDA 11? Hopefully there's something about CUDA 11 that could potentially bring some sort of speed up?
xx005fs is offline   Reply With Quote
Old 2020-06-29, 19:00   #3277
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

31·149 Posts
Default

Quote:
Originally Posted by xx005fs View Post
Any builds that support the newest CUDA 11? Hopefully there's something about CUDA 11 that could potentially bring some sort of speed up?
It's probably prep for RTX30xx. For a given gpu, later CUDA versions may be SLOWER. https://www.tomsguide.com/news/rtx-3080
kriesel is offline   Reply With Quote
Old 2020-07-22, 15:31   #3278
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009
U.S.A.

1,619 Posts
Default

Quote:
Originally Posted by kriesel View Post
It's probably prep for RTX30xx. For a given gpu, later CUDA versions may be SLOWER. https://www.tomsguide.com/news/rtx-3080
I am testing the current 2047 version on my GTX 1080. It seems to run slightly faster. Around 1,090 GHz-d/day. The previous one I had been using ran about 1,045. I suppose there are other advantages. Either way, it started without any problems.
storm5510 is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1657 2020-10-27 01:23
The P-1 factoring CUDA program firejuggler GPU Computing 752 2020-09-08 16:15
"CUDA runtime version 0.0" when running mfaktc.exe froderik GPU Computing 4 2016-10-30 15:29
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51
World's dumbest CUDA program? xilman Programming 1 2009-11-16 10:26

All times are UTC. The time now is 11:16.

Tue Oct 27 11:16:30 UTC 2020 up 47 days, 8:27, 0 users, load averages: 0.99, 1.34, 1.40

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.