![]() |
|
|
#1750 | ||
|
"Oliver"
Mar 2005
Germany
11×101 Posts |
Quote:
Quote:
![]() For GK104 they totally crippelt int32 performance in alot ways. ![]() I guess that I need to code new kernels for CC 3.0 but this may take some time. For mfaktc the reduced number of registers (per core) and the reduced L1/shared memory (per core) is OK but the crippelt int32 performance is really bad. Oliver |
||
|
|
|
|
|
#1751 |
|
Mar 2003
Melbourne
5×103 Posts |
And to make the LL v TF calculations that much harder... Re the comment made to ram selection for video cards.
Can we gather any DC LL stats yet on GPUs? Are they matching more/less often? -- Craig |
|
|
|
|
|
#1752 |
|
"Oliver"
Mar 2005
Germany
11·101 Posts |
Hi,
since Kepler "light" aka GK104 sucks at integer code: any idea whether it is feasible to use a couple of 32bit floats for "small long integers" or not? Primary I need addition, subtraction and multiplication of integers with ~80/160 bits of data. Oliver |
|
|
|
|
|
#1753 | |
|
Nov 2010
Germany
10010101012 Posts |
Quote:
But I have no idea if that would be any faster than pure integer ... |
|
|
|
|
|
|
#1754 |
|
Romulan Interpreter
Jun 2011
Thailand
72×197 Posts |
@bcp19: Would you like to post the version of mfaktc that you used to TF M2137, M2267 and M2273 from 60 to 61 bits? I am curious how did you split the classes, as it is not enough to modify the limit in the source of 0.18 and recompile. Anyhow, doing those TF's makes no sense, as how many ECM was done in that area, there should be no factors below 180 bits. One can do "fake reports" there and get a lot of TF credit (thousands of GHzDays), like reporting all exponents TF-ed to 70 or 80 bits, and still have no negative influence on the project. From the percent of the ECM done, there is no factor below 40 to 60 digits (depending on exponent) on that range of expos below 5000, that means no factor below 120-180 bits. So theoretically, if I want to surpass Xyzzy and Nucleon at TF, I could report all of them to 65 bits, without influencing negative the project in a whole. But this is childish. So please, could you post the "test" version of mfaktc that you used to do those tests? No harm intended, just being curious how did you solve the problem.
|
|
|
|
|
|
#1755 | |
|
Oct 2011
67910 Posts |
Quote:
|
|
|
|
|
|
|
#1756 |
|
Romulan Interpreter
Jun 2011
Thailand
72·197 Posts |
That is wonderful! May I be part of the "testing team"? I would like to play with lower exponents too. Please send me some win-64 binaries and tell me what expos to attack so we don't step on each-other toes.
Please see this post here too, related to this problem. |
|
|
|
|
|
#1757 | |
|
Oct 2011
2A716 Posts |
Quote:
|
|
|
|
|
|
|
#1758 | |
|
Feb 2004
A016 Posts |
Quote:
![]() http://www.mersenneforum.org/mfaktc/ |
|
|
|
|
|
|
#1759 | |
|
Oct 2011
7×97 Posts |
Quote:
Last fiddled with by bcp19 on 2012-04-17 at 17:48 |
|
|
|
|
|
|
#1760 |
|
Romulan Interpreter
Jun 2011
Thailand
72×197 Posts |
I think that the author is the one who can decide if he wants it to make it public or not. Of course I would like to have it, to play with it, but maybe he has a good reason why didn't make it public. It could be still under test, or under development, I tried in the past to modify it by myself, when I found that is not enough to change the lower limit constrain, and in fact, because there are much more candidates to test when the expo is small, the modification is not easy to do.
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |