mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2015-01-23, 05:48   #2432
snme2pm1
 
"Graham uses ISO 8601"
Mar 2014
AU, Sydney

35 Posts
Default

Apparently there are new versions approaching.
In the Green camp TheJudger, in the Red (and the rest) camp BDot.
I am aware of some negative views on the Prime95 Throttle mechanism, but I do rely on having that for a notebook mounted atop a fan.
Prime95 is deployed, sometimes with throttle, when using mfaktc would be unsustainable in the current AU summer conditions.
I have to actively adjust load based on weather forecasts to prevent error exits from mfaktc.
Would a similar style of throttle facility be feasible for mfaktx?
Even better if it could track system temperature!
P.S. I am aware of discussion in another thread about faulty notebook results, yet surely better if high temperature conditions can be managed.

Last fiddled with by snme2pm1 on 2015-01-23 at 06:19
snme2pm1 is offline   Reply With Quote
Old 2015-01-23, 13:55   #2433
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23·149 Posts
Default

Since you started factoring at 263 the CPU sieve must be used, at least for 263-264. I suspect that you may have "stages=0" set in mfaktc.ini which prevents the assignment from being split at the crossover point.
James Heinrich is offline   Reply With Quote
Old 2015-01-23, 14:11   #2434
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
Since you started factoring at 263 the CPU sieve must be used, at least for 263-264. I suspect that you may have "stages=0" set in mfaktc.ini which prevents the assignment from being split at the crossover point.
Nope. mfaktc is splitting assignments just fine above 67.
Mark Rose is offline   Reply With Quote
Old 2015-01-23, 14:34   #2435
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

1101011000112 Posts
Default

In any case it shouldn't be an issue for most people, since the whole PrimeNet range is long since factored well beyond 264 -- even up to M232 should be finished within 6 months or so. If you're redoing old TF for some reason, just split your assignments manually for now
James Heinrich is offline   Reply With Quote
Old 2015-01-23, 15:09   #2436
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2·5·293 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
In any case it shouldn't be an issue for most people, since the whole PrimeNet range is long since factored well beyond 264 -- even up to M232 should be finished within 6 months or so. If you're redoing old TF for some reason, just split your assignments manually for now
Indeed. I'm not really worried about it. It's the first time I've done work at bit levels below 68 :D
Mark Rose is offline   Reply With Quote
Old 2015-01-23, 20:02   #2437
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11·101 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
I found a "bug".

While double checking some trial factoring assignments in another thread, mfaktc picked 75bit_mul32 for Factor=N/A,54820379,63,66 instead of the usual barrett76_mul32_gs. It ran about 1/7th the speed that 64,66 assignments run since it was sieving on the CPU.

I think the default should be to always GPU sieve, or if the CPU sieve is to be used, to split the assignment at the cross over point.
Works as designed.
No barrett based kernel in mfaktc 0.20 is able to handle FCs below 264. So it isn't a matter of GPU or CPU sieving. There are only the "old" schoolbook-division kernels which can do lower FCs. In 0.20 non of these kernels is GPU sieve enabled. This will change in 0.21.
One could try to improve the automatic splitting of bitlevel, ofcourse.

Oliver
TheJudger is offline   Reply With Quote
Old 2015-01-23, 20:13   #2438
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23·149 Posts
Default

Quote:
Originally Posted by TheJudger View Post
One could try to improve the automatic splitting of bitlevel, of course.
That's the key. I'm not sure what the current splitting logic is but if it's relatively easy it would be great to forcibly split the assignment at CPU/GPU-sieving cutoff (currently 264), at least if stages=1.

Certainly not a big issue considering everything (in PrimeNet at least) is long since done beyond 264, but I don't think it would be a bad idea to add it anyways if the code change is simple.
James Heinrich is offline   Reply With Quote
Old 2015-01-23, 21:47   #2439
lycorn
 
lycorn's Avatar
 
"GIMFS"
Sep 2002
Oeiras, Portugal

2×11×67 Posts
Default

Quote:
Originally Posted by James Heinrich View Post

everything (in PrimeNet at least) is long since done beyond 264
Well, not quite... There are stiil more than 70K numbers trial factored to 64 bits or less.

Granted the exponents are all < 5M, but still some people are working on them and finding many new factors . GPU sieving would be a most welcome helping hand, specially if it could be used to TF under 1M as well.
lycorn is offline   Reply With Quote
Old 2015-01-23, 23:38   #2440
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

D6316 Posts
Default

Quote:
Originally Posted by lycorn View Post
Well, not quite... There are stiil more than 70K numbers trial factored to 64 bits or less. Granted the exponents are all < 5M...
Fair point, I stand corrected.
74759 unfactored exponents as I count them, and 319 of them are over 5M (between 5,215,801 and 5,325,181):
Code:
+----------+--------+
| count(*) | mrange |
+----------+--------+
|    16976 |      0 |
|    15886 |      1 |
|    21636 |      2 |
|    16963 |      3 |
|     2979 |      4 |
|      319 |      5 |
+----------+--------+
James Heinrich is offline   Reply With Quote
Old 2015-01-24, 00:11   #2441
Jayder
 
Jayder's Avatar
 
Dec 2012

4268 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
Fair point, I stand corrected.
74759 unfactored exponents as I count them, and 319 of them are over 5M (between 5,215,801 and 5,325,181):
Those have been factored up to 2^65. I guess it didn't get passed to mersenne.ca.
Jayder is offline   Reply With Quote
Old 2015-01-24, 09:56   #2442
lycorn
 
lycorn's Avatar
 
"GIMFS"
Sep 2002
Oeiras, Portugal

101110000102 Posts
Default

Jayder is right.
If you query Primenet directly, via Reports -> Detailed Reports -> Factoring Limits, you´ll get (as of 09:48 UTC):

61 bits - 13250
62 bits - 3257
63 bits - 53708
64 bits - 188

Total - 70403

The highest is 4699963, TFed to 63 bits.
lycorn is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
The P-1 factoring CUDA program firejuggler GPU Computing 753 2020-12-12 18:07
gr-mfaktc: a CUDA program for generalized repunits prefactoring MrRepunit GPU Computing 32 2020-11-11 19:56
mfaktc 0.21 - CUDA runtime wrong keisentraut Software 2 2020-08-18 07:03
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51

All times are UTC. The time now is 04:41.


Fri Aug 6 04:41:41 UTC 2021 up 13 days, 23:10, 1 user, load averages: 2.26, 2.58, 3.54

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.