mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   mfaktc: a CUDA program for Mersenne prefactoring (https://www.mersenneforum.org/showthread.php?t=12827)

James Heinrich 2013-01-07 16:39

[QUOTE=TheJudger;323927]why not using a fixed exponent for the benchmark.[/QUOTE]Mostly to avoid wasting 5-10 GHz-days of work -- users can just submit info about whatever they're currently working on with minimal extra effort. Also helps me get more benchmark data. Bigger exponents take more iterations, and are also worth more credit, so that's a non-issue.

chalsall 2013-01-07 17:59

[QUOTE=TheJudger;323840]mfaktc 0.20 is finally available![/QUOTE]

Nice work as always Oliver. (And, of course, George).

For those who suddenly find themselves with spare CPU capacity available (and have some memory available), please consider doing some P-1 work.

The TF wavefront is currently holding steady to the LL wavefront (we're about 47 days ahead), but sadly many LL assignments are being made without P-1 having already been done properly.

ixfd64 2013-01-07 18:03

Perhaps a little off-topic, but how feasible is GPU P-1 factoring?

firejuggler 2013-01-07 18:23

it is possible, but not efficient?

kladner 2013-01-07 19:43

[QUOTE=chalsall;323942]For those who suddenly find themselves with spare CPU capacity available (and have some memory available), please consider doing some P-1 work.[/QUOTE]

Five P-1s online since last night, Sir! :cool: I'm holding back one core to see how responsive things are, though if anything it's a bit better than it was with 6x mfaktc 0.19 running.

I have been experimenting with the GPUSieve settings. On the GTX 570, GPUSieveSize=128 improved the Time value from ~5.9s to ~5.8s, and GHz-D/D from 419 to 429. GPUSieveProcessSize=8 caused a very slight improvement, and I returned it to the default of 16.

On the GTX 460, GPUSieveSize=128 essentially doubled the time and halved the GHz-D/D. IIRC, going to GPUSieveSize=32 had a similar effect. However, GPUSieveProcessSize=8 reduced the Time from 12.05s to 11.9s. GHz-D/D went from 207 to 209.

chalsall 2013-01-07 20:21

[QUOTE=kladner;323956]Five P-1s online since last night, Sir! :cool: I'm holding back one core to see how responsive things are, though if anything it's a bit better than it was with 6x mfaktc 0.19 running.[/quote]

Thanks kladner.

It's a little funny how much we find we have to fight Augustus ourselves simply to come to terms.....

We throw our hands out hoping for impact, but we don't actually hope for impact.

Kinda weird....

kracker 2013-01-07 23:42

[QUOTE=chalsall;323942]
For those who suddenly find themselves with spare CPU capacity available (and have some memory available), please consider doing some P-1 work.
[/QUOTE]

Hmm, which do you think is more useful to the project, DC or P-1?
And this probably is a unanswerable or stupid question: if P-1, what is the "minimum" memory required/recommended for what is being given out now?

James Heinrich 2013-01-07 23:48

[QUOTE=kracker;323979]P-1, what is the "minimum" memory required/recommended for what is being given out now?[/QUOTE]Assuming what's given out now is somewhere around 60M, my [url=http://mersenne.ca/prob.php?exponent=60000011&guess_saved_tests=2]P-1 probability calculator[/url] says that around 512MB is "minimum", 1.5GB is "good" and 12GB is "max".

kracker 2013-01-08 00:02

[QUOTE=James Heinrich;323980]Assuming what's given out now is somewhere around 60M, my [URL="http://mersenne.ca/prob.php?exponent=60000011&guess_saved_tests=2"]P-1 probability calculator[/URL] says that around 512MB is "minimum", 1.5GB is "good" and 12GB is "max".[/QUOTE]

Hmm, there is a "max"? didn't know that.
Anyways, thanks, good to know :smile:

Dubslow 2013-01-08 00:05

[QUOTE=kracker;323982]Hmm, there is a "max"? didn't know that.
Anyways, thanks, good to know :smile:[/QUOTE]

Well, sort of. If there's enough memory, Prime95 will do all of stage 2 in one pass ("processing 480 of 480 relative primes"). If there's 10 times [i]that[/i] amount of memory, then it will use more relative primes, but the gains are minimal at best. Heck, after 3-4 GiB, the gains are minimal.

kracker 2013-01-08 00:10

[QUOTE=Dubslow;323984]Well, sort of. If there's enough memory, Prime95 will do all of stage 2 in one pass ("processing 480 of 480 relative primes"). If there's 10 times [I]that[/I] amount of memory, then it will use more relative primes, but the gains are minimal at best. Heck, after 3-4 GiB, the gains are minimal.[/QUOTE]

I see. So it's not really a literal "max" it's just "over this, gains almost useless unless.."


All times are UTC. The time now is 23:16.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.