mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   mfaktc: a CUDA program for Mersenne prefactoring (https://www.mersenneforum.org/showthread.php?t=12827)

TObject 2012-08-21 20:19

[QUOTE=James Heinrich;308813]If you care to fiddle with mfaktc.ini::ProgressFormat a bit, GHz-days/day (%g) is probably more useful to look at than M/s (%r). This is the number you really want to maximize.

[/QUOTE]

Sure. Thanks.

Three threads:
16-minute assignement - 94 GHz-days/day
8-hour assignement - 97 GHz-days/day
8-day assignement - 96 GHz-days/day

GPU load: 97%
GPU clock: 797 MHz
GPU Memory Clock: 2025 MHz

GTX 580

SievePrimes has auto-adjusted down to 2000 on all three threads. That probably means there is more performance to milk out of the GPU, but still we are somewhat close to maximum with only three threads.

That is with me using the computer (web browsing and email at the moment).

kladner 2012-08-21 21:21

Thanks for pointing out these options, James. Showing g-d/d is a welcome addition.

lalera 2012-08-28 13:28

i have a question
 
is it possible to make a lessclasses version of mfaktc v 0.19
for win7 64bit ?
especially because of the new and fast 76bit kernel

TheJudger 2012-08-30 16:34

I have an answer
 
[QUOTE=lalera;309519]is it possible to make a lessclasses version of mfaktc v 0.19
for win7 64bit ?
especially because of the new and fast 76bit kernel[/QUOTE]

Yes, it is possible!
Windows executeables with "Less Classes": [url]http://www.mersenneforum.org/mfaktc/mfaktc-0.19.win.cuda42.LessClasses.zip[/url]

Oliver

NormanRKN 2012-08-30 17:25

hi!
what is the difference between the lessclasses and the stock version ?

TheJudger 2012-08-30 17:53

Hi Norman,

the "LessClasses" version splits the job into 96 of 420 classes instead of 960 of 4620 classes.
This version is good for short running jobs (below 1s per class with the default version) because of less overhead for class switch at the cost of a slightly lower sieve efficency (because multiples of 11 are not avoided with the residue classes).
This option is a compiletime option in src/params.h.

Oliver

NormanRKN 2012-08-30 18:00

thx Olli,

but I can use it for long running jobs too without any trouble ?

Norman

TheJudger 2012-08-30 18:04

Hi Norman,

yes, for sure. Worst case it runs a little bit slower and/or higher demand for CPU.

Oliver

Dubslow 2012-08-30 18:04

[QUOTE=NormanRKN;309760]thx Olli,

but I can use it for long running jobs too without any trouble ?

Norman[/QUOTE]

Yes, but it will be slower than the default version. It's faster for very short jobs, slower for the work that GIMPS usually does.

NormanRKN 2012-08-30 18:16

aaah ok!!
is there a range where "short" starts or ends to know what versions I should use or is it dependent on hardware (speed) ?

Norman

Dubslow 2012-08-30 18:22

[QUOTE=NormanRKN;309765]is there a range where "short" starts or ends to know what versions I should use or is it dependent on hardware (speed) ?[/QUOTE]

[QUOTE=TheJudger;309758]This version is good for short running jobs (below 1s per class with the default version) [/QUOTE]
^^^


All times are UTC. The time now is 23:16.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.