mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2012-01-07, 04:36   #1541
bcp19
 
bcp19's Avatar
 
Oct 2011

2A716 Posts
Default

Quote:
Originally Posted by flashjh View Post
You're right. I wouldn't put more than one core per instance, but I think his GPU could handle more.
It depends on the setup. I have a Core 2 Quad that runs 2 instances on a GTS 450. Main reason why is that if I run 1 instance, I get 80M/s, where with the 2 running each instance gets 60M/s. Since I don't like to let SievePrimes move from 5000, my CPU's have 10+% of wait time, so I actually run P95 on all 4 cores, but the 2 sharing with mfaktc take about 8-10 times as long to run. My 2400 runs a GTX 560Ti and has the same setup, but the shared cores only take about 3x as long. It runs about 180M/s single instance, 260M/s combined.
bcp19 is offline   Reply With Quote
Old 2012-01-07, 12:03   #1542
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

45716 Posts
Default

Quote:
Originally Posted by nucleon View Post
What I've noticed is that v0.18 needs a little more cpu grunt to max out GPUs.
Yepp, the GPU code is a little bit faster in 0.18 thus you'll need a bit more CPU power (or lower SievePrimes) to keep the GPU busy. There will be another little boost for especially for CC 2.0 chips once CUDA 4.1 is available.
http://www.mersenneforum.org/showpos...postcount=1328
http://www.mersenneforum.org/showpos...postcount=1368
http://www.mersenneforum.org/showpos...postcount=1409

Oliver
TheJudger is offline   Reply With Quote
Old 2012-01-07, 15:13   #1543
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Heh. I've noticed my cpu wait go up, not down. I'll try and figure it out later.
Dubslow is offline   Reply With Quote
Old 2012-01-09, 21:05   #1544
MrRepunit
 
MrRepunit's Avatar
 
Mar 2011
Germany

1358 Posts
Default mfaktc for repunits

Hi,

how much effort would it be to extend mfaktc to handle (generalized) repunits? The factors have the same (or similar) properties as Mersenne primes: 2kp+1.
If it is not too much work I would like to put this on the wish-list for mfaktc.

Thanks...
MrRepunit is offline   Reply With Quote
Old 2012-01-10, 17:45   #1545
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

11×101 Posts
Default

Hi!

Quote:
Originally Posted by MrRepunit View Post
Hi,

how much effort would it be to extend mfaktc to handle (generalized) repunits? The factors have the same (or similar) properties as Mersenne primes: 2kp+1.
If it is not too much work I would like to put this on the wish-list for mfaktc.
Do you have any more information about generalized repunits (math)?

Oliver
TheJudger is offline   Reply With Quote
Old 2012-01-10, 20:46   #1546
MrRepunit
 
MrRepunit's Avatar
 
Mar 2011
Germany

3·31 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Hi!

Do you have any more information about generalized repunits (math)?

Oliver
Here you go:
http://www.cs.uwaterloo.ca/journals/...NER/dubner.pdf
http://www.maths.tcd.ie/pub/ims/bull59/R5901.pdf
http://en.wikipedia.org/wiki/Repunit

I hope this is what you wanted.

Danilo
MrRepunit is offline   Reply With Quote
Old 2012-01-14, 05:49   #1547
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Can you try to upgrade the driver: devdriver_4.0_linux_64_270.41.19.run


Well, if everything is right the CUDA SDK and CUDA Toolkit is not needed to run the precompiled mfaktc executable. You'll need a proper driver, nothing more.

Oliver
Here's the issues I've had over the last 4 months or so with drivers.
http://forums.nvidia.com/index.php?showtopic=220802

Last fiddled with by Dubslow on 2012-01-14 at 05:50
Dubslow is offline   Reply With Quote
Old 2012-01-15, 19:26   #1548
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

100010101112 Posts
Default

Hi Danilo,

Quote:
Originally Posted by MrRepunit View Post
OK, might take some days until I look into it.

Oliver
TheJudger is offline   Reply With Quote
Old 2012-01-15, 19:56   #1549
MrRepunit
 
MrRepunit's Avatar
 
Mar 2011
Germany

3·31 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Hi Danilo,

OK, might take some days until I look into it.

Oliver
Many thanks in advance!
MrRepunit is offline   Reply With Quote
Old 2012-01-16, 05:30   #1550
flashjh
 
flashjh's Avatar
 
"Jerry"
Nov 2011
Vancouver, WA

112310 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Hello!

http://www.mersenneforum.org/mfaktc/mfaktc-0.18.tar.gz
http://www.mersenneforum.org/mfaktc/mfaktc-0.18.win.zip
http://www.mersenneforum.org/mfaktc/...linux64.tar.gz

The executables need at least a CUDA 4.0 capable driver (270 series driver or newer). The Windows zip archive contains both, the 32 bit and 64 bit version. I'll upload new executables once CUDA 4.1 is public available. The sources should compile with older CUDA version, too, but they might be slower. CUDA 4.1 will give another performance improvement for the barrett based kernels on compute capability 2.x GPUs (especially on 2.0).


Compared to mfaktc 0.17 there are "more than usuall" minor changes. Highlights from the Changelog.txt:
  • autoadjustment of SievePrimes is now less dependend on the gridsize and
    absolute speed. Instead of measuring the absolute (average) time waited
    per precessing block (grid size) now the relative time spent on waiting
    for the GPU is calculated. In the per-class output "avg. wait" is replaced
    by "CPU wait".
  • new commandline option: "-v" (verbosity) let the user decide how many
    informations are printed
    (suggested by aspen on www.mersenneforum.org)
  • "has a factor" result lines now contain informations (program name,
    versions, bitlevel, ...) James Heinrich is working on this on the server
    side. This should give more accurate credits for "has a factor" results
    from the primenet server once this is fully implemented.
  • mfaktc no longer refuses to load a checkpoint file from a Linux version
    with a Windows version of mfaktc and vice versa. Of course mfaktc still
    refuses to load checkpoint files from other versions than itself
    (identical version string!)
  • added a (simple) signal handler (captures SIGINT and SIGTERM).
    1st ^C: mfaktc will exit after the currently processed class is finished.
    2nd ^C: mfaktc will stop immediately
  • added a minimum delay between two checkpoint file writes. The user can set
    the delay in mfaktc.ini (CheckpointDelay).
  • added a new code path to barrett79_mul32 and barrett92_mul32 kernels, CUDA
    >= 4.1 features multiply-add with carry for compute capability >= 2.0.
    On my GTX 470 (compute capability) this yields up to 15% for
    barrett92_mul32 and up to 7% for barrett79_mul32 extra throughput.
As usuall: finish your current assignments with your current version and do the update after it, mfaktc 0.18 will refuse foreign checkpoint files.

Oliver
Ok, so I'm having some problems. I've been running .18 for some time now, 5 instances with no problems.

Then, last couple of days only instance 1 has started crashing, no error window or anything -- it's just closes. The other problem is that it's duplicating work. For no apparent reason it will do the same factor 2 or 3 times at the same bit level (which is a huge waste) and it always crashes after a duplicate so it's wasting even more time. I tried deleting the .exe and using a fresh copy with no luck. I haven't changed anything else, so I don't know what it could be. The other four instances work fine, no lock-ups or closes or duplicate work. I at a loss here. Anyone having similar problems? Using Win7 64bit with two GTX 580s.
flashjh is offline   Reply With Quote
Old 2012-01-16, 06:38   #1551
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

23·17·71 Posts
Default

Quote:
Originally Posted by flashjh View Post
it's duplicating work. For no apparent reason it will do the same factor 2 or 3 times at the same bit level
smells like no rights to write the worktodo file, is that opened by somebody else? did you change the rights? file attributes? (hidden, system?)
mfaktc needs to access worktodo file to modify the first line, therefore eliminate the work already done. If it can not do that, it will certainly repeat the work.
LaurV is online now   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
The P-1 factoring CUDA program firejuggler GPU Computing 753 2020-12-12 18:07
gr-mfaktc: a CUDA program for generalized repunits prefactoring MrRepunit GPU Computing 32 2020-11-11 19:56
mfaktc 0.21 - CUDA runtime wrong keisentraut Software 2 2020-08-18 07:03
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51

All times are UTC. The time now is 10:28.


Mon Aug 2 10:28:19 UTC 2021 up 10 days, 4:57, 0 users, load averages: 1.75, 1.50, 1.29

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.