mersenneforum.org  

Old 2012-12-26, 09:05   #1959
firejuggler
 
Apr 2010
Over the rainbow

2×1,303 Posts

#3 as well, even if I'm not really active on the GPU front at this time.
Old 2012-12-26, 10:47   #1960
ET_
Banned
 
"Luigi"
Aug 2002
Team Italia

61·79 Posts

#3 for me, even if I used mmff 0.26 with no -gs switch, and it appeared to sieve on GPU.

Luigi
Old 2012-12-26, 11:57   #1961
James Heinrich
 
"James Heinrich"
May 2004
ex-Northern Ontario

23·149 Posts

Quote:
Originally Posted by axn
Intelligent/Smart option -- Do a selftest and pick the one with highest throughput.

I'm not sure if that would differ from #3 -- I'm not sure if there are cases where GPU sieving would be slower than CPU sieving. Possibly on a very slow GPU (GT 620 or similar) with a very fast CPU -- I'm not sure if Oliver/George have looked into the cutoff points of efficiency.

It has been determined that CC 1.x (compute capability 1.x) GPUs have such poor GPU-sieving throughput that it makes no sense to use it there, so it's never available on them; for CC 2.0+ the benefit is significant.

The value for GPUSievePrimes might be a viable target for auto-adjustment, but in brief testing I found very little difference in throughput using different values.
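
A rough sketch of the compute-capability rule described above, in Python. This is only an illustration, not mfaktc's actual code: the function name `gpu_sieving_available` and the `(major, minor)` tuple are assumptions made up for the example.

```python
def gpu_sieving_available(compute_capability):
    """Return True if GPU sieving should be offered for this device.

    compute_capability is a (major, minor) tuple, e.g. (2, 1).
    Hypothetical helper that mirrors the rule stated in the post:
    CC 1.x -> never available, CC 2.0+ -> available.
    """
    major, _minor = compute_capability
    return major >= 2


if __name__ == "__main__":
    # Example compute capabilities, chosen only for illustration.
    for cc in [(1, 3), (2, 0), (3, 0)]:
        mode = "GPU sieving" if gpu_sieving_available(cc) else "CPU sieving only"
        print(f"CC {cc[0]}.{cc[1]}: {mode}")
```

Keeping this a plain capability check, rather than a self-test that benchmarks both modes, matches the "simple solutions" preference voiced later in the thread.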
Old 2012-12-26, 13:09   #1962
VictordeHolland
 
"Victor de Hollander"
Aug 2011
the Netherlands

2³·3·7² Posts

Can somebody give a rough indication of when the 0.20 'production' client will be released? I know it is all done in your spare time, but I (and probably others) am eagerly waiting for this big improvement.
Old 2012-12-26, 16:35   #1963
TheJudger
 
"Oliver"
Mar 2005
Germany

457₁₆ Posts

Hello,

Quote:
Originally Posted by James Heinrich
I'm not sure if that would differ from #3 -- I'm not sure if there are cases where GPU sieving would be slower than CPU sieving. Possibly on a very slow GPU (GT 620 or similar) with a very fast CPU -- I'm not sure if Oliver/George have looked into the cutoff points of efficiency.

OK, we have a winner: option #3.
No, no automatic switching between CPU and GPU sieving; I like simple solutions.

Oliver
Old 2012-12-26, 16:40   #1964
TheJudger
 
"Oliver"
Mar 2005
Germany

10001010111₂ Posts

Quote:
Originally Posted by VictordeHolland
Can somebody give a rough indication of when the 0.20 'production' client will be released? I know it is all done in your spare time, but I (and probably others) am eagerly waiting for this big improvement.

When it's done!

Oliver

P.S. I want to finish v0.20 this year. Add a week or two for testing, where I give it to a few people, and if no problems are found I'll release it.
Old 2012-12-27, 10:02   #1965
xtreme2k
 
Aug 2002

256₈ Posts

Hey guys,

Got a new GTX 670 (upgrade from a GTX 460). I am now cracking at approx. 195-200M/s (vs. 150M/s), which is pretty poor given the upgrade. NV really made the new chip a pure gaming chip with crappy compute performance, and it's really showing.

The only advantage I know of is that this GPU uses similar power to the GTX 460 (if not only slightly more).

Does mfaktc work on dual-GPU systems with two different GPUs? How does one assign each GPU (if this is possible)? I was thinking of plugging in the GTX 460 purely for compute use. What do you guys think?
Attached image: Capture3.PNG (130.1 KB)
Old 2012-12-27, 10:04   #1966
LaurV
Romulan Interpreter
 
Jun 2011
Thailand

3×3,221 Posts

Yes, you can plug both of them in and, depending on your CPU, run more than one instance of mfaktc for each.
Use the "-d x" switch to say which GPU is used by which instance, substituting the GPU number for x. It's in the docs, and right in this very thread too, a few pages back.
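
As a minimal sketch of the "-d x" advice above, the following Python snippet builds one command line per GPU. Only the `-d` switch comes from this thread; the executable name, the device numbering (0 and 1), the per-instance working directories and both helper names are assumptions for illustration, so check the mfaktc docs for your own setup.

```python
import subprocess


def build_commands(executable, device_ids):
    """Build one argument list per GPU, each selecting its device with -d."""
    return [[executable, "-d", str(dev)] for dev in device_ids]


def launch_all(commands, workdirs):
    """Start every instance in its own working directory; return the handles.

    Assumption: each instance gets a separate directory so the instances
    do not share work files. Use an absolute executable path when relying
    on cwd, since relative-path lookup with cwd is platform-dependent.
    """
    return [subprocess.Popen(cmd, cwd=wd) for cmd, wd in zip(commands, workdirs)]


if __name__ == "__main__":
    # Assumed numbering: GPU 0 and GPU 1; the real order depends on the system.
    cmds = build_commands("mfaktc.exe", [0, 1])
    for cmd in cmds:
        print(" ".join(cmd))
    # launch_all(cmds, ["gpu0", "gpu1"])  # only on a machine with both cards
```

Running the script as-is only prints the two command lines; the actual launch is left commented out on purpose.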

Last fiddled with by LaurV on 2012-12-27 at 10:06
Old 2012-12-27, 10:07   #1967
Sutton Shin
 
Sep 2012

17 Posts
Stupid Question

Where is the program for Windows 7 x64?

I KNOW that it is a stupid question. I am too lazy to go through 86 pages of posts.

Last fiddled with by Sutton Shin on 2012-12-27 at 10:09
Old 2012-12-27, 10:33   #1968
Dubslow
Basketry That Evening!
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts

Quote:
Originally Posted by Sutton Shin
Where is the program for Windows 7 x64?

I KNOW that it is a stupid question. I am too lazy to go through 86 pages of posts.

Very stupid. Even if you're too lazy to go through posts, Google is your friend. (The first or third link is perfectly useful.)



Quote:
Originally Posted by LaurV
Yes, you can plug both of them in and, depending on your CPU, run more than one instance of mfaktc for each.
Use the "-d x" switch to say which GPU is used by which instance, substituting the GPU number for x. It's in the docs, and right in this very thread too, a few pages back.

In other words, RTFM.


____________________________________________________________________________

Last fiddled with by Dubslow on 2012-12-27 at 10:35 Reason: I'm really feeling like a jackass now, aren't I? (Maybe because it's 4 in the morning.)
Old 2012-12-27, 11:55   #1969
TheJudger
 
"Oliver"
Mar 2005
Germany

11·101 Posts

Quote:
Originally Posted by xtreme2k
Hey guys,

Got a new GTX 670 (upgrade from a GTX 460). I am now cracking at approx. 195-200M/s (vs. 150M/s), which is pretty poor given the upgrade. NV really made the new chip a pure gaming chip with crappy compute performance, and it's really showing.

Well, you're limited by your CPU: a single core can't feed your GTX 670. You can either start a second instance using a second CPU core or wait for mfaktc 0.20 and use GPU sieving. Depending on the exponent and bit level, a GTX 670 should yield > 300M/s in the best case (barrett76 kernel).
And you're right, for applications which make heavy use of integer instructions (like mfaktc), the newer chips are not so good. Anyway, the energy efficiency (performance per watt while running mfaktc) is very good.

Oliver

All times are UTC.



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.