mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-08-17, 18:20   #320
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

Many thanks to frmky, here's a 64bit windows build of CUDAPm1, using CUDA toolkit 5.0. I have tested this very little, but seems to be working OK.
Attached Files
File Type: exe CUDAPm1-5.0.exe (541.0 KB, 81 views)
owftheevil is offline   Reply With Quote
Old 2013-08-17, 18:52   #321
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Quote:
Originally Posted by owftheevil View Post
Many thanks to frmky, here's a 64bit windows build of CUDAPm1, using CUDA toolkit 5.0. I have tested this very little, but seems to be working OK.
OMG! Wow! Thanks to both of you!
kladner is offline   Reply With Quote
Old 2013-08-17, 20:47   #322
frmky
 
frmky's Avatar
 
Jul 2003
So Cal

1000010100002 Posts
Default

Quote:
Originally Posted by nucleon View Post
GTX780 is Titan-lite. GTX780 is a different chip to GTX770.

If you can't get a GTX5x0, and a Titan is out of your price range then GTX780 is your better bet. Of course factoring in budget constraints.
The DP performance of the GTX 780 has been cut to GTX 7xx levels, so for DP compute it is really no different than the earlier chip. A GTX 580 should still give better performance at a much lower price.
frmky is offline   Reply With Quote
Old 2013-08-17, 20:52   #323
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23·149 Posts
Default

Quote:
Originally Posted by frmky View Post
The DP performance of the GTX 780 has been cut to GTX 7xx levels, so for DP compute it is really no different than the earlier chip. A GTX 580 should still give better performance at a much lower price.
According to benchmark data I have for CUDAlucas, the GTX 780 is still slightly ahead of the GTX 580 by roughly 5%
I'm not sure how relative performance varies between CUDAlucas and CUDAPm1.
James Heinrich is offline   Reply With Quote
Old 2013-08-17, 20:57   #324
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2×3×1,693 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
According to benchmark data I have for CUDAlucas, the GTX 780 is still slightly ahead of the GTX 580 by roughly 5%
I'm not sure how relative performance varies between CUDAlucas and CUDAPm1.
Too bad it also uses ~2.5% more power, too. I'd say this gives the edge to the 580 because of its lower price.
kladner is offline   Reply With Quote
Old 2013-08-17, 23:46   #325
nucleon
 
nucleon's Avatar
 
Mar 2003
Melbourne

51510 Posts
Default

If you are after DP* result throughput efficiency.

Your best best is to skip GPUs and buy multiple low-clocked quad core machines +high clock ram.

Capex might be more, opex is lower for a given throughput.

-- Craig
*I stress DP. TF - GPUs blow CPUs out of the water.
nucleon is offline   Reply With Quote
Old 2013-08-18, 02:11   #326
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
According to benchmark data I have for CUDAlucas, the GTX 780 is still slightly ahead of the GTX 580 by roughly 5%
I'm not sure how relative performance varies between CUDAlucas and CUDAPm1.
I haven't tested this very thoroughly yet, but it seems that on cards with smaller amounts of memory, e.g. a 560 with ~1gb of memory, CUDALucas and CUDAPm1 have about the same thoughput, whereas with 6gb of memory, throughput for CuPm1 is about 15% greater than for CuLu.
owftheevil is offline   Reply With Quote
Old 2013-08-18, 04:29   #327
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

1015810 Posts
Default

Quote:
Originally Posted by kladner View Post
Too bad it also uses ~2.5% more power, too.
Brought to you by the Department of Redundancy Department.
kladner is offline   Reply With Quote
Old 2013-08-18, 08:47   #328
Karl M Johnson
 
Karl M Johnson's Avatar
 
Mar 2010

3×137 Posts
Default

Quote:
Originally Posted by owftheevil View Post
Many thanks to frmky, here's a 64bit windows build of CUDAPm1, using CUDA toolkit 5.0. I have tested this very little, but seems to be working OK.
Thank you for the new binary.
I see some changes (like full S1 and S2 checkpoints) from the old one I've had (dated 06 May 2013).

Owners of the defective Titan may run CUDAPm1/CUDALucas on Windows like this:
Code:
start
CUDAPm1 [flags if not using ini file]
goto :start
So whenever CUDAPm1 quits due to the vRAM being unstable, it will launch again and restart from the latest checkpoint.
For this to work effectively, I suggest setting the checkpoint iterations to a thousand, so checkpoints would be written every couple of seconds and running CUDAPm1 from a RAM disk, so that the checkpoints would not wear your storage media.

One drawback of this method is that it will never go out of the loop, even if there are no tasks in the worktodo file.
Another one is related to the volatile nature of RAM disks: if your system crashes or reboots, you lose all the work.

Comments are welcome

Last fiddled with by Karl M Johnson on 2013-08-18 at 09:22
Karl M Johnson is offline   Reply With Quote
Old 2013-08-19, 18:22   #329
owftheevil
 
owftheevil's Avatar
 
"Carl Darby"
Oct 2012
Spring Mountains, Nevada

32·5·7 Posts
Default

With the latest drivers, 326.41 for windows and 325.15 for linux, the unstable memory problem (if that's what it was) is fixed. There is still a bug with the driver that causes the ffts to hang occasionally. Its been reported and I presume being worked on. This bug affects all cards, not just the titans.

I've been doing something similar to what you suggested, but instead, looping on a non zero exit value. That way ^C still exits the program. I also don't think the checkpoint iterations set so low is necessary. You will loose as much time by doing the extra checkpoints as you gain by having a more recent checkpoint when it dies.
owftheevil is offline   Reply With Quote
Old 2013-08-19, 22:55   #330
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default

Quote:
Originally Posted by owftheevil View Post
With the latest drivers, 326.41 for windows and 325.15 for linux, the unstable memory problem (if that's what it was) is fixed. There is still a bug with the driver that causes the ffts to hang occasionally. Its been reported and I presume being worked on. This bug affects all cards, not just the titans.
Is this other recent cards, i.e. 600 and 700 series, or does it extend back to the 500s and 400s? I would love to find out that my 570 can actually run at stock RAM clock.

Last fiddled with by kladner on 2013-08-19 at 22:58
kladner is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
mfaktc: a CUDA program for Mersenne prefactoring TheJudger GPU Computing 3498 2021-08-06 21:07
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51
World's dumbest CUDA program? xilman Programming 1 2009-11-16 10:26
Factoring program need help Citrix Lone Mersenne Hunters 8 2005-09-16 02:31
Factoring program ET_ Programming 3 2003-11-25 02:57

All times are UTC. The time now is 23:24.


Fri Aug 6 23:24:04 UTC 2021 up 14 days, 17:53, 1 user, load averages: 4.56, 4.16, 4.08

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.