![]() |
|
|
#320 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
32·5·7 Posts |
Many thanks to frmky, here's a 64bit windows build of CUDAPm1, using CUDA toolkit 5.0. I have tested this very little, but seems to be working OK.
|
|
|
|
|
|
#321 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
|
|
|
|
|
|
#322 |
|
Jul 2003
So Cal
1000010100002 Posts |
The DP performance of the GTX 780 has been cut to GTX 7xx levels, so for DP compute it is really no different than the earlier chip. A GTX 580 should still give better performance at a much lower price.
|
|
|
|
|
|
#323 | |
|
"James Heinrich"
May 2004
ex-Northern Ontario
23·149 Posts |
Quote:
I'm not sure how relative performance varies between CUDAlucas and CUDAPm1. |
|
|
|
|
|
|
#324 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Quote:
|
|
|
|
|
|
|
#325 |
|
Mar 2003
Melbourne
51510 Posts |
If you are after DP* result throughput efficiency.
Your best best is to skip GPUs and buy multiple low-clocked quad core machines +high clock ram. Capex might be more, opex is lower for a given throughput. -- Craig *I stress DP. TF - GPUs blow CPUs out of the water. |
|
|
|
|
|
#326 | |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
32·5·7 Posts |
Quote:
|
|
|
|
|
|
|
#327 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
1015810 Posts |
|
|
|
|
|
|
#328 | |
|
Mar 2010
3×137 Posts |
Quote:
I see some changes (like full S1 and S2 checkpoints) from the old one I've had (dated 06 May 2013). Owners of the defective Titan may run CUDAPm1/CUDALucas on Windows like this: Code:
start CUDAPm1 [flags if not using ini file] goto :start For this to work effectively, I suggest setting the checkpoint iterations to a thousand, so checkpoints would be written every couple of seconds and running CUDAPm1 from a RAM disk, so that the checkpoints would not wear your storage media. One drawback of this method is that it will never go out of the loop, even if there are no tasks in the worktodo file. Another one is related to the volatile nature of RAM disks: if your system crashes or reboots, you lose all the work. Comments are welcome
Last fiddled with by Karl M Johnson on 2013-08-18 at 09:22 |
|
|
|
|
|
|
#329 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
32·5·7 Posts |
With the latest drivers, 326.41 for windows and 325.15 for linux, the unstable memory problem (if that's what it was) is fixed. There is still a bug with the driver that causes the ffts to hang occasionally. Its been reported and I presume being worked on. This bug affects all cards, not just the titans.
I've been doing something similar to what you suggested, but instead, looping on a non zero exit value. That way ^C still exits the program. I also don't think the checkpoint iterations set so low is necessary. You will loose as much time by doing the extra checkpoints as you gain by having a more recent checkpoint when it dies. |
|
|
|
|
|
#330 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2·3·1,693 Posts |
Quote:
Last fiddled with by kladner on 2013-08-19 at 22:58 |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfaktc: a CUDA program for Mersenne prefactoring | TheJudger | GPU Computing | 3498 | 2021-08-06 21:07 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |
| World's dumbest CUDA program? | xilman | Programming | 1 | 2009-11-16 10:26 |
| Factoring program need help | Citrix | Lone Mersenne Hunters | 8 | 2005-09-16 02:31 |
| Factoring program | ET_ | Programming | 3 | 2003-11-25 02:57 |