![]() |
|
|
#1552 |
|
Romulan Interpreter
Jun 2011
Thailand
3×3,221 Posts |
You can't. I use different sm's for CL and mfaktc, they run perfect as long as I don't mix them for the same card. I can mix them in the computer in the same time if they target different cards and the cards are not SLI. Keeping many versions in the same time in the computer is easy, you only put the right dlls in each folder, as both mfaktc and CL look in the folder for the dll if it is not loaded. But you can't RUN two versions on the same card in the same time.
|
|
|
|
|
|
#1553 | |
|
Romulan Interpreter
Jun 2011
Thailand
3·3,221 Posts |
Quote:
And the cards are gtx580, gtx570, tesla c2050, no difference between them. Smaller granulation of FFT (smoother number) is always faster then smaller FFT size with bigger granulation (not so smooth), with very few exceptions. 1440k is such exception which id 5-smooth but still very fast! Higher then 1440 (default FFT) the default size can be almost always tuned to a better one. I can't say for sure if this is not card/OS/whatever dependent. Someone should try FFT 2592k against the smaller defaults on gtx580 on linux. I constantly get (beside of smaller/safer rounding errors) a speed improvement of 13-14% on win64/gtx580 (which is the main setup). This translates into 46-49 hours for a 4xM expo, instead of 52-55 hours. edit: I am going home now, but you can search the forum for "2592k" I am 100% sure for this number (it seems to be only multiple of 2 and 3 too :D) and you should find my former posts. Trust better the numbers in those posts then the numbers in the current post. Last fiddled with by LaurV on 2012-08-29 at 11:22 |
|
|
|
|
|
|
#1554 | |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
11100001101012 Posts |
Quote:
Code:
#define COUNT 119
int multipliers[COUNT] = { 6, 8, 12, 16, 18, 24, 32,
40, 48, 64, 72, 80, 96, 120,
128, 144, 160, 192, 224, 240, 256,
288, 320, 336, 384, 448, 480, 512,
576, 640, 672, 768, 800, 864, 896,
960, 1024, 1120, 1152, 1200, 1280, 1344,
1440, 1536, 1600, 1680, 1728, 1792, 1920,
2048, 2240, 2304, 2400, 2560, 2688, 2880,
3072, 3200, 3360, 3456, 3584, 3840, 4000,
4096, 4480, 4608, 4800, 5120, 5376, 5600,
5760, 6144, 6400, 6720, 6912, 7168, 7680,
8000, 8192, 8960, 9216, 9600, 10240, 10752,
11200, 11520, 12288, 12800, 13440, 13824, 14366,
15360, 16000, 16128, 16384, 17920, 18432, 19200,
20480, 21504, 22400, 23040, 24576, 25600, 26880,
29672, 30720, 32000, 32768, 34992, 36864, 38400,
40960, 46080, 49152, 51200, 55296, 61440, 65536 };
// Largely copied from Prime95's jump tables, up to 32M
// Support up to 64M, the maximum length with threads == 1024
...
int len, i, estimate = q/20;
for(i = 0; i < COUNT; i++) {
len = 1024*multipliers[i];
if( len >= estimate )
{
return len;
}
}
Last fiddled with by Dubslow on 2012-08-29 at 20:25 Reason: [strike] |
|
|
|
|
|
|
#1555 |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
We need a switch like -fft that does more than just q/20 and then increase until >=. When enabled it can a test several FFT lengths, log the time and error for each and then select the best one for that particular exponent. If a worktodo file is used, then it runs the FFT test when each exponent is started. Once an FFT is selected, it will need to be able to put the FFT into the worktodo file for that exponent. The main problem is how many FFTs to test before it's a waste of time. (If LaurV's suggestion can be vetted, it may be possible to narrow down the FFTs to a small enough number to test all each time). Once enough test data is collected and reviewed, it may be possible to have the program select a particular set of FFTs to test based on the exponent number and GPU chipset.
One thing I noticed, when the .ini file contains a particular FFT length, if the program needs to change FFT sizes, it always goes up. However, I was testing smaller exponents that needed smaller FFTs (it took me a while to figure out why the program was failing; then I remembered the FFT size in the .ini file). The mentioned test above could also be used to select correct FFTs for all exponents if the default FFT is too big for the exponent (which caused serious rounding errors). (I guess if the -fft switch can be implemented, there will be no reason to specify FFTs in the .ini file. One could put an FFT that is incorrect in the worktodo though.) Thoughts? ------ So far, testing of the new 2.04 beta is going well, for me. I was able to place many smaller exponents in the worktodo file and they all continued just fine. My DC still has a while left though... How is the testing going for everyone else? |
|
|
|
|
|
#1556 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
I have successfully completed 13 DC's and 2 LL's with 2.04-Beta-3.2-sm_13-x64. I think there were two times when I saw the Corrupt Save File cause a restart. I spotted these pretty quickly and was able to resume from a very recent good Save File with little lost work time.
|
|
|
|
|
|
#1557 | |
|
"Jerry"
Nov 2011
Vancouver, WA
112310 Posts |
Quote:
|
|
|
|
|
|
|
#1558 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Would this creation date be the latest?
Friday, August 03, 2012, 9:21:17 AMI just downloaded it to be sure, but the one I was running has the same date. So....I guess I probably have been running the latest version. I confess that I do not entirely understand the file locking issue. I think most or all of the savefile corruption episodes were associated with unrelated (I think) BSODs. I have not seen CL restart (corrupt savefile) in the last 5-6 runs. Please ask if there's other data you want. Thanks to flash and dubslow (EDIT: and LaurV!) for all their work on this project. Bravo, Guys! Last fiddled with by kladner on 2012-08-30 at 16:30 |
|
|
|
|
|
#1559 |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
160658 Posts |
|
|
|
|
|
|
#1560 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
100111101011102 Posts |
|
|
|
|
|
|
#1561 | |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
Quote:
Go here. The lastest build is 28 Aug 2012Agree, and many others! I just make it compile on Windows
Last fiddled with by flashjh on 2012-08-30 at 18:24 |
|
|
|
|
|
|
#1562 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Thanks Jerry. Done!
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Don't DC/LL them with CudaLucas | LaurV | Data | 131 | 2017-05-02 18:41 |
| CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8 | Brain | GPU Computing | 13 | 2016-02-19 15:53 |
| CUDALucas: which binary to use? | Karl M Johnson | GPU Computing | 15 | 2015-10-13 04:44 |
| settings for cudaLucas | fairsky | GPU Computing | 11 | 2013-11-03 02:08 |
| Trying to run CUDALucas on Windows 8 CP | Rodrigo | GPU Computing | 12 | 2012-03-07 23:20 |