20120319, 07:54  #1024 
Mar 2010
3×137 Posts 
The results are always the same for 4 different modes: gpu0 cl1, gpu0 cl2, gpu1 cl1, gpu1 cl2.
Code:
DEVICE:1 name GeForce GTX 480 totalGlobalMem 1610612736 sharedMemPerBlock 49152 regsPerBlock 32768 warpSize 32 memPitch 2147483647 maxThreadsPerBlock 1024 maxThreadsDim[3] 1024,1024,64 maxGridSize[3] 65535,65535,65535 totalConstMem 65536 major.minor 2.0 clockRate 1640000 textureAlignment 512 deviceOverlap 1 multiProcessorCount 15 Iteration 10000 M( 86243 )C, 0x23992ccd735a03d9, n = 8192, CUDALucas v1.67 err = 1.901e007 (0:02 real, 0.2024 ms/iter, ETA 0:14) Iteration 10000 M( 132049 )C, 0x4c52a92b54635f9e, n = 8192, CUDALucas v1.67 err = 0.0004187 (0:02 real, 0.2025 ms/iter, ETA 0:24) Iteration 10000 M( 216091 )C, 0x30247786758b8792, n = 16384, CUDALucas v1.67 err = 1.15e005 (0:02 real, 0.2015 ms/iter, ETA 0:40) Iteration 10000 M( 756839 )C, 0x5d2cbe7cb24a109a, n = 40960, CUDALucas v1.67 err = 0.0317 (0:03 real, 0.2481 ms/iter, ETA 3:03) Iteration 10000 M( 859433 )C, 0x3c4ad525c2d0aed0, n = 49152, CUDALucas v1.67 err = 0.009213 (0:02 real, 0.2503 ms/iter, ETA 3:30) Iteration 10000 M( 1257787 )C, 0x3f45bf9bea7213ea, n = 73728, CUDALucas v1.67 err = 0.006912 (0:03 real, 0.3152 ms/iter, ETA 6:30) Iteration 10000 M( 1398269 )C, 0xa4a6d2f0e34629db, n = 73728, CUDALucas v1.67 err = 0.08477 (0:04 real, 0.3244 ms/iter, ETA 7:27) Iteration 10000 M( 2976221 )C, 0x2a7111b7f70fea2f, n = 163840, CUDALucas v1.67 err = 0.04649 (0:05 real, 0.4984 ms/iter, ETA 24:35) Iteration 10000 M( 3021377 )C, 0x6387a70a85d46baf, n = 163840, CUDALucas v1.67 err = 0.06791 (0:06 real, 0.5889 ms/iter, ETA 29:32) Iteration 10000 M( 6972593 )C, 0x88f1d2640adb89e1, n = 393216, CUDALucas v1.67 err = 0.04772 (0:10 real, 1.0405 ms/iter, ETA 2:00:41) Iteration 10000 M( 13466917 )C, 0x9fdc1f4092b15d69, n = 786432, CUDALucas v1.67 err = 0.0295 (0:18 real, 1.7384 ms/iter, ETA 6:29:41) Iteration 10000 M( 20996011 )C, 0x5fc58920a821da11, n = 1179648, CUDALucas v1.67 err = 0.08511 (0:22 real, 2.2505 ms/iter, ETA 13:06:55) Iteration 10000 M( 24036583 )C, 0xcbdef38a0bdc4f00, n = 1310720, CUDALucas v1.67 err = 0.2073 (0:26 real, 2.5972 ms/iter, ETA 17:19:44) Iteration 10000 M( 25964951 )C, 0x62eb3ff0a5f6237c, n = 1572864, CUDALucas v1.67 err = 0.01915 (0:31 real, 3.0897 ms/iter, ETA 22:16:18) Iteration 10000 M( 30402457 )C, 0x0b8600ef47e69d27, n = 1835008, CUDALucas v1.67 err = 0.02111 (0:35 real, 3.4515 ms/iter, ETA 29:08:11) Iteration 10000 M( 32582657 )C, 0x02751b7fcec76bb1, n = 1835008, CUDALucas v1.67 err = 0.1135 (0:35 real, 3.4586 ms/iter, ETA 31:17:25) err = 0.378309, increasing n from 1966080 Iteration 10000 M( 37156667 )C, 0x67ad7646a1fad514, n = 2097152, CUDALucas v1.67 err = 0.1061 (0:35 real, 3.4426 ms/iter, ETA 35:30:59) Iteration 10000 M( 42643801 )C, 0x8f90d78d5007bba7, n = 2359296, CUDALucas v1.67 err = 0.1855 (0:43 real, 4.2987 ms/iter, ETA 50:54:15) Iteration 10000 M( 43112609 )C, 0xe86891ebf6cd70c4, n = 2359296, CUDALucas v1.67 err = 0.2697 (0:43 real, 4.3005 ms/iter, ETA 51:29:13) 
20120319, 08:10  #1025 
Jul 2009
Tokyo
2·5·61 Posts 

20120319, 08:17  #1026 
Mar 2010
3×137 Posts 
Actually, it way my mistake, since GPU2, which has no monitor output attached, and is not in SLI, was not stress tested.
I found out that it was unstable at certain clock. Now running DC on smallest exponent again. 
20120319, 09:23  #1027 
Romulan Interpreter
Jun 2011
Thailand
5^{2}×7×53 Posts 
Any sources and binaries for v1.68? (the one with interactive aggressive/polite mode). I will be home in about 23 hours and I am eager to try it. Anyhow, if not, I will still keep you posted with v1.65's progress. I understand that you have other things to do too, sorry for being such a pain in the butt.

20120319, 12:00  #1028 
Mar 2010
3×137 Posts 
DC successful !
2^6972593  1 is indeed a prime 
20120319, 12:37  #1029 
Aug 2008
Good old Germany
8D_{16} Posts 
Hi guys.
Could someone please tell me, how the inputfile for CL had to look? I tried the exponents alone and the line from the worktodo.txt of P95 but there always CL tells me to start with the first exponent and then closes. Thanks in advance. 
20120319, 12:51  #1030 
Mar 2010
3·137 Posts 
Code:
CUDALucas.exe d 1 threads 512 c 25000 t agressive 6972593 
20120319, 12:54  #1031 
Romulan Interpreter
Jun 2011
Thailand
243B_{16} Posts 
version 1.67, polite and aggressive:
(still not interactively changeable) Code:
CUDALucas1.67.cuda4.1.sm_20.x64.exe d 1 r DEVICE:1 name GeForce GTX 580 totalGlobalMem 1610612736 sharedMemPerBlock 49152 regsPerBlock 32768 warpSize 32 memPitch 2147483647 maxThreadsPerBlock 1024 maxThreadsDim[3] 1024,1024,64 maxGridSize[3] 65535,65535,65535 totalConstMem 65536 major.minor 2.0 clockRate 1564000 textureAlignment 512 deviceOverlap 1 multiProcessorCount 16 Iteration 10000 M( 86243 )C, 0x23992ccd735a03d9, n = 8192, CUDALucas v1.67 err = 1.919e007 (0:02 real, 0.2334 ms/iter, ETA 0:16) Iteration 10000 M( 132049 )C, 0x4c52a92b54635f9e, n = 8192, CUDALucas v1.67 err = 0.0004515 (0:02 real, 0.2340 ms/iter, ETA 0:28) Iteration 10000 M( 216091 )C, 0x30247786758b8792, n = 16384, CUDALucas v1.67 err = 1.14e005 (0:03 real, 0.2316 ms/iter, ETA 0:46) Iteration 10000 M( 756839 )C, 0x5d2cbe7cb24a109a, n = 40960, CUDALucas v1.67 err = 0.0295 (0:03 real, 0.2828 ms/iter, ETA 3:29) Iteration 10000 M( 859433 )C, 0x3c4ad525c2d0aed0, n = 49152, CUDALucas v1.67 err = 0.009473 (0:02 real, 0.2930 ms/iter, ETA 4:06) Iteration 10000 M( 1257787 )C, 0x3f45bf9bea7213ea, n = 73728, CUDALucas v1.67 err = 0.006119 (0:04 real, 0.3601 ms/iter, ETA 7:26) Iteration 10000 M( 1398269 )C, 0xa4a6d2f0e34629db, n = 73728, CUDALucas v1.67 err = 0.09116 (0:04 real, 0.3570 ms/iter, ETA 8:12) Iteration 10000 M( 2976221 )C, 0x2a7111b7f70fea2f, n = 163840, CUDALucas v1.67 err = 0.04841 (0:05 real, 0.5641 ms/iter, ETA 27:49) Iteration 10000 M( 3021377 )C, 0x6387a70a85d46baf, n = 163840, CUDALucas v1.67 err = 0.06637 (0:06 real, 0.5643 ms/iter, ETA 28:18) Iteration 10000 M( 6972593 )C, 0x88f1d2640adb89e1, n = 393216, CUDALucas v1.67 err = 0.05295 (0:11 real, 1.1262 ms/iter, ETA 2:10:38) Iteration 10000 M( 13466917 )C, 0x9fdc1f4092b15d69, n = 786432, CUDALucas v1.67 err = 0.02841 (0:19 real, 1.8848 ms/iter, ETA 7:02:30) Iteration 10000 M( 20996011 )C, 0x5fc58920a821da11, n = 1179648, CUDALucas v1.67 err = 0.08614 (0:25 real, 2.4236 ms/iter, ETA 14:07:26) Iteration 10000 M( 24036583 )C, 0xcbdef38a0bdc4f00, n = 1310720, CUDALucas v1.67 err = 0.216 (0:27 real, 2.6855 ms/iter, ETA 17:55:06) Iteration 10000 M( 25964951 )C, 0x62eb3ff0a5f6237c, n = 1572864, CUDALucas v1.67 err = 0.01812 (0:32 real, 3.1922 ms/iter, ETA 23:00:37) Iteration 10000 M( 30402457 )C, 0x0b8600ef47e69d27, n = 1835008, CUDALucas v1.67 err = 0.02299 (0:35 real, 3.5650 ms/iter, ETA 30:05:40) Iteration 10000 M( 32582657 )C, 0x02751b7fcec76bb1, n = 1835008, CUDALucas v1.67 err = 0.1126 (0:36 real, 3.5962 ms/iter, ETA 32:32:08) err = 0.384875, increasing n from 1966080 Iteration 10000 M( 37156667 )C, 0x67ad7646a1fad514, n = 2097152, CUDALucas v1.67 err = 0.1081 (0:35 real, 3.5168 ms/iter, ETA 36:16:52) Iteration 10000 M( 42643801 )C, 0x8f90d78d5007bba7, n = 2359296, CUDALucas v1.67 err = 0.1898 (0:45 real, 4.4142 ms/iter, ETA 52:16:15) Iteration 10000 M( 43112609 )C, 0xe86891ebf6cd70c4, n = 2359296, CUDALucas v1.67 err = 0.2643 (0:41 real, 4.1197 ms/iter, ETA 49:19:18) >CUDALucas1.67.cuda4.1.sm_20.x64.exe d 1 aggressive r DEVICE:1 name GeForce GTX 580 totalGlobalMem 1610612736 sharedMemPerBlock 49152 regsPerBlock 32768 warpSize 32 memPitch 2147483647 maxThreadsPerBlock 1024 maxThreadsDim[3] 1024,1024,64 maxGridSize[3] 65535,65535,65535 totalConstMem 65536 major.minor 2.0 clockRate 1564000 textureAlignment 512 deviceOverlap 1 multiProcessorCount 16 Iteration 10000 M( 86243 )C, 0x23992ccd735a03d9, n = 8192, CUDALucas v1.67 err = 1.919e007 (0:01 real, 0.0802 ms/iter, ETA 0:05) Iteration 10000 M( 132049 )C, 0x4c52a92b54635f9e, n = 8192, CUDALucas v1.67 err = 0.0004515 (0:00 real, 0.0802 ms/iter, ETA 0:09) Iteration 10000 M( 216091 )C, 0x30247786758b8792, n = 16384, CUDALucas v1.67 err = 1.14e005 (0:01 real, 0.0792 ms/iter, ETA 0:15) Iteration 10000 M( 756839 )C, 0x5d2cbe7cb24a109a, n = 40960, CUDALucas v1.67 err = 0.0295 (0:01 real, 0.1082 ms/iter, ETA 1:20) Iteration 10000 M( 859433 )C, 0x3c4ad525c2d0aed0, n = 49152, CUDALucas v1.67 err = 0.009473 (0:02 real, 0.1181 ms/iter, ETA 1:39) Iteration 10000 M( 1257787 )C, 0x3f45bf9bea7213ea, n = 73728, CUDALucas v1.67 err = 0.006119 (0:01 real, 0.1842 ms/iter, ETA 3:48) Iteration 10000 M( 1398269 )C, 0xa4a6d2f0e34629db, n = 73728, CUDALucas v1.67 err = 0.09116 (0:02 real, 0.1939 ms/iter, ETA 4:27) Iteration 10000 M( 2976221 )C, 0x2a7111b7f70fea2f, n = 163840, CUDALucas v1.67 err = 0.04841 (0:04 real, 0.3753 ms/iter, ETA 18:30) Iteration 10000 M( 3021377 )C, 0x6387a70a85d46baf, n = 163840, CUDALucas v1.67 err = 0.06637 (0:04 real, 0.3770 ms/iter, ETA 18:54) Iteration 10000 M( 6972593 )C, 0x88f1d2640adb89e1, n = 393216, CUDALucas v1.67 err = 0.05295 (0:08 real, 0.7606 ms/iter, ETA 1:28:13) Iteration 10000 M( 13466917 )C, 0x9fdc1f4092b15d69, n = 786432, CUDALucas v1.67 err = 0.02841 (0:14 real, 1.4295 ms/iter, ETA 5:20:26) Iteration 10000 M( 20996011 )C, 0x5fc58920a821da11, n = 1179648, CUDALucas v1.67 err = 0.08614 (0:20 real, 1.9823 ms/iter, ETA 11:33:09) Iteration 10000 M( 24036583 )C, 0xcbdef38a0bdc4f00, n = 1310720, CUDALucas v1.67 err = 0.216 (0:23 real, 2.2765 ms/iter, ETA 15:11:21) Iteration 10000 M( 25964951 )C, 0x62eb3ff0a5f6237c, n = 1572864, CUDALucas v1.67 err = 0.01812 (0:28 real, 2.7817 ms/iter, ETA 20:03:04) Iteration 10000 M( 30402457 )C, 0x0b8600ef47e69d27, n = 1835008, CUDALucas v1.67 err = 0.02299 (0:31 real, 3.1177 ms/iter, ETA 26:19:07) Iteration 10000 M( 32582657 )C, 0x02751b7fcec76bb1, n = 1835008, CUDALucas v1.67 err = 0.1126 (0:31 real, 3.1220 ms/iter, ETA 28:14:44) err = 0.373917, increasing n from 1966080 Iteration 10000 M( 37156667 )C, 0x67ad7646a1fad514, n = 2097152, CUDALucas v1.67 err = 0.1081 (0:32 real, 3.1166 ms/iter, ETA 32:09:09) Iteration 10000 M( 42643801 )C, 0x8f90d78d5007bba7, n = 2359296, CUDALucas v1.67 err = 0.1898 (0:39 real, 3.9440 ms/iter, ETA 46:42:13) Iteration 10000 M( 43112609 )C, 0xe86891ebf6cd70c4, n = 2359296, CUDALucas v1.67 err = 0.2643 (0:40 real, 3.9444 ms/iter, ETA 47:13:22) 
20120319, 14:47  #1032  
"Kieren"
Jul 2011
In My Own Galaxy!
2^{2}·2,539 Posts 
Quote:
Code:
E:\CUDA\CUDALucas166.x64>CUDALucas1.66.cuda4.1.sm_21.x64 t c10000 threads 512 s check worktodo.txt DEVICE:0 name GeForce GTX 460 totalGlobalMem 1073741824 sharedMemPerBlock 49152 regsPerBlock 32768 warpSize 32 memPitch 2147483647 maxThreadsPerBlock 1024 maxThreadsDim[3] 1024,1024,64 maxGridSize[3] 65535,65535,65535 totalConstMem 65536 major.minor 2.1 clockRate 1700000 textureAlignment 512 deviceOverlap 1 multiProcessorCount 7 mkdir: cannot create directory `check': File exists Start test of file 'worktodo.txt' continuing work from a partial result M26116807 fft length = 1572864 iteration = 14178 Iteration 20000 M( 26116807 )C, 0xca672378e7d6596a, n = 1572864, CUDALucas v1.66 err = 0.02349 (0:37 real, 3.6748 ms/iter, ETA 26:37:56) Iteration 30000 M( 26116807 )C, 0x3252f697aa7b19ce, n = 1572864, CUDALucas v1.66 err = 0.02716 (1:03 real, 6.3077 ms/iter, ETA 45:41:43) ^C caught. Writing checkpoint. I hope this helps. EDIT: I stated incorrectly in a previous post that the worktodo.txt in the command line would be preceded by r. LaurV corrected this error. "r" runs a selftest. Last fiddled with by kladner on 20120319 at 14:50 

20120319, 15:07  #1033 
Aug 2008
Good old Germany
3×47 Posts 
Thanks a lot.
I found my error. CL created a ini file with the number of the line where to start. I deleted thiese file and then it worked. Again, thanks a lot. 
20120319, 16:17  #1034  
Dec 2009
Peine, Germany
331_{10} Posts 
Timings (best values)
Quote:
Code:
1.65 polite : M( 29309279 )C, n = 1835008, CUDALucas v1.65 err = 0.009593 (1:01 real, 6.0932 ms/iter, ETA 49:20:17) 1.67 polite : M( 29359303 )C, n = 1835008, CUDALucas v1.67 err = 0.009615 (0:57 real, 5.6353 ms/iter, ETA 39:39:58) 1.67 aggressive: M( 29359303 )C, n = 1835008, CUDALucas v1.67 err = 0.009195 (0:53 real, 5.3320 ms/iter, ETA 37:28:58) Code:
DEVICE:0 name GeForce GTX 560 Ti totalGlobalMem 1073741824 sharedMemPerBlock 49152 regsPerBlock 32768 warpSize 32 memPitch 2147483647 maxThreadsPerBlock 1024 maxThreadsDim[3] 1024,1024,64 maxGridSize[3] 65535,65535,65535 totalConstMem 65536 major.minor 2.1 clockRate 1645000 textureAlignment 512 deviceOverlap 1 multiProcessorCount 8 Last fiddled with by Brain on 20120319 at 16:18 Reason: typo 

Thread Tools  
Similar Threads  
Thread  Thread Starter  Forum  Replies  Last Post 
Don't DC/LL them with CudaLucas  LaurV  Data  131  20170502 18:41 
CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8  Brain  GPU Computing  13  20160219 15:53 
CUDALucas: which binary to use?  Karl M Johnson  GPU Computing  15  20151013 04:44 
settings for cudaLucas  fairsky  GPU Computing  11  20131103 02:08 
Trying to run CUDALucas on Windows 8 CP  Rodrigo  GPU Computing  12  20120307 23:20 