![]() |
|
|
#2157 | |
|
Romulan Interpreter
Jun 2011
Thailand
25BF16 Posts |
Quote:
(edit: the only idea is to not lose a LONG fft file with all reasonable sizes inside, without notification (like it is happening). Maybe I worked one full day to get that file and I don't have backup! I would be very angry than! - luckily I had more folders with the same content, having more of the same cards, and I had copies of the file in those folders, it may not always be the case)(edit 2: optimization of threads works very nice, and faster than the older version. The only unchanged thing is that the work is saved at the end, which may result in trouble if there is a crash, but here is no problem, this optimization is only done once in the lifetime, and it can be split in few consecutive small jobs, I mean I don't need to use "-threadbench 1 20480 6 0", but use 3-4 "splits". Which I was enough stupid not to think about, and the job took since the last post. Fortunately finished with success )
Last fiddled with by LaurV on 2014-03-21 at 16:56 |
|
|
|
|
|
|
#2158 | |
|
Apr 2014
7·17 Posts |
Thanks for the restart batch file.
I am getting the API runtime errors, even with the latest beta build r65 (running toolkit 5.0 and latest 335.23 nvidia drivers )....however this is only happening on my gtx 570, not my 280. I have noticed that the 570 will run stable until I stop the job and go to mfaktc and then switch back to the LL job. It'll continue happening until I reboot my box. So far that seems to be what triggers the API errors for me. I have never seen this behavior on my 280 even when switching between cuda lucas and mfaktc. Quote:
|
|
|
|
|
|
|
#2159 |
|
Mar 2012
Germany
2×13 Posts |
Round off error at iteration = 21463800, err = 0.5 > 0.40, fft = 3584K.
Increasing fft and restarting from last checkpoint. Using threads: square 128, splice 256. Continuing M62494429 @ iteration 21460001 with fft length 4096K, 34.34% done After some errors more, the programm stops. If I restart it tells at the end (example): Processing result: M( x )C, 0xy, offset = 6684, n = 4096K, CUDALucas v2.05 Beta, g_AID: A6ACCD2C719C7543871E42683998589C. I think, the result ist bad. |
|
|
|
|
|
#2160 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
32·5·7 Posts |
Can you recall what the "more errors" were?
The root problem is most likely memory, at least that's the only time I see a roundoff error like that. But I don't know whats going on with the apparent output of a result after the errors. |
|
|
|
|
|
#2161 |
|
Mar 2012
Germany
2×13 Posts |
No, I cant The Logfile doesn´t exist anymore. I tried the next with the "savefile"-Option. Only if I reduce the memory-speed (-500 MHz) and the "Power Limit" (57%) -> GPU = 692 MHz, I get less errors. But what does "g_AID" mean? (last result: M( 62494429 )C, 0x7191357b114a13__, offset = 31262106, n = 4096K, CUDALucas v2.05 Beta, g_AID: EEFBC9895C77C54B1AC676621FFA____)
How can I see, how my "Computer must be proven reliable"? Ohhhh, I forget to log in in GIMPS and lost my result. 157 GHz-days! Now I found in my assginments the same exponent as double check! Last fiddled with by Batalov on 2014-04-09 at 17:04 Reason: masked parts of output |
|
|
|
|
|
#2162 | ||
|
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
9,497 Posts |
Quote:
Quote:
|
||
|
|
|
|
|
#2163 | |
|
Mar 2012
Germany
1A16 Posts |
Thank very much for this hint. Maybe, he will solve my old Problem with M332,224,379
Quote:
Last fiddled with by MikeBerlin on 2014-04-09 at 18:43 |
|
|
|
|
|
|
#2164 |
|
Mar 2012
Germany
328 Posts |
|
|
|
|
|
|
#2165 |
|
Sep 2006
The Netherlands
2D916 Posts |
hi! Sorry to jump in on this thread.
How efficient is the code running? https://developer.nvidia.com/cuFFT I see there at bit larger transforms cuFFT gets at M2090 tesla efficiency of under 100 Gflop. Didn't checkout code yet - will soon. This Tesla delivers 666 Gflop. Not counting fused-multiply-adds (didn't check yet whether their code uses them - assuming not) then it's 333 Gflop. So efficiency of around 30%. How is efficiency there for CUDALucas at bit larger transforms? Interested in gpgpu fft for Riesel :) |
|
|
|
|
|
#2166 | |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
CUDALucas 2.05Beta r67 is posted for Windows. CUDA 4.2, 5.0, 5.5 and 6.0
CUDA 6.0 Libs are here Quote:
|
|
|
|
|
|
|
#2167 |
|
Apr 2010
Over the rainbow
2·1,303 Posts |
Code:
CUDALucas_205Beta_CUDA6.0-x64_r67.exe -cufftbench 1 4096 1 ------- DEVICE 0 ------- name GeForce GTX 750 Ti Compatibility 5.0 clockRate (MHz) 1110 memClockRate (MHz) 2700 totalGlobalMem 2147483648 totalConstMem 65536 l2CacheSize 2097152 sharedMemPerBlock 49152 regsPerBlock 65536 warpSize 32 memPitch 2147483647 maxThreadsPerBlock 1024 maxThreadsPerMP 2048 multiProcessorCount 5 maxThreadsDim[3] 1024,1024,64 maxGridSize[3] 2147483647,65535,65535 textureAlignment 512 deviceOverlap 1 Using threads: square 256, splice 128. Last fiddled with by firejuggler on 2014-04-28 at 06:36 |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Don't DC/LL them with CudaLucas | LaurV | Data | 131 | 2017-05-02 18:41 |
| CUDALucas / cuFFT Performance on CUDA 7 / 7.5 / 8 | Brain | GPU Computing | 13 | 2016-02-19 15:53 |
| CUDALucas: which binary to use? | Karl M Johnson | GPU Computing | 15 | 2015-10-13 04:44 |
| settings for cudaLucas | fairsky | GPU Computing | 11 | 2013-11-03 02:08 |
| Trying to run CUDALucas on Windows 8 CP | Rodrigo | GPU Computing | 12 | 2012-03-07 23:20 |