#1541
Feb 2012
195₁₆ Posts
I upgraded from CUDALucas-2.04 Beta-4.1-sm_21-x64.exe to CUDALucas-2.04 Beta-4.2-sm_30-x64.exe and now I am getting the following error:
CUDALucas.cu(163) : cufftSafeCall() CUFFT error 6: CUFFT_EXEC_FAILED
I thought I had bad CUDA DLLs, but I downloaded fresh ones from the recommended site, and I still get the error. Please advise. Thank you.
#1542
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
1110000110101₂ Posts
What card are you running it on? An sm_30 build needs at least a Kepler card. Use the build for the architecture your card belongs to: GTX [45][78]0s are 2.0, [45][1-6]0s are 2.1, and everything <= GTX 2** is 1.x.
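As a rough illustration of that rule of thumb, here is a minimal Python sketch that maps a three-digit GTX model number to the compute capability given in this post. The helper name `guess_compute_capability` is made up for the example, and it encodes only the three patterns quoted here, so anything else (Kepler cards included) comes back as `None`. Treat it as a sketch and check NVIDIA's own list for your exact card.

```python
import re

def guess_compute_capability(model: int):
    """Rule-of-thumb mapping from a three-digit GTX model number
    to a compute capability string.

    Hypothetical helper: it encodes only the patterns quoted in the
    post, not an official NVIDIA table.
    """
    s = str(model)
    if re.fullmatch(r"[45][78]0", s):    # GTX 470/480/570/580
        return "2.0"
    if re.fullmatch(r"[45][1-6]0", s):   # e.g. GTX 460, GTX 560
        return "2.1"
    if model < 300:                      # everything <= GTX 2**
        return "1.x"
    return None                          # not covered by the rule

for card in (580, 460, 285, 680):
    print(card, guess_compute_capability(card))
```

A 580 comes out as 2.0 under this rule, so an sm_30 binary would not be expected to run on it.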
#1544
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
3·29·83 Posts
Quote:
Note that there probably won't be a performance increase from switching CUDA versions.
#1545
Feb 2012
3⁴·5 Posts
I see. Thank you.
#1546
"Jerry"
Nov 2011
Vancouver, WA
2143₈ Posts
I can compile just about any combination, but the problem is what Dubslow already pointed out: you won't see much of an improvement.
The version that is fastest on all of my 580s is CUDA_3.2 | sm_1.3. You should try that one and let us know how it works for you.
#1547
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts
New versions of 2.04 beta are uploaded here. They are based on r32 from SourceForge, which is the baseline update discussed earlier that included the filelocking. I had to make some minor adjustments to CUDALucas.cu and Parse.c to get it to compile, but I did not make any changes to the functions.
Without digging into exactly what went wrong right now: while reviewing the changes between r32 and r37 to make it compile, I found that the modified open_s function that I added for MSVS, which used _sopen_s, was probably wrong and caused the problem... hopefully. I have committed the changes to r38. Everyone please test this build for the filelocking error. Thanks! I compiled a 4.2 | sm_20 version, in case you need it.
Last fiddled with by flashjh on 2012-08-28 at 05:13
#1548
"Oliver"
Mar 2005
Germany
11·101 Posts
Hi,
Quote:
Quote:
There is only one point where I'm unsure: I don't know whether you can run both apps (with different CUDA versions) concurrently or not.
Oliver
#1549
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
16065₈ Posts
Quote:
Edit: r33 and r37 had some changes, including updated FFT lengths. Those need to be reincorporated. I'll try to make those into an r39.
Last fiddled with by Dubslow on 2012-08-28 at 18:13
#1550
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
3·29·83 Posts
Okay, slight change of plans: I recall LaurV somewhere saying that a larger FFT length was faster than some smaller ones in CUDALucas' table, but I wasn't able to relocate that post. I will also add the signal-handling fix discussed before to r39.
In the meantime, all Windows users should test flash's latest compile for the filelocking bug; note, however, that compared to earlier beta releases, some FFT lengths might not appear. If the bug is confirmed killed, then the final (non-beta) release of 2.04 will reincorporate the changes from the old binaries that are missing from the new ones (i.e., it will be r39). r39 will be committed when LaurV responds.
#1551
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts
Quote: