|
|
#2135 | |
|
Jan 2011
Dudley, MA, USA
7310 Posts |
Quote:
My maxGridSize[0] is 2^32-1, and 1024*maxGridSize[0] overflows the 32-bit integer type. Code:
--- CUDALucas.cu.orig
+++ CUDALucas.cu
@@ -2098,7 +2098,7 @@
fft = choose_fft_length(0, &j);
while(fft <= n) {
- if(isReasonable(fft) <= 1 && fft <= 1024 * g_dev.maxGridSize[0] && fft % 1024 == 0) {
+ if(isReasonable(fft) <= 1 && fft / 1024 <= g_dev.maxGridSize[0] && fft % 1024 == 0) {
cufftSafeCall (cufftPlan1d (&g_plan, fft / 2, CUFFT_Z2Z, 1));
for(k = 0; k < 2; k++) {
for (t = s[k]; t < e[k]; t++) {
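The patch above swaps a multiplication for a division so that no intermediate value leaves the range of `int`. The sketch below is only an illustration of that idea, not code from CUDALucas: the helper names `fits_grid_divide`, `fits_grid_wide` and `product_overflows_int` are hypothetical, and it assumes a 32-bit `int`. The two checks agree whenever `fft` is a multiple of 1024, which the patched condition already requires through `fft % 1024 == 0`.

```c
#include <assert.h>
#include <limits.h>

/* Patched form of the check: divide fft instead of multiplying the grid
   limit, so no intermediate value exceeds the range of int. */
static int fits_grid_divide(int fft, int max_grid_x)
{
    assert(fft >= 0 && max_grid_x >= 0);
    return fft / 1024 <= max_grid_x;
}

/* Reference form: the same bound with the product computed in long long.
   It agrees with the divide form whenever fft is a multiple of 1024. */
static int fits_grid_wide(int fft, int max_grid_x)
{
    assert(fft >= 0 && max_grid_x >= 0);
    return (long long)fft <= 1024LL * (long long)max_grid_x;
}

/* Nonzero when 1024 * max_grid_x cannot be represented in an int,
   which is the situation the original condition ran into. */
static int product_overflows_int(int max_grid_x)
{
    return 1024LL * (long long)max_grid_x > (long long)INT_MAX;
}
```

With a small limit such as 65535 the product still fits in an `int`, which may be why the original condition went unnoticed on older cards; that explanation is a guess, not something stated in the thread.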
|
|
|
|
|
|
|
#2136 |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
3²×5×7 Posts |
Thanks for pointing that out. It was fixed in r62, but by some form of inattention, found its way back into r63.
Edit: r64 is up, refixing the issue. Last fiddled with by owftheevil on 2014-03-03 at 15:30 |
|
|
|
|
|
#2137 | |
|
"Patrik Johansson"
Aug 2002
Uppsala, Sweden
110101001₂ Posts |
Quote:
All eight tests I ran for that range resulted in a non-matching residue, four of which were later confirmed bad by mprime. (The other four have tests by mprime in progress.) Could it be that the underclock was stable only for certain FFT lengths, but not for others? I don't remember what self tests I ran 17 months ago.
My card is a Gigabyte GTX 570: GV-N570OC-13I V2.0
Windows 7 Home Premium, 64 bit (SP1).
Nvidia driver 306.97
One nearby exponent, same FFT, completed a verified test on another card and machine.
I underclocked the memory of the card producing bad tests another 100 MHz down to 1700 MHz and have started another test in the range I got the bad ones.
List of exponents completed by this card in 2014:
Code:
M( 33336041 )C, 0xa1ba4b60955e507b, n = 1835008, CUDALucas v2.03 Verified
M( 33336091 )C, 0xfeb4555212d4f4ee, n = 1835008, CUDALucas v2.03 Verified
M( 33336169 )C, 0xdbd40eaf554b2675, n = 1835008, CUDALucas v2.03 Verified
M( 33336197 )C, 0xbed813ed9c9d16dc, n = 1835008, CUDALucas v2.03 Verified
M( 33336221 )C, 0xc9af6488b22ce5da, n = 1835008, CUDALucas v2.03 Verified
M( 33336227 )C, 0xeed747f63850cd__, n = 1835008, CUDALucas v2.03 Unverified
M( 33338551 )C, 0xaa883a35b9a57740, n = 1835008, CUDALucas v2.03 Verified
M( 33338687 )C, 0x2aff2f81dea81d04, n = 1835008, CUDALucas v2.03 Verified
M( 33338759 )C, 0x15ba6062f66d7f3c, n = 1835008, CUDALucas v2.03 Verified
M( 33338801 )C, 0xd43485f4feddbec9, n = 1835008, CUDALucas v2.03 Verified
M( 33338861 )C, 0xb4e5931bfa1a4323, n = 1835008, CUDALucas v2.03 Verified
M( 33338911 )C, 0x2f179cdf89f39c41, n = 1835008, CUDALucas v2.03 Verified
M( 33338941 )C, 0xc9654436bc93afec, n = 1835008, CUDALucas v2.03 Verified
M( 33339049 )C, 0xefc01fd742be9490, n = 1835008, CUDALucas v2.03 Verified
M( 33339079 )C, 0x0f5a27109253cba5, n = 1835008, CUDALucas v2.03 Verified
M( 33340217 )C, 0xfebe365cd4d8262b, n = 1835008, CUDALucas v2.03 Verified
M( 33340259 )C, 0x803db62a1d4878c4, n = 1835008, CUDALucas v2.03 Verified
M( 33340273 )C, 0xbf815f87e5a244cd, n = 1835008, CUDALucas v2.03 Verified
M( 33340331 )C, 0x5b8b46fd69e1cf3f, n = 1835008, CUDALucas v2.03 Verified
M( 33340399 )C, 0x6055989608290c9f, n = 1835008, CUDALucas v2.03 Verified
M( 33338969 )C, 0xac6b1e36d116665e, n = 1835008, CUDALucas v2.03 Verified
M( 33340603 )C, 0x9a47d238a35be14b, n = 1835008, CUDALucas v2.03 Verified
M( 33340693 )C, 0xc8a725591822a7__, n = 1835008, CUDALucas v2.03 Unverified
M( 33340729 )C, 0x13115d90972b9106, n = 1835008, CUDALucas v2.03 Verified
M( 33340751 )C, 0x61fd1ed6ada34567, n = 1835008, CUDALucas v2.03 Verified
M( 33340969 )C, 0x12228227855bafd1, n = 1835008, CUDALucas v2.03 Verified
M( 33340973 )C, 0xf6bd7103e731cf__, n = 1835008, CUDALucas v2.03 Unverified
M( 33341003 )C, 0xe5da51c09e871f95, n = 1835008, CUDALucas v2.03 Verified
M( 33340999 )C, 0x2bb766951d08d1b6, n = 1835008, CUDALucas v2.03 Verified
M( 33341023 )C, 0x9bef45d1fac75275, n = 1835008, CUDALucas v2.03 Verified
M( 33341047 )C, 0x9350917fded037b7, n = 1835008, CUDALucas v2.03 Verified
M( 33341069 )C, 0x5ad9645e801a3b16, n = 1835008, CUDALucas v2.03 Verified
M( 34440337 )C, 0x8c07d05f66675a37, n = 1966080, CUDALucas v2.03 Unverified, later bad
M( 34440383 )C, 0xa5e3277786752569, n = 1966080, CUDALucas v2.03 Unverified, later bad
M( 34440391 )C, 0xac5826c2d7e1caf5, n = 1966080, CUDALucas v2.03 Unverified, later bad
M( 34440491 )C, 0x4752da9b55ebaea5, n = 1966080, CUDALucas v2.03 Unverified, later bad
M( 34440643 )C, 0xa9e459ad5338c6__, n = 1966080, CUDALucas v2.03 Unverified
M( 34440647 )C, 0x5c2e2245038073__, n = 1966080, CUDALucas v2.03 Unverified
M( 34440649 )C, 0x266f082ba2d23f__, n = 1966080, CUDALucas v2.03 Unverified
M( 34440739 )C, 0x0624bcde0c1aeb__, n = 1966080, CUDALucas v2.03 Unverified
M( 33407953 )C, 0x3b0945f27b1c7100, n = 1835008, CUDALucas v2.03 Verified
M( 33408797 )C, 0x5c92f5d23fb209b9, n = 1835008, CUDALucas v2.03 Verified |
|
|
|
|
|
|
#2138 |
|
Mar 2012
Germany
1A₁₆ Posts |
Somebody said: CudaLucas.ini SaveAllCheckpoints=1. A very important feature to enable if you want to roll back to a "good" checkpoint.
But I don't understand how to roll back. Does anybody have any idea? Thx. |
|
|
|
|
|
#2139 |
|
"Jerry"
Nov 2011
Vancouver, WA
1123₁₀ Posts |
That feature saves a new file every time CUDALucas writes a checkpoint. It takes up a lot of space, but it lets you go back to any of those files. To roll back, rename one of the saved files so that CUDALucas picks it up when it is restarted.
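As a rough sketch of that rollback step, the C helpers below copy a saved checkpoint over the active checkpoint names instead of renaming it, so the saved copy survives. Everything here is an assumption for illustration: the functions `active_name`, `copy_file` and `roll_back` are hypothetical, the reading of "cxxxxxxx or txxxxxxx" (mentioned in a later reply) as a `c` or `t` prefix followed by the exponent digits is a guess, and the path of the saved file must come from your own save folder.

```c
#include <assert.h>
#include <stdio.h>

/* Build an active checkpoint name such as "c33336041".
   Assumption: the x's in "cxxxxxxx" stand for the exponent digits. */
static int active_name(char *out, size_t cap, char prefix, unsigned long exponent)
{
    assert(prefix == 'c' || prefix == 't');
    int n = snprintf(out, cap, "%c%lu", prefix, exponent);
    return (n > 0 && (size_t)n < cap) ? 0 : -1;
}

/* Copy src over dst byte for byte; returns 0 on success, -1 on any error. */
static int copy_file(const char *src, const char *dst)
{
    FILE *in = fopen(src, "rb");
    if (!in) return -1;
    FILE *out = fopen(dst, "wb");
    if (!out) { fclose(in); return -1; }

    char buf[4096];
    size_t got;
    int rc = 0;
    while ((got = fread(buf, 1, sizeof buf, in)) > 0) {
        if (fwrite(buf, 1, got, out) != got) { rc = -1; break; }
    }
    if (ferror(in)) rc = -1;
    fclose(in);
    if (fclose(out) != 0) rc = -1;
    return rc;
}

/* Overwrite both assumed active checkpoint files with the chosen saved file. */
static int roll_back(const char *saved, unsigned long exponent)
{
    const char prefixes[2] = { 'c', 't' };
    char name[32];
    for (int i = 0; i < 2; i++) {
        if (active_name(name, sizeof name, prefixes[i], exponent) != 0) return -1;
        if (copy_file(saved, name) != 0) return -1;
    }
    return 0;
}
```

Run something like this only while CUDALucas is stopped, and check the result against the files actually present in your working folder before restarting.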
|
|
|
|
|
|
#2140 |
|
Mar 2012
Germany
2·13 Posts |
|
|
|
|
|
|
#2141 |
|
Romulan Interpreter
Jun 2011
Thailand
25BF₁₆ Posts |
Rename it to cxxxxxxx or txxxxxxx, overwriting the existing last checkpoints. Look in the folder and see how the files are created and you will understand.
Caution (for the attention of owftheevil): the latest version of CUDALucas has a bug (not present in 2.04). If you have a mismatched residue, you have to resume not from the last file with the same residue, but from the one before it. This is because the residues in the file names are shifted by one compared with the real residues written on screen. Dubslow's version didn't make this mess, so it is something newly introduced. If you resume from the last saved file having the same residue, it is possible that the respective file is already corrupted. To understand what I mean, compare the names of the checkpoint files with the residues written on screen.
Edit: I think this affects cudaPM1 too.
Last fiddled with by LaurV on 2014-03-18 at 18:47 |
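The "one before" rule from the caution above can be written down as a small selection function. This is a hypothetical sketch, not part of CUDALucas: `resume_index` and its `matches` array are invented for illustration, with `matches[i]` nonzero when the residue in the name of saved checkpoint `i` (oldest first) agrees with the reference run.

```c
#include <assert.h>
#include <stddef.h>

/* Return the index of the checkpoint to resume from, or -1 if none is safe.
   Because the residues in the file names are shifted by one, the last file
   whose name still matches may already be corrupted, so step back one more. */
static int resume_index(const int *matches, size_t count)
{
    assert(matches != NULL || count == 0);

    /* Length of the leading run of checkpoints whose named residue matches. */
    size_t good = 0;
    while (good < count && matches[good]) good++;

    /* good - 1 is the last matching file; good - 2 is the one before it. */
    return good >= 2 ? (int)(good - 2) : -1;
}
```

For five saved files where the first three match and the last two do not, the function picks index 1, the second-oldest file, rather than index 2.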
|
|
|
|
|
#2142 |
|
"Jerry"
Nov 2011
Vancouver, WA
2143₈ Posts |
This is in 2.05?
BTW - Thanks LaurV for the help.
Last fiddled with by flashjh on 2014-03-18 at 18:58 |
|
|
|
|
|
#2143 | |
|
"Carl Darby"
Oct 2012
Spring Mountains, Nevada
3²×5×7 Posts |
Quote:
|
|
|
|
|
|
|
#2144 |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
I haven't posted a Windows executable since r62. I'll get updates done once you confirm it's fixed.
|
|
|
|
|
|
#2145 |
|
Mar 2012
Germany
2×13 Posts |
|
|
|
|