![]() |
|
|
#155 | |
|
Nov 2010
Germany
3×199 Posts |
Quote:
I'll update my home PC over the weekend, maybe I can reproduce the error there. The symptom looks a bit like something not initialized in the correct place ... and now it depends on memory layout or other rather random things. |
|
|
|
|
|
|
#156 |
|
"Ethan O'Connor"
Oct 2002
GIMPS since Jan 1996
2·72 Posts |
|
|
|
|
|
|
#157 |
|
"Ethan O'Connor"
Oct 2002
GIMPS since Jan 1996
2×72 Posts |
I reordered the test cases to see if the failure pattern was the same, and it turns out that the order of the kernels within a testcase is irrelevant -- mfakto_cl_71_4, mfakto_cl_barrett79, and mfakto_cl_barrett92 are failing, but mfakto_cl_71_8 is working.
|
|
|
|
|
|
#158 |
|
"Oliver"
Mar 2005
Germany
5·223 Posts |
Hello,
just a shot into the dark: The average wait is 0 when the known factor is not found: does the GPU-kernel run at all? Oliver Last fiddled with by TheJudger on 2011-11-12 at 01:31 |
|
|
|
|
|
#159 | |
|
"Ethan O'Connor"
Oct 2002
GIMPS since Jan 1996
2×72 Posts |
Quote:
32 bit build cl_71_4: Code:
########## testcase 1/1558 ##########
tf(50804297, 67, 68, ...);
k_min = 1599999998520 - k_max = 1900000000000
Using GPU kernel "mfakto_cl_71_4"
class | candidates | time | avg. rate | SievePrimes | ETA | avg. wait
mfakto_cl_71: tid=0: p=3073649, *2 =6:e6c92, k=0, 0, 0, 0:17487, 17487, 17487, 1
7487:6e8773, 6ef3bb, 6f05c7, 6f3beb, f=8d029, 8d029, 8d029, 8d02a:fccff8, ff5fc2
, ffcd0e, 114f3:77c397, 53e4a7, a33f7f, 915007, shift=19, b=0, 0, 0, 0:1, 1, 1,
1:0, 0, 0, 0:0, 0, 0, 0:0, 0, 0, 0:0, 0, 0, 0
mod_144_72#1: qf=3.51844E+013, nf=6.15105E-021, *=2.16421E-007, qi=0
mod_144_72#1: q=0:1:0:0:0:0, n=8d029:fccff8:77c397, qi=0
mod_144_72#1.1: nn=0:0:0:0:0:0
mod_144_72#1.2: nn=0:0:0:0:0:0
mod_144_72#1.3: nn=0:0:0:0:0:0Error: The arguments don't match the printf format
string. printf(mod_144_72#1.3: nn=%x:%x:%x:%x:%x:%x
Code:
########## testcase 1/1558 ##########
tf(50804297, 67, 68, ...);
k_min = 1599999998520 - k_max = 1900000000000
Using GPU kernel "mfakto_cl_71_4"
class | candidates | time | avg. rate | SievePrimes | ETA | avg. wait
mfakto_cl_71: tid=0: p=3073649, *2 =6:e6c92, k=0, 0, 0, 0:17487, 17487, 17487, 1
7487:6e8773, 6ef3bb, 6f05c7, 6f3beb, f=8d029, 8d029, 8d029, 8d02a:fccff8, ff5fc2
, ffcd0e, 114f3:77c397, 53e4a7, a33f7f, 915007, shift=19, b=0, 0, 0, 0:0, 0, 0,
0:0, 0, 0, 0:0, 0, 0, 0:0, 0, 0, 0:0, 0, 0, 0
mod_144_72#1: qf=0.000000, nf=6.15105E-021, *=0.000000, qi=0
mod_144_72#1: q=0:0:0:0:0:0, n=8d029:fccff8:77c397, qi=0
mod_144_72#1.1: nn=0:0:0:0:0:0
mod_144_72#1.2: nn=0:0:0:0:0:0
mod_144_72#1.3: nn=0:0:0:0:0:0Error: The arguments don't match the printf format
string. printf(mod_144_72#1.3: nn=%x:%x:%x:%x:%x:%x
Code:
########## testcase 1/1558 ##########
tf(50804297, 67, 68, ...);
k_min = 1599999998520 - k_max = 1900000000000
Using GPU kernel "mfakto_cl_71_8"
class | candidates | time | avg. rate | SievePrimes | ETA | avg. wait
mfakto_cl_71: tid=0: p=3073649, *2 =6:e6c92, k=0, 0, 0, 0, 0, 0, 0, 0:17487, 174
87, 17487, 17487, 17487, 17487, 17487, 17487:6e8773, 6ef3bb, 6f05c7, 6f3beb, 6fd
e57, 70147b, 706eb7, 71c59b, f=8d029, 8d029, 8d029, 8d02a, 8d02a, 8d02a, 8d02a,
8d02a:fccff8, ff5fc2, ffcd0e, 114f3, 4eca2, 63487, 85704, 1073ae:77c397, 53e4a7,
a33f7f, 915007, 5b819f, 499227, d6585f, ba1667, shift=19, b=0, 0, 0, 0, 0, 0, 0
, 0:1, 1, 1, 1, 1, 1, 1, 1:0, 0, 0, 0, 0, 0, 0, 0:0, 0, 0, 0, 0, 0, 0, 0:0, 0, 0
, 0, 0, 0, 0, 0:0, 0, 0, 0, 0, 0, 0, 0
mod_144_72#1: qf=3.51844E+013, nf=6.15105E-021, *=2.16421E-007, qi=0
mod_144_72#1: q=0:1:0:0:0:0, n=8d029:fccff8:77c397, qi=0
mod_144_72#1.1: nn=0:0:0:0:0:0
mod_144_72#1.2: nn=0:0:0:0:0:0
mod_144_72#1.3: nn=0:0:0:0:0:0Error: The arguments don't match the printf format
string. printf(mod_144_72#1.3: nn=%x:%x:%x:%x:%x:%x
Last fiddled with by Ethan (EO) on 2011-11-12 at 02:08 |
|
|
|
|
|
|
#160 |
|
"Oliver"
Mar 2005
Germany
5×223 Posts |
qf = 0.00000 doesn't look good.
|
|
|
|
|
|
#161 | |
|
Nov 2010
Germany
3·199 Posts |
Quote:
Looks like some work to do ... Edit: And the average wait can be zero for mfakto because the necessary wait time for the last block of a class is not included in the calculation (one of the differences to the earlier mfaktc versions, to work better on small classes). Last fiddled with by Bdot on 2011-11-12 at 19:43 |
|
|
|
|
|
|
#162 |
|
Nov 2010
Germany
3×199 Posts |
The kernels do not receive the input parameter that holds the pre-processing information, but get a zero there.
With the kernel tracing fixed and set to at least level 3, the mfakto_cl_71_4 kernel will receive the correct parameters and find the factors. So far I did not get the barrett kernels to receive all input parameters. My guess is that the optimizer removed them as it did not deem them important. But trying to build the kernel non-optimized crashes the kernel compiler. In the light of this it is probably not helping that the barrett kernels are ~4% faster with 11.10. Probably because crucial parts have been optimized away. I guess we just need to skip the Catalyst 11.10 version :-( |
|
|
|
|
|
#163 |
|
Oct 2011
12478 Posts |
I just recently got myself an HD 6770 and picked up mfakto .09 and when I try to run the 64 bit windows exe I get multiple errors about too many instances of mad24 and then a message saying there were 27 errors and the program shutdown. (paraphrasing as I am not sitting AT the machine atm) If I run the 32 bit exe, everything appears to run normal. I have 11.9 drivers installed as I read on here about the problems with 11.10.
|
|
|
|
|
|
#164 | |
|
Nov 2010
Germany
3×199 Posts |
Quote:
Code:
Select device - Get device info - Compiling kernels.
BUILD OUTPUT
C:\Users\root\AppData\Local\Temp\OCLCEF5.tmp.cl(2192): error: more than one
instance of overloaded function "mad24" matches the argument list:
function "mad24(int, int, int) C++"
function "mad24(uint, uint, uint) C++"
argument types are: (uint, int, uint)
*res_hi = mad24(mul_hi(a,b), 256, (*res_lo >> 24));
^
...
C:\Users\root\AppData\Local\Temp\OCLCEF5.tmp.cl(2726): error: more than one
instance of overloaded function "mad24" matches the argument list:
function "mad24(int, int, int) C++"
function "mad24(uint, uint, uint) C++"
argument types are: (uint, int, uint)
nn.d2 = mad24(mul_hi(n.d1, qi), 256, tmp >> 24);
^
27 errors detected in the compilation of "C:\Users\root\AppData\Local\Temp\OCLCEF5.tmp.cl".
Internal error: clc compiler invocation failed.
END OF BUILD OUTPUT
Error -11: clBuildProgram
init_CL(5, 0) failed
I tried deinstalling 11.10 and went back as far as 11.6 - the errors remain. It's not the first time that the ATI drivers do not correctly deinstall themselves. Maybe they do but some hardware switch remained in a bad position. Anyway: the sad result is: once in that state, I could not get out. (I cannot try reinstalling the machine.) I'll see if I can build an "11.10-workaround-version" for trapped folks like me. There will certainly be a performance-penalty. Where it still works, it's probably faster to run the 32-bit version for now - on my machine the 32-bit version fails as well. Strange, strange, strange. Maybe there's still a bug in the main program that just has these side effects. |
|
|
|
|
|
|
#165 |
|
Oct 2011
7×97 Posts |
Hmmm, does that mean the exp's I've been doing on the 32 bit client are suspect?
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| gpuOwL: an OpenCL program for Mersenne primality testing | preda | GpuOwl | 2938 | 2023-06-30 14:04 |
| mfaktc: a CUDA program for Mersenne prefactoring | TheJudger | GPU Computing | 3628 | 2023-04-17 22:08 |
| LL with OpenCL | msft | GPU Computing | 433 | 2019-06-23 21:11 |
| OpenCL for FPGAs | TObject | GPU Computing | 2 | 2013-10-12 21:09 |
| Program to TF Mersenne numbers with more than 1 sextillion digits? | Stargate38 | Factoring | 24 | 2011-11-03 00:34 |