![]() |
![]() |
#122 |
Sep 2010
Annapolis, MD, USA
BD16 Posts |
![]()
I was able to test the Linux build as a success:
Code:
Selftest statistics number of tests 3332 successfull tests 3332 selftest PASSED! real 43m31.235s user 9m22.360s sys 31m10.151s Do we need to redo "no factor" work that was done under 0.07 or can those be submitted? I have not downloaded the source to check a diff to see if it is reasonable thing to do. |
![]() |
![]() |
![]() |
#123 | ||
Nov 2010
Germany
25516 Posts |
![]() Quote:
Quote:
In version 0.07, the single-vectored MUL24 kernel did not work with Catalyst 11.8. In your self-compiled version you removed that kernel from the selftest, but not from the program. If you never changed the mfakto.ini-Parameter VectorSize (i.e. if you left it at 4), then that faulty kernel has not been used and you can submit the previous results without re-running them. |
||
![]() |
![]() |
![]() |
#124 | |
Sep 2010
Annapolis, MD, USA
33×7 Posts |
![]() Quote:
![]() |
|
![]() |
![]() |
![]() |
#125 |
Nov 2010
Germany
3·199 Posts |
![]() |
![]() |
![]() |
![]() |
#126 | |
Jan 2009
43 Posts |
![]() Quote:
mfakto_cl_barrett79 avg rate: 55M/s mfakto_cl_71 avg rate: 50M/s |
|
![]() |
![]() |
![]() |
#127 |
Sep 2010
Annapolis, MD, USA
BD16 Posts |
![]()
I am finding similar results. My HD5450 seems to do about 8.6M/s on the mfakto_cl_71 and about 9.1M/s on the mfakto_cl_barrett79, doing TF on M41774351 from 68 to 69.
|
![]() |
![]() |
![]() |
#128 |
"James Heinrich"
May 2004
ex-Northern Ontario
32×7×59 Posts |
![]()
Since it appeared to be missing, I've created a stub article on MersenneWiki for mfakto:
http://www.mersennewiki.org/index.php/Mfakto But since I don't actually use mfakto, perhaps someone else could fill in and fix all the details in the article. |
![]() |
![]() |
![]() |
#129 |
Jul 2008
308 Posts |
![]()
In the GPUGRID forum :
there's a bug in the latest sdk that makes a full use of a cpu-core whenever an opencl app is running. They promised a fix, but still not here in 11.8 maybe in 11.9?? Maybe mfakto suffers from this as well? One of the threads using 100% of one cpu happens to be in the ATI libs.... |
![]() |
![]() |
![]() |
#130 | |
Nov 2010
Germany
3·199 Posts |
![]() Quote:
However, mfakto is not hit that badly as mfakto passes the prepared factor candidates to the GPU but does not wait for the results immediately. Instead, the next block of factor candidates is prepared on the CPU. Only when the CPU is faster preparing the stuff than the GPU can process it, then mfakto will synchronize with the GPU. And of course at the end of a class. So yes, mfakto will also consume a full CPU core, but it will do something useful most of that time. |
|
![]() |
![]() |
![]() |
#131 | |
Jan 2010
510 Posts |
![]() Quote:
mfakto_cl_barrett79 avg rate: 51.9M/s mfakto_cl_71 avg rate: 48.7M/s GPU load is 91-95% CPU load will almost cap out 2 cores on my Phenom II x4 955 |
|
![]() |
![]() |
![]() |
#132 |
Nov 2010
Germany
3·199 Posts |
![]()
I´m sorry to report: yesterday I found a bug, mfakto up to 0.08 does not find the factor for k=3 for M6599953.
The reason is an invalid "optimization" that I made over the mfaktc-code. Mfaktc does not have this problem. I have fixed the bug and added a test case for it to the selftests. The mfakto kernel "mfakto_cl_71" (all vector sizes) sometimes calculated a bad modulus when the factor candidate was <248. Smaller FCs (~224) had a higher chance for the error to occur, FCs >248 were always calculated correctly. The problem does not depend on the exponent size. I´m sorry for possibly having wasted effort and resources, but I hope it´s not too many tests that need to be repeated as it´s only about small FCs. I will provide a fixed version within the next few days. |
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
gpuOwL: an OpenCL program for Mersenne primality testing | preda | GpuOwl | 2760 | 2022-05-15 00:00 |
mfaktc: a CUDA program for Mersenne prefactoring | TheJudger | GPU Computing | 3541 | 2022-04-21 22:37 |
LL with OpenCL | msft | GPU Computing | 433 | 2019-06-23 21:11 |
OpenCL for FPGAs | TObject | GPU Computing | 2 | 2013-10-12 21:09 |
Program to TF Mersenne numbers with more than 1 sextillion digits? | Stargate38 | Factoring | 24 | 2011-11-03 00:34 |