![]() |
|
|
#1409 | |
|
"Oliver"
Mar 2005
Germany
11·101 Posts |
Quote:
Code:
| CUDA 3.2 | CUDA 4.1-RC2 mfaktc 0.17 | 260.94M/s | 261.93M/s mfaktc 0.18-pre10 | 260.80M/s | 265.39M/s but there are no changes in the code of the barrett79 kernel from -pre9 to -pre10...Factory overclocked GTX 560Ti (1701MHz), barrett92 kernel, raw GPU speed (without sieving), M3321932839 from 279 to 280 Code:
| CUDA 4.1-RC2 mfaktc 0.17 | 170.62M/s mfaktc 0.18-pre10 | 173.32M/s ![]() Oliver |
|
|
|
|
|
|
#1410 |
|
"Oliver"
Mar 2005
Germany
11·101 Posts |
Hello!
http://www.mersenneforum.org/mfaktc/mfaktc-0.18.tar.gz http://www.mersenneforum.org/mfaktc/mfaktc-0.18.win.zip http://www.mersenneforum.org/mfaktc/...linux64.tar.gz The executables need at least a CUDA 4.0 capable driver (270 series driver or newer). The Windows zip archive contains both, the 32 bit and 64 bit version. I'll upload new executables once CUDA 4.1 is public available. The sources should compile with older CUDA version, too, but they might be slower. CUDA 4.1 will give another performance improvement for the barrett based kernels on compute capability 2.x GPUs (especially on 2.0). Compared to mfaktc 0.17 there are "more than usuall" minor changes. Highlights from the Changelog.txt:
As usuall: finish your current assignments with your current version and do the update after it, mfaktc 0.18 will refuse foreign checkpoint files. Oliver |
|
|
|
|
|
#1411 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Many thanks, sir! I am impatient for my current assignments to finish so that I can put this version into service.
|
|
|
|
|
|
#1412 |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
1C3516 Posts |
Would you mind posting the .dll/.so s on the mfatkc mirror? I'd rather not have to download the whole CUDA environment...
Last fiddled with by Dubslow on 2011-12-20 at 01:34 |
|
|
|
|
|
#1413 |
|
Romulan Interpreter
Jun 2011
Thailand
100101101101112 Posts |
|
|
|
|
|
|
#1414 | |
|
Feb 2004
2408 Posts |
Quote:
|
|
|
|
|
|
|
#1415 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
100111101011102 Posts |
The new version seems to be working well. At least, there have been no problems reported.
|
|
|
|
|
|
#1416 | ||
|
"Oliver"
Mar 2005
Germany
111110 Posts |
Quote:
Quote:
So I guess I'll add this for 0.19? Oliver |
||
|
|
|
|
|
#1417 | |
|
"James Heinrich"
May 2004
ex-Northern Ontario
11·311 Posts |
Quote:
Along the same lines, a unified worktodo.txt would also be nice, perhaps split into [Worker #1], [Worker #2], etc sections. This is of course a little more work than a configurable results.txt, but lets us just deal with one in and one out for each machine, in a format that's already familiar to us from Prime95. Even better would be to optimize/thread the sieving such that we'd only ever need to run a single mfaktc instance (sieving would spread across as many CPU cores as needed to feed the GPU(s). But that's a whole other set of complications for a much later release.
|
|
|
|
|
|
|
#1418 |
|
May 2011
Orange Park, FL
3×5×59 Posts |
Great! Thanks for the update. I've got two instances running now.
|
|
|
|
|
|
#1419 |
|
"Kieren"
Jul 2011
In My Own Galaxy!
236568 Posts |
This was rather a quick test, showing the difference between mfaktc .17 and .18. V.18 did eventually drop to SievePrimes 5000, though the time didn't really change that much.
EDIT: These were run with the same exponent in single instances. Last fiddled with by kladner on 2011-12-20 at 15:09 |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |