|
|
#1684 |
|
"James Heinrich"
May 2004
ex-Northern Ontario
11·311 Posts |
|
|
|
|
|
|
#1685 |
|
"James Heinrich"
May 2004
ex-Northern Ontario
11×311 Posts |
I've updated http://mersenne-aries.sili.net/cudalucas.php so that if you click any GPU model name down the left, it'll give you a chart of breakeven points between mfaktc TF and CUDALucas L-L (ignoring the CPU entirely, including the CPU cores that CUDALucas doesn't use). Cutoff points only vary by compute version (e.g. 2.0 vs 2.1 = GTX 570 vs GTX 560), but they do vary a fair bit (due to relative performance differences between mfaktc and CUDALucas, see post #1677 above).
Last fiddled with by James Heinrich on 2012-03-26 at 23:33 Reason: localhost typo |
|
|
|
|
|
#1686 |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
9,767 Posts |
Thanks very much for doing this James.
And just for clarity, this analysis is the cut-off point for a single LL test, right? As in, it doesn't take into account that a factor found in the LL range saves two tests? Nice to have hard data, rather than a gut feel....
Last fiddled with by chalsall on 2012-03-26 at 23:42 Reason: Added clarification question. |
|
|
|
|
|
#1687 | |
|
Jun 2003
5082₁₀ Posts |
Quote:
Perhaps mfaktc needs 680-specific optimizations?
Last fiddled with by axn on 2012-03-26 at 23:16 |
|
|
|
|
|
|
#1688 | |
|
"James Heinrich"
May 2004
ex-Northern Ontario
D5D₁₆ Posts |
Quote:
Feedback (including critical analysis of my approach) is welcome, since I'm not 100% confident this comparison is the best approach; if someone can suggest a better way I'm interested to hear. |
|
|
|
|
|
|
#1689 | |
|
Oct 2011
679₁₀ Posts |
Quote:
|
|
|
|
|
|
|
#1690 |
|
Jun 2003
2×3×7×11² Posts |
Looks like you're using cumulative probability in the calculation rather than incremental probability. That can't be right.
|
|
|
|
|
|
#1691 | |
|
"James Heinrich"
May 2004
ex-Northern Ontario
11×311 Posts |
Quote:
If someone could walk through an example of how it should be calculated I'd be very grateful. |
|
|
|
|
|
|
#1692 |
|
Jun 2003
2·3·7·11² Posts |
You're nearly there. Rather than using the cum.prob., just use the probability for the given bit depth. You should see a rough doubling of the % with every bit.
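
To make the incremental-probability suggestion concrete, here is a minimal Python sketch of a breakeven search. It is not the calculation used on James' page. It assumes the common heuristic that the chance of a factor between 2^(b-1) and 2^b is roughly 1/b, and that TF cost doubles with each extra bit. The function names, the `tests_saved` parameter and the example timings are all hypothetical.

```python
def incremental_factor_probability(bit: int) -> float:
    """Assumed heuristic: chance of a factor between 2^(bit-1) and 2^bit is about 1/bit."""
    return 1.0 / bit


def tf_breakeven_bit(start_bit: int, tf_cost_at_start: float, ll_cost: float,
                     tests_saved: float = 1.0, max_bit: int = 100) -> int:
    """Return the deepest bit level at which TF still costs less than it is expected to save.

    tf_cost_at_start: GPU time to TF the single bit level `start_bit`
    ll_cost:          GPU time for one L-L test (same unit as tf_cost_at_start)
    tests_saved:      L-L tests avoided per factor found (1 or 2, hypothetical knob)
    Returns start_bit - 1 if even the first level is not worthwhile.
    """
    deepest = start_bit - 1
    cost = tf_cost_at_start
    for bit in range(start_bit, max_bit + 1):
        # Use the probability for this bit level only, not the cumulative probability.
        expected_saving = incremental_factor_probability(bit) * ll_cost * tests_saved
        if cost > expected_saving:
            break
        deepest = bit
        cost *= 2.0  # assumed: each extra bit doubles the TF effort
    return deepest


if __name__ == "__main__":
    # Made-up timings: 1 hour to TF bit 70, 1000 hours per L-L test.
    print(tf_breakeven_bit(70, 1.0, 1000.0))
    print(tf_breakeven_bit(70, 1.0, 1000.0, tests_saved=2.0))
```

Because the per-bit probability barely changes while the cost doubles, the cost-to-saving ratio roughly doubles with every bit, and counting two saved tests instead of one moves the cutoff by about one bit.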
|
|
|
|
|
|
#1693 |
|
Romulan Interpreter
Jun 2011
Thailand
3²×29×37 Posts |
|
|
|
|
|
|
#1694 | |
|
Dec 2011
11×13 Posts |
Quote:
Number of multiprocessors: 8 / 15 / 8
Cores per multiprocessor: 192 / 32 / 48
Total cores: 1536 / 480 / 384
Base clock rates (MHz): 1006 / 732 / 822.5
Base clock rate * #multiprocessors: 8048 / 10960 / 6580

From James' data:
mfaktc gigahertz days per day: 206 / 281 / 168.4

If we define "efficiency" as Hz days/day divided by (Clock rate * #multiprocessors):
mfaktc efficiency per multiprocessor: 29.60 / 29.59 / 29.59

From James' data:
cudalucas gigahertz days per day: 28.4 / 31.5 / 20.6
cudalucas efficiency per multiprocessor: 3.5 / 2.9 / 3.1

By this metric, the performance of cudalucas on the new 680 is a bit better than I expected. (Maybe the increased memory bandwidth is especially beneficial to cudalucas.)

But, by this metric, the performance of mfaktc on the new 680 is woefully below what I expected.

Let me also remind everybody that Oliver didn't compile mfaktc to run the benchmarks. I wouldn't be a bit surprised if a trivial change could yield twice the performance. But until someone with the know-how and the hardware can run the profiler on a 680, we shouldn't assume these are *final* benchmarks.
Last fiddled with by rcv on 2012-03-27 at 05:25 |
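
For anyone who wants to rerun the per-multiprocessor metric, a small Python sketch follows. The input figures are copied from the post in its column order. The factor of 1000 is an assumption, chosen only so the cudalucas ratios land on the same scale as the post; the `efficiency` helper and the `cards` table are hypothetical names. Recomputed values need not match the posted ones to the last digit, since the exact scaling and source data behind them are not stated.

```python
def efficiency(ghz_days_per_day: float, base_clock_mhz: float, multiprocessors: int) -> float:
    """GHz-days/day divided by (base clock in MHz * multiprocessor count).

    The 1000x scale factor is an assumption, not something stated in the post.
    """
    return 1000.0 * ghz_days_per_day / (base_clock_mhz * multiprocessors)


# Figures quoted in the post, in its column order (the first column appears to be the 680).
cards = [
    {"name": "column 1", "sm": 8,  "mhz": 1006.0, "mfaktc": 206.0, "cudalucas": 28.4},
    {"name": "column 2", "sm": 15, "mhz": 732.0,  "mfaktc": 281.0, "cudalucas": 31.5},
    {"name": "column 3", "sm": 8,  "mhz": 822.5,  "mfaktc": 168.4, "cudalucas": 20.6},
]

if __name__ == "__main__":
    for card in cards:
        tf = efficiency(card["mfaktc"], card["mhz"], card["sm"])
        ll = efficiency(card["cudalucas"], card["mhz"], card["sm"])
        print(f'{card["name"]}: mfaktc {tf:.2f}, cudalucas {ll:.1f}')
```

Normalising by clock times multiprocessor count, rather than by total cores, is what makes the three cards comparable here even though the core count per multiprocessor differs so much between them.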
|
|
|
|