![]() |
|
|
#1695 |
|
Feb 2005
The Netherlands
3328 Posts |
I can saturate (100% GPU) my GTX680 when running two instances of mfaktc.
http://gpuz.techpowerup.com/12/03/27/a9a.png Note that the GPU core clock is constantly boosted to +-1100MHz and the power consumption hovers around 72% TDP, which could mean that the performance/Watt for this chip is higher than in James' calculations. This power consumption sensor seems to be a new feature on this chip, I've never seen it displayed in GPU-Z before on any other card. |
|
|
|
|
|
#1696 |
|
Feb 2005
The Netherlands
DA16 Posts |
It turns out that I plugged my brand new shiny bling-bling GTX680 into a PCI-E 2.0 x8 slot instead of a PCI-E 2.0 x16 slot...
I'll change it tonight, and also try to fix a crazy problem that causes my motherboard to refuse more than one memory module, forcing it to use single channel DDR3. I don't expect radically improved CUDA performance, but we'll see.
|
|
|
|
|
|
#1697 | ||
|
Jul 2009
Tokyo
2·5·61 Posts |
Quote:
Quote:
|
||
|
|
|
|
|
#1698 |
|
Mar 2003
Melbourne
5×103 Posts |
I'd be curious if you can weave some more TheJudger magic to get more out of the GTX680. :)
Now with some performance figures out, I'm pretty disappointed. I was hoping to buy some GTX680s to replace some hardware here to reduce my power bill. It doesn't even surpass what I have on performance per watt metrics. -- Craig |
|
|
|
|
|
#1699 | |
|
"James Heinrich"
May 2004
ex-Northern Ontario
1101010111012 Posts |
Quote:
http://mersenne-aries.sili.net/cudalucas.php?model=13 |
|
|
|
|
|
|
#1700 | |
|
Feb 2005
The Netherlands
2·109 Posts |
Quote:
Code:
CC 1.x CC 2.0 CC 2.1 CC 3.0
32-bit floating
point add, 8 32 48 192
multiply,
multiply-add
64-bit floating
point add, 1 16 4 8
multiply,
multiply-add
32-bit
integer add 10 32 48 168
32-bit integer
multiply,
multiply-add, Multiple 16 16 32
sum of absolute instructions
difference
Not much love for 32-bit integer multiply & multiply-add, compared to 32-bit floating point operations. |
|
|
|
|
|
|
#1701 | |
|
Jun 2003
2·3·7·112 Posts |
Quote:
|
|
|
|
|
|
|
#1702 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Quote:
Thanks for doing it. BTW: I wasn't thinking too well, either, when I ran the CuLu benchmarks. Sorry for the incomplete data, James. |
|
|
|
|
|
|
#1703 |
|
"James Heinrich"
May 2004
ex-Northern Ontario
342110 Posts |
|
|
|
|
|
|
#1704 | |
|
Jul 2009
Tokyo
11428 Posts |
Quote:
GTX-680 each Multiprocessor have 192 core,But only 32 32-bit integer multiply exec. Lots of thread wait exec. Code:
CC 1.x CC 2.0 CC 2.1 CC 3.0 32-bit integer shift compare 8 16 16 8 |
|
|
|
|
|
|
#1705 | |
|
Feb 2005
The Netherlands
2×109 Posts |
Quote:
![]() One instance of mfaktc is now taking +-70% GPU instead of the 74% I reported yesterday, and nVidia's Visual Profiler shows transfer rates of 6Gb/s instead of 3 Gb/s, but since the amount of data to transfer is relatively small, there's no earth-shattering improvement. I could run the same benchmark I did yesterday again if James would like me to do that. |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |