![]() |
|
|
#2124 | |
|
"Oliver"
Mar 2005
Germany
45716 Posts |
Hi Mike,
Quote:
With the GHz-d/day measurement it is easy to compare throughput. Oliver |
|
|
|
|
|
|
#2125 |
|
Aug 2002
Termonfeckin, IE
22·691 Posts |
If you reduce your GPUSieveSize, try reducing the GPUSieveProcessSize to 8 and optionally the GPUSievePrimes a bit. I got better throughput by reducing the GPUSieveProcessSize.
|
|
|
|
|
|
#2126 |
|
"GIMFS"
Sep 2002
Oeiras, Portugal
3·491 Posts |
Me too (GTX560Ti). But raised the GPUSieveSize from the default 64 to 128.
GPUSieveProcessSize is currently at 8, down from the default 16. That´s the setting that appears to work best on my system, at least for the mainstream exponents (GPUto72 tasks). |
|
|
|
|
|
#2127 |
|
Bemusing Prompter
"Danny"
Dec 2002
California
5×479 Posts |
I reduced GPUSieveProcessSize from 16 to 8, and the time per iteration (factoring from 71 to 73 bits in the 60M range) dropped by about 0.15 seconds. It's not a huge difference, but I guess it all adds up. For the record, this was on my GTX 555.
|
|
|
|
|
|
#2128 |
|
"Mike"
Aug 2002
2·23·179 Posts |
We are using our GTX 660Ti as our primary display card. We stop factoring for games but we have found that by sacrificing a (small?) portion of the card's throughput we are able to use the computer for any other task, including 1080p videos, without any lag whatsoever.
Code:
# Minimum: GPUSieveSize=4 # Maximum: GPUSieveSize=128 # Default: GPUSieveSize=64 GPUSieveSize=4 # Minimum: GPUSieveProcessSize=8 # Maximum: GPUSieveProcessSize=32 # Default: GPUSieveProcessSize=16 GPUSieveProcessSize=8 We are now running the 64-bit binaries. The performance hit for doing so does not seem to be very much. We purposely purchased the slowest 660Ti card that Asus makes. We have read that in some cases that the more highly (factory) overclocked cards are more likely to produce faulty calculations. By running our card at a lower load and temperature it possibly will be more reliable. Certainly, the fact that it affects our desktop experience in no way means we are willing to let it run continuously, which in the long term might result in a greater overall throughput than if we had to pause it here and there. FWIW, our system, factoring on both video cards and running 4 instances of P-1 factoring on an i7 3770 CPU, draws 258 watts. (This does not count our display or speakers.) Disclaimer: We are very sensitive to lag and it irks us greatly. (We are also severely impaired by flickering lights, like fluorescent lights.) |
|
|
|
|
|
#2129 |
|
Aug 2012
New Hampshire
23·101 Posts |
Observing this conversation I played around with
GPUSieveSize and GPUSieveProcessSize (everything other than default decreased throughput) Setting GPUSieveSize=128 increased my GhzDays from 412 to 422 I confirmed increased throughput on both of my GTX570s running 0.20 (Win 7, 64bit and Win 7, 32bit PCs) Last fiddled with by swl551 on 2013-01-19 at 22:28 |
|
|
|
|
|
#2130 |
|
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
2·47·101 Posts |
Code:
GPUSieveSize=4 ... GPUSieveProcessSize=8 |
|
|
|
|
|
#2131 |
|
Oct 2002
France
33·5 Posts |
Hi,
could someone please make a version of mfaktc 0.20 for cuda 4.0? Thanks a lot. |
|
|
|
|
|
#2132 | |
|
"Mike"
Aug 2002
2·23·179 Posts |
Quote:
YMMV |
|
|
|
|
|
|
#2133 |
|
May 2011
Orange Park, FL
11011101012 Posts |
Same here with GTX 580. Those two changes increased my GPU utilization from 98% to 99% and raised the GHz-d/day from 431 to 435. Extremely minor video lag which doesn't bother me.
|
|
|
|
|
|
#2134 |
|
"Bill Staffen"
Jan 2013
Pittsburgh, PA, USA
23·53 Posts |
Same here as well on a gtx480. Increasing the GPUSieveSize from 64 to 128 increased ghz days from ~388.4 (wobbly) to a locked on 395.00. Changing the GPUSieveProcessSize from 16 to 32 tropped ghzdays to 295, and changin it to 8 put us back to ~395 but it was wobbly. I set it back to the default 16.
I am tempted to muck with the GPUSievePrimes number again on these new settings. I had gained about 3 ghz days by dropping it from 82486 to 70000 (gpu uses 69941 at that setting). These numbers are for doing TF in the 61M range. |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |