![]() |
|
|
#45 |
|
Jun 2005
3×43 Posts |
If we're looking to optimize, I guess one of the questions is should we use hardware X for factoring another bit level or turn them over to LL/DC tests. Even for GPUs, there may come a time when it makes more sense to run 2 first time LL tests than it would to run enough TFs at a given bit level to find 1 expected factor.
For CPUs, we have a lot of data to compare this switch-over point (ignore for now that we know that CPUs shouldn't be used to TF period). So instead of comparing TF time on GPU to LL time on a CPU, should we instead compare the time of each type of test running on a GPU? You can add in a fudge factor for the CPU cores needed to do TF on a GPU to make it more reasonable. The next question : on CPUs, I believe the TF vs LL efficiency is pretty consistent across CPU families, so the switchover point is the more or less the same regardless of which architecture you're running. Is this true for GPUs as well, or do different types of GPUs behave radically differently on the two types of tests? If they scale pretty similarly, we end up with a nice way to compare TF vs LL effort without having to figure out how to compare a CPU to a GPU* * - and probably create a bunch of other unanswerable questions, but it's a start :) Last fiddled with by kjaget on 2011-12-01 at 18:24 |
|
|
|
|
|
#46 | |||
|
If I May
"Chris Halsall"
Sep 2002
Barbados
2×5×7×139 Posts |
Quote:
Quote:
Quote:
|
|||
|
|
|
|
|
#47 | |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
3×29×83 Posts |
Quote:
Indeed, I have mostly stopped paying attention this thread (though I have at least skimmed every post) |
|
|
|
|
|
|
#48 | |
|
Jun 2003
5,051 Posts |
Quote:
Think about it -- GPUs are doing TF so that they can eliminate candidates, thus advancing the LL wavefront. Doesn't matter if you eliminate via TF or via LL -- what matters is only how quickly can you eliminate them. The only reason GPUs are doing higher TFs than the CPU-optimal, is becuase they have relatively higher efficiency for TF compared to LL (x100 vs x10). If they were equally as good in TF & LL, we would _not_ be doing additional TF with them -- we'll use them as we would any other CPU. Further, suppose that tomorrow a GPU comes out that is somehow _more_ efficient for LL than TF. In which case, the recommended option for it would be to do LLs that have less than optimal TF done. Last fiddled with by axn on 2011-12-02 at 06:32 |
|
|
|
|
|
|
#49 | |
|
Basketry That Evening!
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88
3×29×83 Posts |
It takes two LL's to clear an exponent, so I would disagree with "I would make a stronger statement that you're better of LL-ing, if you can do 1 LL quicker that the "expected time to find a factor by TF"." It doesn't matter when the 2 tests take place, only that 2 are necessary to clear an exponent.
Quote:
|
|
|
|
|
|
|
#50 | |
|
If I May
"Chris Halsall"
Sep 2002
Barbados
260216 Posts |
Quote:
|
|
|
|
|
|
|
#51 |
|
Dec 2010
Monticello
5×359 Posts |
The promised Benchmark:
AMD Phenom II x6, 1 core running mfaktc 0.17 with 4620 classes feeding Galaxy GTX440, all at stock speed. TF(52765829,69,70) = 1hr, 3 minutes. Computer is otherwise unloaded. SievePrimes is stable at 80,900. OS is Xubuntu 11.10 Last fiddled with by Christenson on 2011-12-06 at 02:25 Reason: Benchmark needs sievePrimes reported. |
|
|
|
|
|
#52 |
|
Aug 2002
Termonfeckin, IE
22·691 Posts |
no factor for M25406911 from 2^68 to 2^69 [mfaktc 0.17-Win....]
tf(): total time spent: 34m 41.908s GTX550Ti, fully loaded one instance fed by one core of an i5-750. |
|
|
|
|
|
#53 |
|
Banned
"Luigi"
Aug 2002
Team Italia
32×5×107 Posts |
|
|
|
|
|
|
#54 |
|
May 2003
Belgium
2·139 Posts |
If I understand the last posts here, I should not have Prime95 run on all cores with CUDA running. Because when I check what my GTX 570M does in 24h, I don't get 100GHz days credit.
But then again, I got 2 instances of an MMORPG running on idle (AFK fishing), which might be eating on the GPU as well :) |
|
|
|
|
|
#55 | |
|
Feb 2004
A016 Posts |
Quote:
CUDALucas? Mfaktc? Other? It's perfectly fine to have Prime95 running with CUDALucas. While with mfaktc I would liberate 1 core on Prime95 per mfaktc instance that you are running. |
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| What percentage of CPUs/GPUs have done a double check? | Mark Rose | Data | 4 | 2016-06-17 14:38 |
| Anyone using GPUs to do DC, LL or P-1 work? | chalsall | GPU to 72 | 56 | 2014-04-24 02:36 |
| GPUs impact on TF | petrw1 | GPU Computing | 0 | 2013-01-06 03:23 |
| LMH Factoring on GPUs | Uncwilly | LMH > 100M | 60 | 2012-05-15 08:37 |
| Compare interim files with different start shifts? | zanmato | Software | 12 | 2012-04-18 14:56 |