![]() |
|
|
#133 | |
|
"Jacob"
Sep 2006
Brussels, Belgium
2×977 Posts |
Quote:
Jacob |
|
|
|
|
|
|
#134 |
|
Einyen
Dec 2003
Denmark
1101011111002 Posts |
If if only happens with multiple factors and at least one of the factors is always found it doesn't matter. We only need one factor to skip doing LL for that exponent.
|
|
|
|
|
|
#135 |
|
"Oliver"
Mar 2005
Germany
111510 Posts |
Hi,
it can happen if there is more than one factor found at the same time. On my GTX 275 there are 30720 (30 "multiprocessors" * 256 threads per block * 4 blocks per multiprocessor) threads "in flight" at the same time. The problem is that I haven't written code to handle access to the result array. I need some kind of locking and/or atomic read/writes. S485122: did you know that prime95 missed some factors asweel? Oliver Last fiddled with by TheJudger on 2010-02-12 at 13:15 |
|
|
|
|
|
#136 |
|
Account Deleted
"Tim Sorbera"
Aug 2006
San Antonio, TX USA
11×389 Posts |
IIRC Prime95 always looks for the smallest factor in the bit range. If George considers that important to be a GIMPS-contributing TF algorithm/program, and this program isn't guaranteed to find the smallest of the factors, then that might be a roadblock.
|
|
|
|
|
|
#137 | |
|
Jun 2005
100000012 Posts |
Quote:
g++ -O1 -fno-dce -fno-ipa-reference -fno-split-wide-types -fno-tree-dominator-opts -fno-tree-copyrename -fno-tree-loop-optimize -momit-leaf-frame-pointer -fno-tree-sink -fno-inline-functions-called-once -frerun-cse-after-loop -ftree-pre -fipa-cp -freorder-blocks -fipa-cp-clone -fsee -mtune=nocona -msse4.1 This gained me ~20% over default -O2 or -O3 settings. I'm testing on my Core2Duo laptop, so obviously results will be different for different CPUs and even different versions of the compilers. Still, give it a shot and see if it helps. |
|
|
|
|
|
|
#138 | |
|
"Lucan"
Dec 2006
England
2×3×13×83 Posts |
Quote:
that the probability of it being a factor decreases with the size of the trial factor. That this procedure finds the smallest factor is merely an added bonus. Besides, does P-1 necessarily find the lowest factor? David PS The probability of 2kp + 1 being a factor goes as 1/klogk ? Last fiddled with by davieddy on 2010-02-12 at 16:16 |
|
|
|
|
|
|
#139 | |
|
Account Deleted
"Tim Sorbera"
Aug 2006
San Antonio, TX USA
11×389 Posts |
Quote:
No. Certainly not. But that is a quite different scenario, and doesn't necessarily mean that they don't care about the discovery order for factors found by TF. Last fiddled with by TimSorbet on 2010-02-12 at 16:15 |
|
|
|
|
|
|
#140 |
|
Jun 2003
23×683 Posts |
|
|
|
|
|
|
#141 | |
|
"Lucan"
Dec 2006
England
194A16 Posts |
Quote:
that 120 business. I think it comes from neatly storing possible primes mod 30 as eight bits, implicitly excluding multiples of 2,3 and 5. Any factor found saves the LL tests and is stronger information than "is composite". David Last fiddled with by davieddy on 2010-02-12 at 16:39 |
|
|
|
|
|
|
#142 | |
|
Einyen
Dec 2003
Denmark
22×863 Posts |
Quote:
|
|
|
|
|
|
|
#143 |
|
Account Deleted
"Tim Sorbera"
Aug 2006
San Antonio, TX USA
11·389 Posts |
Does it still cover all factors? It seems to me that if you only cover 16 of the 30 potential values, you'd be missing about half the factors.
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1724 | 2023-06-04 23:31 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 42 | 2022-12-18 05:59 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |