mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Math

Reply
 
Thread Tools
Old 2016-05-19, 03:30   #100
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

7×1,373 Posts
Default

That's what I mainly said in the third paragraph. We are in violent agreement here...
LaurV is offline   Reply With Quote
Old 2016-05-20, 03:20   #101
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

103·113 Posts
Default

Quote:
Originally Posted by GP2 View Post
For each value of p mod 60, there are exactly 16 out of 60 valid values for k mod 60. Earlier, we saw that for each value of p mod 12, there were exactly 4 out of 12 possible value for k mod 12. Also, for each value of p mod 4, there were exactly 2 out of 4 possible values for k mod 4. So by increasing the modulus from 4 to 12 to 60, we reduced the number of cases to test from 50% to 33.33% to 26.67%.
Note that 'reduced the number of cases to test' is misleadingly optimistic, because in practice such a highly-composite-modulo sieve is always combined with a small-prime-factor sieve, which will catch nearly all the 'extra' candidates left over by a smaller-modulus. In other words, the number of expensive operations, that is, modular powerings to test whether a given q = 2kp+1 divides M(p), changes very little from e.g. using (mod 60) to (mod 420).

There are other reasons one might prefer larger moduli, though - last year I spend a few months upgrading my own TF sieve from (mod 60) to (mod 4620) because that will support many more threads (each processing k-ranges in one of the admissible remainder classes) than the maximum of 16 permitted by (mod 60). When next I make an official release of my TF code, the intention is that it support manycore architectures.
ewmayer is offline   Reply With Quote
Old 2016-06-03, 01:57   #102
GP2
 
GP2's Avatar
 
Sep 2003

A1916 Posts
Default

At the request of bloodIce, I am posting this here:

(As is well-known, all factors of Mp are of the form 2kp+1 for some k)

Here is the distribution of p mod 4, k mod 4 for all known factors (as of 2016-06-03 00:00) of Mp where p is in the range between 900M and 910M:

Code:
Count p, k
 ----- ----
 84572 1, 0   47.9%
 91948 1, 3   52.1%

 85123 3, 0   45.9%
100367 3, 1   54.1%
bloodIce mentions that he has trial-factored this range to 64 bits and plans to complete it to 72 bits.
GP2 is offline   Reply With Quote
Old 2016-06-04, 13:26   #103
alpertron
 
alpertron's Avatar
 
Aug 2002
Buenos Aires, Argentina

2·683 Posts
Default

Since TJAOI found all missing prime factors less than 4*1018 of Mersenne numbers (including the case when a Mersenne number has several factors less than that bound), I think a better statistic would be found if we ignore the prime factors greater than 4*1018.
alpertron is offline   Reply With Quote
Old 2016-06-04, 18:17   #104
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

722110 Posts
Default

Quote:
Originally Posted by alpertron View Post
Since TJAOI found all missing prime factors less than 4*1018 of Mersenne numbers (including the case when a Mersenne number has several factors less than that bound), I think a better statistic would be found if we ignore the prime factors greater than 4*1018.
He did at least for p < 1B.
Dubslow is offline   Reply With Quote
Old 2016-06-04, 18:36   #105
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

36×13 Posts
Default

I've played with the 900-910M data (fetched from m.org ab initio).
The general count of factors matches, so far so good: there are exactly 362010 reported factors. If you slice the factors in bin sizes by log2(f) (e.g. in 5-bit bins, or 2-bit bins), the whole observed difference happens for very small k's, and then all counts are equal within the margin of error in every bin up to the current search size.
Code:
[ec2-user@ip-172-30-0-237 Mfa]$ awk -F, '(k=int($2/2/$1))<2**25{print $1%4 "\t" k%4 "\t" int(log(k)/log(2)/5)*5 "-..."}' ALLFA | sort -k3n | uniq -c
  10841 1       0       0-...
  18209 1       3       0-...
  10907 3       0       0-...
  26072 3       1       0-...
  15662 1       0       5-...
  15755 1       3       5-...
  15835 3       0       5-...
  16067 3       1       5-...
  14042 1       0       10-...
  14000 1       3       10-...
  14035 3       0       10-...
  14095 3       1       10-...
  12571 1       0       15-...
  12518 1       3       15-...
  12714 3       0       15-...
  12716 3       1       15-...
  11474 1       0       20-...
  11543 1       3       20-...
  11521 3       0       20-...
  11504 3       1       20-...

[ec2-user@ip-172-30-0-237 Mfa]$ wc -l ALLFA
362010 ALLFA

[ec2-user@ip-172-30-0-237 Mfa]$ head ALLFA
900000011,16200000199,2008-05-16
900000041,7200000329,2008-05-16
900000053,5292000311641,2008-05-16
900000067,228834580435461943,2008-06-01
900000067,2268053060043937457,2014-11-04
900000083,17522227932738649217,2009-05-11
900000131,405000058951,2008-05-16
900000131,148307421586967,2011-05-09
900000131,511393347889932024943,2011-08-15
900000131,2497567103149914321769,2011-08-15
...
There can be multiple possible reasons for that. I'll try to fast-factor some Mp where only one very small factor is known; if I'll find at least one factor, then the hypothesis that TJAOI searches for "all" p would fail.

There may be other biases in this dataset as well.
Don't make assumptions about any dataset "just because it is out there" and is easy to fetch. Run sanity checks.
Batalov is offline   Reply With Quote
Old 2016-06-04, 20:12   #106
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

947710 Posts
Default

Quote:
Originally Posted by R. Gerbicz View Post
I think most of these differences comes from the very small k values, say seeing only k<=100.
q1=6*p+1 is prime with roughly two times higher probability than q2=8*p+1 (you can generalize this),
and use also that for k=1 or 3 mod 4 if q=2*k*p+1 is prime then this divides 2^p-1 with 1/(2*k) probability, for k=0 mod 4 with 1/k probability.
(obviously these are only heuristics). I've gotten even that k=1 mod 4 gives more "small" factors than k=3 mod 4.
Quote:
Originally Posted by LaurV View Post
That's what I mainly said in the third paragraph. We are in violent agreement here...
We are in violent agreement here...
Code:
#Count p  k   GP2%-age    #smallk  #rest   rest %-age
 84572 1, 0   47.9%       10841    73731   50.00%
 91948 1, 3   52.1%       18209    73739   50.00%
 85123 3, 0   45.9%       10907    74216   49.97%
100367 3, 1   54.1%       26072    74295   50.03%
If GP2 posts some new stats in another half a year (please, don't, there is really [B]no need[/B]), we could see perhaps the fourth or the fifth digit behind the dot in the percentages slightly change.
Batalov is offline   Reply With Quote
Old 2016-06-04, 21:30   #107
alpertron
 
alpertron's Avatar
 
Aug 2002
Buenos Aires, Argentina

25268 Posts
Default

Quote:
Originally Posted by Batalov View Post
I'll try to fast-factor some Mp where only one very small factor is known; if I'll find at least one factor, then the hypothesis that TJAOI searches for "all" p would fail.
I ran the P-1 algorithm for several tens of thousands of composite Mersenne numbers less than 2.3M, and I've never found a prime factor less than the ones found by TJAOI, even though I found more than 10000 prime factors.
alpertron is offline   Reply With Quote
Old 2016-06-05, 07:36   #108
bloodIce
 
bloodIce's Avatar
 
Feb 2010
Sweden

173 Posts
Default

In the process of double-checking the 900-910M range by simply factoring every exponent to 64 bits, I did not see any new factor coming bellow TF60 or any newly factored exponent. So I believe that all exponents are factored to TF60 at least and quite systematically it seems.
I believe that the argument here is that the arrival of new factors in a climb-up trial factoring is biased in k mod 4, not that the final distribution of k mod 4 will be biased.
bloodIce is offline   Reply With Quote
Old 2016-06-05, 17:51   #109
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

36·13 Posts
Default

Quote:
Originally Posted by bloodIce View Post
I believe that the argument here is that the arrival of new factors in a climb-up trial factoring is biased in k mod 4, not that the final distribution of k mod 4 will be biased.
On the contrary, all new (read: large k, i.e. k >> 100) factors are unbiased.

The only source of bias is grossly that one class can* have relatively small factors (with k=1, f=2p+1), the other class can only have 3x larger factors (k=3, f=6p+1, - whose density is log(6*905E6)/log(2*905E6) = 1.09 less), and two more classes can only have 4x larger factors (k=4, f=8p+1, - whose density is log(8*905E6)/log(2*905E6) = 1.11 less), that's all. On this gross bias foundation, there is a lesser bias of a few larger k values (see R.G.'s summation), and then there is no residual bias left.

_____________
*The semantics of can here is simple: if f is composite, then its prime factors are obviously ineligible to divide Mp (their effective "k" would be fractional and less than 1). So f must be prime to divide Mp and the probability of prime f is 1/ln f, nothing more, nothing less. If f is prime then its probability of dividing Mp is indistinguishable from any other small factor (all TF/P-1-reachable f << Mp).
Batalov is offline   Reply With Quote
Old 2016-06-09, 11:29   #110
science_man_88
 
science_man_88's Avatar
 
"Forget I exist"
Jul 2009
Dumbassville

838410 Posts
Default

if it was wanted to restrict k values to those that could lead to prime factors any k values greater than the original k value of the form (2*k*p+1)*n+k would be eliminated as those are all multiples of 2kp+1.
science_man_88 is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Distribution of Mersenne Factors tapion64 Miscellaneous Math 21 2014-04-18 21:02
Known Mersenne factors CRGreathouse Math 5 2013-06-14 11:44
A strange new (?) fact about Mersenne factors ChriS Math 14 2006-04-12 17:36
Factors of Mersenne Numbers asdf Math 17 2004-07-24 14:00
Factors of Mersenne numbers ? Fusion_power Math 13 2003-10-28 20:52

All times are UTC. The time now is 18:01.


Fri Jul 16 18:01:13 UTC 2021 up 49 days, 15:48, 1 user, load averages: 1.33, 1.36, 1.42

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.