20190411, 07:58  #1 
Apr 2019
2·3^{2} Posts 
Database of kvalues for factors found without scrapping website?
Hi,
I have an idea I would like to research/test. Basically, I want to write a little program to analyze existing data that has been generated through this GIMPS project. Without going into detail here the minimal data I want to analyze would be a list containing the following values: Mersenne exponent, kvalue for the smallest known factor (if one is known) It would be even better if I got get a data set that would include all the kvalues for known factors of each Mersenne exponent, and some indication of what range trial factoring has been done, but that is not absolutely necessary for what I want to analyze. And I’d like to get my hands on as much data like this as possible — every exponent for which you know a factor would be fantastic! Is there any way to get a copy of this data from the GIMPS project? I see it exists by browsing to individual exponent status report webpages on the site. But is it available to download anywhere without having to scrap the website and build my own database? When I came here to the forum to ask this question, I searched a bit first and found a post entitled “Mass exponent reports  CSVs” from just last month that had a zip file of a csv file that provided some info on the top trial factoring level for each exponent. That's nice, but what I'm after is a data set that gives the kvalue for the smallest known factor for each exponent. Is this obtainable without scraping the webpages and building my own database? Thanks! 
20190411, 12:02  #2 
Einyen
Dec 2003
Denmark
2^{2}×17×41 Posts 
The smallest factor and the smallest kvalue is easy: k=1 and the factor is 23=2*1*11+1 which is a factor of 2^111: https://mersenne.org/M11
A total of 1,655,600 factors have k=1. Last time I downloaded all the GIMPS factors with small program was on Dec 19th 2018, so about 4 months ago, at that time there were 41,933,619 factors. Here is a list of all the kvalues from those factors: gimpsklist20181219.zip The file is ~190 MB to download and 714 MB unpacked (use https://www.7zip.org/ ). The format is: <exponent>,<kvalue> Last fiddled with by ATH on 20190411 at 14:06 
20190411, 12:30  #3  
Apr 2019
2·3^{2} Posts 
Quote:
This is good information! And relates to what I wanted to study. Thank you! Quote:
Thanks again! 

20190411, 12:49  #4 
Jun 2003
3·1,511 Posts 
Scraping the website for large data extraction is strongly discouraged as it puts unnecessary load on the servers. You can contact user madpoo who is the sysadmin with your exact request, and he will be able to either create a custom extract or point to an existing extract that will fulfill your need.

20190411, 12:57  #5  
Apr 2019
2×3^{2} Posts 
Quote:
Thanks. 

20190411, 14:06  #6 
Einyen
Dec 2003
Denmark
5344_{8} Posts 
Here are the actual factors (not the kvalues):
gimpsfactors20181219.zip (349 MB) (1033MB unpacked) Format: <exponent>,<factor> Maybe we can get Madpoo to make the server create a full factor list every month maybe or every 3 months, if it is not too taxing, and if bandwidth requirements allows some people to download the list. Last fiddled with by ATH on 20190411 at 14:09 
20190411, 14:22  #7 
Sep 2003
2×1,289 Posts 
You can use the "Factors Found" report to ask for all factors found since 20181219.
Select the "Print simple text report" checkbox. Unselect the "Display date found" checkbox. That will be a much smaller file. Then merge it with ATH's, making sure to eliminate duplicates. Obviously k = (factor − 1) / (2 * exponent) 
20190411, 15:15  #8  
Apr 2019
2×3^{2} Posts 
Quote:


20190411, 15:43  #9  
Einyen
Dec 2003
Denmark
2^{2}×17×41 Posts 
Quote:


20190412, 17:36  #10  
Apr 2019
2·3^{2} Posts 
Quote:
Looking at the factors where k=1 is actually something I wanted to do. When you pointed out that ~1,655,600 out of ~41,933,619 known factors were generated from k=1 that peaked my interested and wanted to look at how those factors were distributed by exponent range. So I made a little plot and right away noticed something interesting... I grouped the exponents into ranges from 1999999, 10000001999999, ..., 900000000999999999, and then counted the number of occurrences of factors for k=1 amongst the exponents in each range, and made a plot. (attached) There is a really interesting transition at exponents around 5 million, 50 million, and 500 million, where the frequency of factors for k=1 sharply increases before starting to slowly decrease again. I can't think of any logical reason for this behavior, but it will be something fun to look at more closely & try to explain. Last fiddled with by neomacdev on 20190412 at 17:37 

20190412, 18:19  #11 
Einyen
Dec 2003
Denmark
AE4_{16} Posts 
Very interesting. For k=1 the exponent p is a Sophie Germain prime: https://en.wikipedia.org/wiki/Sophie_Germain_prime
I did some checking of my own on the kvalues several years ago, but I was quickly accused of doing "numerology" so I did not take it any further: https://mersenneforum.org/showthread.php?t=19662 I assume you noticed k=2 (mod 4) has no factors because then 2kp+1 != +/ 1 (mod 8) Last fiddled with by ATH on 20190412 at 18:25 
Thread Tools  
Similar Threads  
Thread  Thread Starter  Forum  Replies  Last Post 
Possible values for k (in Mersenne factors)  James Heinrich  Math  127  20190923 06:22 
Factors found before ECM starts  MisterBitcoin  YAFU  1  20180810 16:58 
No factors found  aketilander  PrimeNet  9  20110517 11:32 
C program to rapidly verify all factors in GIMPS database  GP2  Programming  8  20050103 07:49 