mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Cunningham Tables

Reply
 
Thread Tools
Old 2007-11-26, 14:01   #45
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

2×132×19 Posts
Default

Experimentally, the distribution of the maximum absolute value M of the entries of qflll(m) for m a matrix of your form is interesting; there's a reasonably sharp spike and then a long tail, and the tail is fit very well by p(M<t) ~ t^-2.

The M1039 write-up says that they threw out special-Q where the reduced matrix had too-large coefficients, which makes sense given how common and how useless such matrices appear to be.

A simple fit for x going from 10^8 to 2^13*10^8 in powers of two, looking at 10^5 matrices for each x, gives (I'm confident to the number of significant figures I give)

25th-percentile = 0.81*x^0.500
median= 1.00 * x^0.500
75th-percentile = 1.34 * x^0.501
90th-percentile = 2.1*x^0.501
99th-percentile = 6*x^0.50

mean = 1.3 * x^0.500

Reduction of 2x2 matrices is essentially representation by continued fractions, and I think it's well-understood; I'd be tempted to look in volume 1 or 2 of the recent edition of Knuth's Art of Computer Programming, in the section where he writes about GCD algorithms.
fivemack is offline   Reply With Quote
Old 2007-11-26, 14:05   #46
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

11001000101102 Posts
Default

The section of this thread after #29 is no longer to do with 3,499+; is there any way that the moderators could be implored to move #39 and follow-ups to, say, a thread called 'Modelling lattice-sieving yields' in the Factoring forum?
fivemack is offline   Reply With Quote
Old 2007-11-26, 14:24   #47
R.D. Silverman
 
R.D. Silverman's Avatar
 
Nov 2003

11101001001002 Posts
Default

Quote:
Originally Posted by fivemack View Post
Experimentally, the distribution of the maximum absolute value M of the entries of qflll(m) for m a matrix of your form is interesting; there's a reasonably sharp spike and then a long tail, and the tail is fit very well by p(M<t) ~ t^-2.

The M1039 write-up says that they threw out special-Q where the reduced matrix had too-large coefficients, which makes sense given how common and how useless such matrices appear to be.
Experimentally, yes. The reduced coefficients seem to fit an
exponential/Erlang, or possibly a gamma distribution.

My sieve code has always thrown out 'bad' reduced lattices, but I define
bad as anything with a large *condition number*. It is not just that large
coefficients give poor yield, but so do highly skewed lattices. Checking
the condition number handles both problems.
R.D. Silverman is offline   Reply With Quote
Old 2007-11-26, 15:50   #48
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

2×132×19 Posts
Default

Ah yes, condition number is almost certainly the right way to recognise bad lattices.

I'm pretty sure that the distribution is too tail-heavy to be any of the exponential-based ones, though I can't get it to fit very clearly to any of the distributions in the 'heavy-tailed distributions' Wikipedia article either. The PDF dropping off for large x as almost exactly x^-3 must be telling me something, but I'm not sure what; it doesn't appear to match the distribution for random matrices of a given determinant.
fivemack is offline   Reply With Quote
Old 2007-11-26, 17:13   #49
R.D. Silverman
 
R.D. Silverman's Avatar
 
Nov 2003

22×5×373 Posts
Default

Quote:
Originally Posted by fivemack View Post
The PDF dropping off for large x as almost exactly x^-3 must be telling me something, but I'm not sure what; it doesn't appear to match the distribution for random matrices of a given determinant.
Of course not. Not all matrices of determinant p are affine transforms
of the original lattice..
R.D. Silverman is offline   Reply With Quote
Old 2007-11-27, 13:30   #50
fivemack
(loop (#_fork))
 
fivemack's Avatar
 
Feb 2006
Cambridge, England

2·132·19 Posts
Default

Well, that was odd. I'd really expect that, since I'm sieving over the same region at each stage, #relations would be an increasing function of SP - I'm sieving further, I'm leaving all the other parameters the same, I really ought to be getting more relations.

Code:
SP	SZ	Relations	Secs / relation
50	14	2956	1.51067
50	15	6123	1.86277
60	14	3209	1.55215
60	15	6742	1.79827
70	14	3417	1.6059
70	15	7085	1.80086
80	14	3610	1.66312
80	15	7045	1.87672
90	14	3755	1.73279
90,15 failed to start, giving an out-of-memory error
On further investigation, I am only losing relations as I go from a small-prime bound of 70M to 80M in gnfs-lasieve4I15e. Maybe this is why gnfs-lasieve4I15e doesn't ship as standard with ggnfs
fivemack is offline   Reply With Quote
Old 2007-11-27, 14:57   #51
bdodson
 
bdodson's Avatar
 
Jun 2005
lehigh.edu

100000000002 Posts
Default

Quote:
Originally Posted by Andi47 View Post
What is the ECM status of this one? (I don't want to run ECM on it once somebody is sieving it, but before then I could do a few curves)
M821 = 2,821- C208, along with all of the other Cunninghams with
fewer than 234-digits, is either at 3*t50 or at 4*t50, depending upon
whether it's come up yet on the que for the 4th t50. As I was saying,
if/when this one becomes a near-term sieving target, I'd expect to
finish a version of t55; which in recent practice (the base-2s with 787)
has meant 6*t50 --- so adding 2 or 3 t50's, depending upon how many
have already been done. So far, I've been discounting curves with
B1 = 43M (p50-optimal), and running either B1 = 110M or B1 = 260M
for the new curves. I haven't heard of anyone that'd be sieving M821
anytime soon.

On the remaining numbers below 768-bits, Alex's 7,269- in particular,
I'm not clear whether (t55-4*t50) more curves are indicated. The numbers
in c190-c233 are qued on the Opteron/quadcore beowulf, so scheduling
an extra two t50's would be easy --- especially if that would make the
sieving more likely to happen (sooner). Of course, in Alex's case, he
could probably slip in the extra curves over-night on the grid he's using,
if he were so inclined. -Bruce

PS - I suppose that'd be "below 768-bits" or "degree 5 or 6
with difficulty below 220". Comparing Tom's "1110 CPU-hours"
for sieving 10,309- for example; c. 8hrs on 230 cores of the
quads for a t50 gives 2*8*200 for the extra two t50's
(in t55-4*t50) = c. 3200 CPU-hours. That doesn't sound
good; those curves would_have/were better spent on harder
numbers.

Last fiddled with by bdodson on 2007-11-27 at 15:07 Reason: added timings
bdodson is offline   Reply With Quote
Old 2007-11-27, 17:33   #52
Andi47
 
Andi47's Avatar
 
Oct 2004
Austria

2·17·73 Posts
Default

Quote:
Originally Posted by bdodson View Post
M821 = 2,821- C208, along with all of the other Cunninghams with
fewer than 234-digits, is either at 3*t50 or at 4*t50, depending upon
whether it's come up yet on the que for the 4th t50. As I was saying,
if/when this one becomes a near-term sieving target, I'd expect to
finish a version of t55; which in recent practice (the base-2s with 787)
has meant 6*t50 --- so adding 2 or 3 t50's, depending upon how many
have already been done. So far, I've been discounting curves with
B1 = 43M (p50-optimal), and running either B1 = 110M or B1 = 260M
for the new curves. I haven't heard of anyone that'd be sieving M821
anytime soon.
Thanks for the info. I have done 5 curves with B1 = 44M and sheduled 100 curves with B1 = 110M (85 of them already done). I can do more curves on either 110M or 260M if wanted

P.S.: How actual is this page? It says that 4968 curves are done on M821 at B1 = 44M - according to your post it must be rather ~21000 curves? And for M1061 AFIK t55 is finished and a few thousand curves have been done at B1=260M, see this thread.
Andi47 is offline   Reply With Quote
Old 2007-11-29, 17:03   #53
bdodson
 
bdodson's Avatar
 
Jun 2005
lehigh.edu

210 Posts
Default

Quote:
Originally Posted by Andi47 View Post
I can do more curves on either 110M or 260M if wanted

P.S.: How actual is this page? It says that 4968 curves are done on M821 at B1 = 44M - according to your post it must be rather ~21000 curves? And for M1061 AFIK t55 is finished and a few thousand curves have been done at B1=260M, see this thread.
I had an exchange with Mischa in one of the base-2 discussions; and
believe I recall that the counts on his page are personal curves, not
the sum of curves over what other people did. Same for my counts.
I checked M821, and found that my 4th t50 finished Wednesday
(by coincidence, in order, from an old input file). So that's 620 curves
with B1 = 110M (my initial t45), then 5835 curves with B1 = 260M.
That's

evalf(620/17900+5835/8000) = 0.764*t55, same as in what I

reported for the new NFSNET number M787 (which started sieving
yesterday). Two more c. t50's will raise the B1 = 260M's to
8835 curves, which gives the 1.139*t55 from the thread, loc.cit.

On M1061, I'm still saying what I've been saying since finding that
the first kilobit snfs was M1039; namely that so many curves have been
run that additional curves aren't really part of ecm-factoring; but rather
ecm-pretesting, as in the description of the M1039 sieving. If you're
having trouble with the distinction, observe Aoki being _dissapointed_
at having found the 2nd largest (at the time) ecm factor of p64 ---

Quote:
...Unfortunately, after trying 11784 curves for Step 1 and 11214 curves for Step 2 a factor was found.
since it meant that the work done on setting-up R311 = 11...1 (that's 311)
= (10^311-1)/(10-1) for sieving had been wasted.

Spending the curves on other large numbers has a much better chance
of finding a factor --- the fewer curves run previously, the better. Of
course, they're your cycles; and we have George's description of the
forum objective of "having fun". I may even take my own advice and
take a 2nd pass through the c251-c366's; perhaps after New Years.
-Bruce
bdodson is offline   Reply With Quote
Old 2007-12-02, 16:40   #54
Andi47
 
Andi47's Avatar
 
Oct 2004
Austria

2·17·73 Posts
Default M821

Quote:
Originally Posted by Andi47 View Post
Thanks for the info. I have done 5 curves with B1 = 44M and sheduled 100 curves with B1 = 110M (85 of them already done).
100 curves at B1 = 110M as mentioned above are now ready. I will do a few more curves at B1 = 260M.

Last fiddled with by Andi47 on 2007-12-02 at 16:40
Andi47 is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
2^947+1 status fivemack Factoring 17 2014-05-06 18:00
Status bsquared Game 2 - ♔♕♙♘♖♙ - Shaolin Pirates 4 2013-10-01 06:18
Status of p-1.... dave_0273 Marin's Mersenne-aries 80 2008-01-28 00:18
7,295- status and discussion Raman Cunningham Tables 2 2008-01-01 14:52
status wfgarnett3 PSearch 3 2004-03-02 18:04

All times are UTC. The time now is 08:15.


Tue Jul 27 08:15:10 UTC 2021 up 4 days, 2:44, 0 users, load averages: 2.41, 1.98, 1.82

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.