mersenneforum.org  

2008-08-17, 03:26   #23
jasonp
Tribal Bullet
Oct 2004
3,541 Posts

You guys are really cruising. Tom, you can make msieve's duplicate removal a lot more efficient in the presence of huge numbers of relations by changing LOG2_DUP_HASHTABLE1_SIZE in gnfs/filter/duplicate.c to something like 30 or 31; that should reduce the memory use at least. You can also try incrementing LOG2_DUP_HASHTABLE2_SIZE, though this will make the hashtables significantly larger.

I should be releasing v1.37 in the next week or two, and this will have a few filtering improvements.
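
To illustrate why the size of the first hashtable can matter, here is a minimal Python sketch of a generic two-stage duplicate filter. It is an assumption about the general technique, not msieve's actual code: the function name `dedupe`, the use of Python's built-in `hash`, and the bit-table layout are all hypothetical. Stage 1 sets one bit per relation hash and flags slots that are hit twice; stage 2 does an exact comparison only for relations in flagged slots, so a larger first table leaves fewer false candidates to hold in memory.

```python
def dedupe(relations, log2_table_size):
    """Drop exact duplicates from a list of (a, b) pairs, keeping first occurrences.

    Returns (unique_relations, number_of_keys_held_in_the_exact_set).
    Illustrative only; not taken from msieve.
    """
    mask = (1 << log2_table_size) - 1
    bits = bytearray((mask >> 3) + 1)   # stage-1 table: one bit per slot
    suspect = set()                     # slots hit by more than one relation

    # Stage 1: flag every slot that is hit a second time.
    for rel in relations:
        slot = hash(rel) & mask
        byte, bit = slot >> 3, 1 << (slot & 7)
        if bits[byte] & bit:
            suspect.add(slot)
        else:
            bits[byte] |= bit

    # Stage 2: only relations in suspect slots need an exact comparison.
    exact, unique = set(), []
    for rel in relations:
        if (hash(rel) & mask) in suspect:
            if rel in exact:
                continue                # true duplicate: drop it
            exact.add(rel)
        unique.append(rel)
    return unique, len(exact)


rels = [(1, 2), (3, 4), (1, 2), (5, 6), (3, 4)]
unique, held = dedupe(rels, 20)
```

Here `unique` keeps one copy each of `(1, 2)`, `(3, 4)` and `(5, 6)`, and `held` counts how many keys the exact stage had to store. With a smaller `log2_table_size`, unrelated relations share slots more often and `held` can only grow.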

2008-08-18, 09:20   #24
fivemack
(loop (#_fork))
Feb 2006
Cambridge, England
1913₁₆ Posts

We've just reached the hundred-million-Q point in relations uploaded, in under a month since sieving started; so we've been averaging about forty cores contributing to the project.

I think we have two months to go. Thanks to everyone who's contributing cycles!

2008-08-26, 21:01   #25
Batalov
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
3⁶·13 Posts

I've put up 176M and 177M; 178M-179M and 179M-180M are in progress.

They are a bit smaller than 174M and 175M. I have now realized that I had inadvertently run the 14e siever on a few small sub-chunks, but I will consistently use 15e for my other (and future) chunks. Understandably, there's nothing wrong with the 14e results; there are just fewer of them.

P.S. I have to confess that I've slowed down because I tried my muscle on two other numbers, 7,384+ and 2-1586L/gnfs. Will catch up. Promise.

-Serge

2008-08-28, 08:03   #26
fivemack
(loop (#_fork))
Feb 2006
Cambridge, England
14423₈ Posts

Code:
Thu Aug 28 01:42:30 2008  restarting with 174092090 relations
Thu Aug 28 02:38:00 2008  found 15396903 duplicates and 158695148 unique relations
Thu Aug 28 02:39:57 2008  filtering rational ideals above 202571776
Thu Aug 28 02:39:57 2008  filtering algebraic ideals above 202571776
Thu Aug 28 03:14:52 2008  158695148 relations and about 99916465 large ideals
Thu Aug 28 07:10:46 2008  reduce to 28551117 relations and 20303495 ideals in 25 passes
Given duplicates, we're just under half-way there.
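
As a quick check on the log above, the following Python sketch recomputes the duplicate rate and the gap between relations and ideals after the 25 passes. The numbers are copied from the log; the function name `summarize` and the returned keys are hypothetical. The log does not say why a few relations are neither counted as duplicate nor unique, so the `unaccounted` figure is reported without interpretation.

```python
def summarize(total, duplicates, unique, kept_relations, kept_ideals):
    """Derive a few ratios from one msieve filtering log (values typed in by hand)."""
    return {
        # fraction of the input relations that were duplicates
        "duplicate_rate": duplicates / total,
        # relations that the log counts as neither duplicate nor unique
        "unaccounted": total - duplicates - unique,
        # relations minus tracked ideals after the filtering passes
        "surplus": kept_relations - kept_ideals,
    }


stats = summarize(
    total=174092090,
    duplicates=15396903,
    unique=158695148,
    kept_relations=28551117,
    kept_ideals=20303495,
)
print(f"duplicate rate: {stats['duplicate_rate']:.2%}")
print(f"unaccounted relations: {stats['unaccounted']}")
print(f"relations minus ideals: {stats['surplus']}")
```

The duplicate rate comes out at a little under 9% of the relations read in.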

2008-08-28, 13:40   #27
bsquared
"Ben"
Feb 2007
3513₁₀ Posts

The cluster is down for a bit, so it might be longer before I can finish the range I just reserved. It's down so that they can add 64+ more cores and more memory, though, so it's a good downtime :-)

2008-08-28, 17:21   #28
jasonp
Tribal Bullet
Oct 2004
110111010101₂ Posts

I can do line sieving if nobody is working on it already. What line size do you think is appropriate for such a large job?

2008-08-28, 22:54   #29
fivemack
(loop (#_fork))
Feb 2006
Cambridge, England
7²·131 Posts

I haven't done any line sieving myself on this one, so I've no idea what size is appropriate; 50% of the relations in one file I chose at random have |x| < 10^11, but that's probably far too long a line to be of use. Just to pull numbers from the top of my head, (2*10^10) x 10^4 would be a nice region to look at (about 6% of the relations have |x|<10^10, and the skewness is around 10^6), but I would sieve b=3456 for line lengths 2^31 through 2^37 and see how the yield/time curve looks. I suppose I'd target a CPU-month for line sieving, but I don't know how you are for CPUs.

2008-08-29, 15:07   #30
jasonp
Tribal Bullet
Oct 2004
3,541 Posts

Of the 2G, 4G and 8G line sizes, 4G had the fastest time per relation found (by around 20%). With that line size a 2GHz Opteron needs approximately 30 minutes per line, so 10k lines is a fairly big chunk of runtime for me. I'll put 3 CPUs on it and see how far I get in a week. Expect something under a million relations per thousand lines.
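
To put those figures in perspective, here is a small Python sketch of the arithmetic. It assumes a constant 30 minutes per line and CPUs running around the clock; the helper names `cpu_days` and `lines_done` are hypothetical.

```python
def cpu_days(lines, minutes_per_line):
    """Total CPU-days needed to sieve the given number of lines."""
    return lines * minutes_per_line / (60 * 24)


def lines_done(cpus, days, minutes_per_line):
    """Whole lines finished by `cpus` cores running continuously for `days` days."""
    return int(cpus * days * 24 * 60 // minutes_per_line)


full_job = cpu_days(10_000, 30)    # the 10k-line region suggested earlier
one_week = lines_done(3, 7, 30)    # 3 CPUs for a week

print(f"10k lines at 30 min/line: {full_job:.0f} CPU-days")
print(f"3 CPUs for 7 days: {one_week} lines")
```

Under these assumptions the full 10k lines would take roughly 208 CPU-days, while three CPUs get through about a thousand lines in a week, which by the estimate above means something under a million relations.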

2008-08-29, 19:55   #31
fivemack
(loop (#_fork))
Feb 2006
Cambridge, England
1100100010011₂ Posts

Given the skewness I think it makes sense to do 2k lines rather than 10k, though that's still an unreasonable amount of runtime.

I don't know if there's a sense in which these small-X,Y relations are better than the lattice sieving ones; unless there is, I don't think it's worth more than a few CPU-weeks.

Thanks for the data!

2008-08-29, 21:04   #32
jasonp
Tribal Bullet
Oct 2004
3,541 Posts

Well, if 2500 lines of 1/8 the size took 3 CPU-days for 6,383+, then the current effort should be just a scale-up of that. A 2.6GHz Core 2 Duo system averages about 20 minutes per line (actually 26 minutes for odd lines and 14 minutes for even lines).
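
The scale-up can be checked with a few lines of Python. This sketch assumes time per line grows linearly with line length and ignores any hardware difference between the two jobs, neither of which is stated above; the variable names are hypothetical.

```python
# 6,383+ job: 2500 lines at 1/8 of the current line size took 3 CPU-days.
old_minutes_per_line = 3 * 24 * 60 / 2500

# Assumed linear scaling: a line 8 times as long takes 8 times as long.
scaled_minutes_per_line = old_minutes_per_line * 8

# Measured on the 2.6GHz system: odd and even lines alternate.
measured_minutes_per_line = (26 + 14) / 2

print(f"6,383+ rate: {old_minutes_per_line:.2f} min/line")
print(f"scaled by 8: {scaled_minutes_per_line:.1f} min/line")
print(f"measured now: {measured_minutes_per_line:.0f} min/line")
```

The scaled figure is about 14 minutes per line against a measured 20, so the two jobs agree to within the kind of margin a different CPU or polynomial could explain.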

2008-08-29, 21:56   #33
Wacky
Jun 2003
The Texas Hill Country
3²·11² Posts

Quote:
Originally Posted by fivemack
I don't know if there's a sense in which these small-X,Y relations are better than the lattice sieving ones; unless there is, I don't think it's worth more than a few CPU-weeks.
My experience is that not only is the line siever significantly slower than the lattice siever, but, given a large number of lattice relations, a disproportionately large number of the line-sieving relations duplicate ones already found.

I attribute this to the fact that there are very few relations which are truly smooth with respect to a limit below the special-q.

In particular, for many of the line-sieving relations, one of their "large primes" matches a q value that was used as a special-q in the lattice sieving.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.