mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Msieve

Reply
 
Thread Tools
Old 2009-04-07, 16:27   #12
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

36×13 Posts
Default

Let's simply do the gzipped relations. (This is all about 5,362+, btw.)
It will take a day to FTP; we have ~1Gb/hour transfer rate.

I will use the patch (if it isn't already in 1.41).
I will bump the starting weight to <=30 (not 20) right away, too.
Thanks, --S

Last fiddled with by jasonp on 2009-04-07 at 17:26 Reason: It's in v1.41, you're on the bleeding edge :)
Batalov is offline   Reply With Quote
Old 2009-04-07, 17:41   #13
Jeff Gilchrist
 
Jeff Gilchrist's Avatar
 
Jun 2003
Ottawa, Canada

3×17×23 Posts
Default

Quote:
Originally Posted by Batalov View Post
Let's simply do the gzipped relations. (This is all about 5,362+, btw.)
It will take a day to FTP; we have ~1Gb/hour transfer rate.
I was curious about various compression methods with sieve data so I took my 963MB msieve.dat file from a C121 and compressed them with a few different algorithms to see if it would make a big difference. Results:

gzip -9 = 457MB
bzip2 -9 = 398MB
7zip (LZMA Max) = 391MB
7zip (PPMd Max) = 366MB

So by using the free 7zip program which works in both Windows and UNIX, you can save some time in transferring the data compared to gzip.

Jeff.

Last fiddled with by Jeff Gilchrist on 2009-04-07 at 18:13
Jeff Gilchrist is offline   Reply With Quote
Old 2009-04-07, 18:05   #14
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

624910 Posts
Default

Quote:
Originally Posted by Jeff Gilchrist View Post
I was curious about various compression methods with sieve data so I took my 963MB msieve.dat file from a C121 and compressed them with a few different algorithms to see if it would make a big difference. Results:

gzip -9 = 457MB
bzip2 -9 = 398MB
7zip (LZMA Max) = 391MB
7zip (PPMd Max) = 366MB

So by using the free 7zip program which works in both Windows and UNIX, you can save some time in transferring the data compared to gzip.

Jeff.
One thing to keep in mind about 7-Zip/ppmd is that it uses a LOT of RAM for compression and decompression. The exact amount is configurable when compressing. If you have plenty of RAM, this wouldn't appear to be a problem; however, I've found that if you're decompressing under Windows, if it requires more than approximately 1.2-1.5 GB of RAM, you may run into the 2 GB per-process limit that Windows (32-bit) imposes. (The reason why you can't use the full 2 GB has something to do with how it reserves a certain chunk of RAM for the system, I think.)

Of course, you shouldn't have any problems if you're either running 64-bit Windows, or Linux (32- or 64-bit). For 64-bit Windows the limit is much higher, and if memory serves Linux will let you use everything that the respective 32-bit/64-bit stuff can see.

Last fiddled with by mdettweiler on 2009-04-07 at 18:05
mdettweiler is offline   Reply With Quote
Old 2009-04-07, 18:06   #15
Joshua2
 
Joshua2's Avatar
 
Sep 2004

13·41 Posts
Default

I had actually already run this test and discovered that my 7z ppmd could get 37% compression ratio, which was better than anything else. I had it set to ultra though, I talked about it over in the aliquot team sieve thread. PPMd is the best for text, so I guess this is text-like.
Joshua2 is offline   Reply With Quote
Old 2009-04-07, 19:26   #16
Jeff Gilchrist
 
Jeff Gilchrist's Avatar
 
Jun 2003
Ottawa, Canada

3·17·23 Posts
Default

Quote:
Originally Posted by Joshua2 View Post
I had actually already run this test and discovered that my 7z ppmd could get 37% compression ratio, which was better than anything else. I had it set to ultra though, I talked about it over in the aliquot team sieve thread. PPMd is the best for text, so I guess this is text-like.
I don't follow aliquot stuff so I missed that thread. So PPMd Ultra takes it from 366MB to 362MB.

The msieve.dat file is a text file so that would explain why PPMd does so well.

Jeff.
Jeff Gilchrist is offline   Reply With Quote
Old 2009-04-08, 23:02   #17
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

36×13 Posts
Default

Quote:
Originally Posted by jasonp View Post
It's in v1.41, you're on the bleeding edge :)
Well, together with a lucky guess of w<=30 (otherwise we would have waited for the third pass), the filtering went smoothly and landed on the target density in a single pass, in only 11 hours. The matrix is rather good-looking for a complexity-253 number... 10.6Mx10.6M and the ETA for linalg is 9 days on 4 threads. The highest memory use watermark was 4.3Gb, and now in BL is 3.7Gb (3.9Gb virtual). With the OS, it would have gone swapping on a 4Gb machine (but this one is 8Gb).

We'll probably use .7z for the next transfer.

The filtering log is attached; I am not sure whether it shows that the massive patch was used, but Jason may spot it...
Attached Files
File Type: zip 5p362filter.zip (9.0 KB, 108 views)
Batalov is offline   Reply With Quote
Old 2009-04-09, 03:03   #18
jasonp
Tribal Bullet
 
jasonp's Avatar
 
Oct 2004

3,541 Posts
Default

Quote:
Originally Posted by Batalov View Post
The filtering log is attached; I am not sure whether it shows that the massive patch was used, but Jason may spot it...
It wasn't used (look for 'commencing heavy relation removal'); you had a huge number of cliques but apparently did not need to prune enough for the heavy relation removal to be needed.

Actually, if you had the memory for it then it may have been profitable to use an ideal weight of 35 instead of 30. If the estimated average weight after the merge phase is a lot higher than 70.0, then it's probably because the merge phase went and merged all the ideals; but if you had a higher merge weight then the merge phase could afford to be more selective. At least I've noticed that when there is a lot of oversieving, as the max ideal weight goes up the average matrix weight also goes up, until with a high enough weight it drops down to the expected average but remains at about the same dimensions. Could you retry with weight <= 35 after the LA finishes?
jasonp is offline   Reply With Quote
Old 2009-04-09, 03:20   #19
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

224058 Posts
Default

Quote:
Originally Posted by jasonp View Post
Could you retry with weight <= 35 after the LA finishes?
OK. Will do. But in the meantime, here's the full log for the 2,1618L which went all the way up exactly to w<=35 and made a fantastic matrix, which was solved in mere 94 hours.
Attached Files
File Type: zip 2L1618full.zip (29.0 KB, 104 views)
Batalov is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Would a massive cache make much difference? tServo Hardware 14 2018-04-01 22:31
"lanczos error: only trivial dependencies found" with massive oversieving eigma Msieve 21 2015-05-28 03:27
Oversieving chris2be8 Msieve 7 2010-03-13 21:51
Oversieving in msieve fivemack Msieve 1 2007-09-28 18:26
Massive P727 found for (10^49081-1)/9-1 AntonVrba Factoring 5 2005-12-26 17:02

All times are UTC. The time now is 01:06.


Sat Jul 17 01:06:46 UTC 2021 up 49 days, 22:54, 1 user, load averages: 1.99, 1.88, 1.60

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.