mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Cunningham Tables

Reply
 
Thread Tools
Old 2011-02-13, 10:52   #12
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

47·229 Posts
Default Slight modification for a multi-core machine

I've a multi-core machine and will be running several instances in parallel in the same directory. The command given in the first post doesn't work too well in that environment and I changed it to:
Code:
#!/bin/sh
../gnfs-lasieve4I16e -v -r t.poly -f 100000000 -c 125000 -o t.poly.lasieve-1.100000000-100125000 &
../gnfs-lasieve4I16e -v -r t.poly -f 100125000 -c 125000 -o t.poly.lasieve-1.100125000-100250000 &
../gnfs-lasieve4I16e -v -r t.poly -f 100250000 -c 125000 -o t.poly.lasieve-1.100250000-100375000 &
../gnfs-lasieve4I16e -v -r t.poly -f 100375000 -c 125000 -o t.poly.lasieve-1.100375000-100500000 &
../gnfs-lasieve4I16e -v -r t.poly -f 100500000 -c 125000 -o t.poly.lasieve-1.100500000-100625000 &
../gnfs-lasieve4I16e -v -r t.poly -f 100625000 -c 125000 -o t.poly.lasieve-1.100625000-100750000 &
The other 250K special-q from my initial 1M block will be run in a similar fashion on a dual-core laptop.


Paul
xilman is offline   Reply With Quote
Old 2011-02-13, 11:00   #13
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

47×229 Posts
Default

Up and running on a
Code:
vendor_id       : AuthenticAMD
cpu family      : 16
model           : 10
model name      : AMD Phenom(tm) II X6 1090T Processor
stepping        : 0
cpu MHz         : 3780.456
cache size      : 512 KB
Seem to be getting about 0.78 sec/rel on each processor.

Curiously enough, I'm getting almost the same performance from a 2.13GHz Core2 Duo P7540 laptop running Win7-64. Perhaps these are just small-number statistics or perhaps I need to see what needs tweaking on the AMD Linux box.

Paul

Last fiddled with by xilman on 2011-02-13 at 11:20 Reason: Add laptop stats
xilman is offline   Reply With Quote
Old 2011-02-13, 18:53   #14
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

2A0B16 Posts
Default

Quote:
Originally Posted by xilman View Post
Up and running on a
Code:
vendor_id       : AuthenticAMD
cpu family      : 16
model           : 10
model name      : AMD Phenom(tm) II X6 1090T Processor
stepping        : 0
cpu MHz         : 3780.456
cache size      : 512 KB
Seem to be getting about 0.78 sec/rel on each processor.

Curiously enough, I'm getting almost the same performance from a 2.13GHz Core2 Duo P7540 laptop running Win7-64. Perhaps these are just small-number statistics or perhaps I need to see what needs tweaking on the AMD Linux box.

Paul
After seven hours, which ought to be long enough to get credible numbers, the AMD is averaging 0.771 \pm .005 sec/rel and the Intel 0.813 \pm 0.2 sec/rel.

The ratio of these rates is 1.05 but the ratio of the clock frequencies is 1.77 so the AMD is significantly less efficient here. Perhaps I should check compilation options on the Linux siever.

Paul
xilman is offline   Reply With Quote
Old 2011-02-13, 19:17   #15
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

47×229 Posts
Default

Quote:
Originally Posted by xilman View Post
Up and running on a
Code:
vendor_id       : AuthenticAMD
cpu family      : 16
model           : 10
model name      : AMD Phenom(tm) II X6 1090T Processor
stepping        : 0
cpu MHz         : 3780.456
cache size      : 512 KB
Seem to be getting about 0.78 sec/rel on each processor.

Curiously enough, I'm getting almost the same performance from a 2.13GHz Core2 Duo P7540 laptop running Win7-64. Perhaps these are just small-number statistics or perhaps I need to see what needs tweaking on the AMD Linux box.

Paul
After seven hours, which ought to be long enough to get credible numbers, the AMD is averaging 0.771 \pm .005 sec/rel and the Intel 0.813 \pm 0.02 sec/rel.

The ratio of these rates is 1.05 but the ratio of the clock frequencies is 1.77 so the AMD is significantly less efficient here. Perhaps I should check compilation options on the Linux siever.

Paul

Last fiddled with by xilman on 2011-02-13 at 21:55 Reason: The error bounds for the Intel were 10 times too large!
xilman is offline   Reply With Quote
Old 2011-02-13, 19:51   #16
bsquared
 
bsquared's Avatar
 
"Ben"
Feb 2007

351610 Posts
Default

Quote:
Originally Posted by xilman View Post
After seven hours, which ought to be long enough to get credible numbers, the AMD is averaging 0.771 \pm .005 sec/rel and the Intel 0.813 \pm 0.2 sec/rel.

The ratio of these rates is 1.05 but the ratio of the clock frequencies is 1.77 so the AMD is significantly less efficient here. Perhaps I should check compilation options on the Linux siever.

Paul
L1_BITS (settable at compile time) may be set to 15, which is optimal for the core2 but not the AMD. I don't know if that would be enough to explain the entire difference though.
bsquared is offline   Reply With Quote
Old 2011-02-13, 20:00   #17
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

948810 Posts
Default

I have posted my own L1_bits=16 binary in the top message - it may be better for AMD. Paul, your binary seems to be a bit slow (maybe non-asm?). Give this one a try. I have 0.30-0.31s/rel on a similar 1090T.

When building from source, use the src/experimental/lasieve4_64/ (well you know that)

Last fiddled with by Batalov on 2011-02-13 at 20:37
Batalov is offline   Reply With Quote
Old 2011-02-13, 21:36   #18
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

101010000010112 Posts
Default

Quote:
Originally Posted by Batalov View Post
I have posted my own L1_bits=16 binary in the top message - it may be better for AMD. Paul, your binary seems to be a bit slow (maybe non-asm?). Give this one a try. I have 0.30-0.31s/rel on a similar 1090T.

When building from source, use the src/experimental/lasieve4_64/ (well you know that)
Yes, that is markedly better, thank you. Even after a few seconds the rate is around 0.36 s/r and that is still influenced by the set-up time, including the creation of the factorbases

I'll kill off the currently running sievers and continue from where they finished.

(Any chance of you providing comparable builds of gnfs-lasieve4I1[1-5]e please? I'm currently fighting my way through the oft-times depressing difficulties of building anything from the Franke/Kleinjung sources. If it helps, I can provide sftp to my machine and/or ssh access to you for building on this system.)

Many thanks!

Paul
xilman is offline   Reply With Quote
Old 2011-02-13, 21:48   #19
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

24×593 Posts
Default

Will do (as long as the first one runs on your system; the usual showstopper is the glibc compatibility). If you can find Tom's binary in the forum - that one is L1_bits=15.
Batalov is offline   Reply With Quote
Old 2011-02-14, 14:24   #20
R.D. Silverman
 
R.D. Silverman's Avatar
 
Nov 2003

22×5×373 Posts
Default

Quote:
Originally Posted by Batalov View Post
Yes, just allow for time in mail.
Thanks.
I have another 5 million relations to send. Let me know when
you want me to send you my data. I am gathering about 5M relations/week.
R.D. Silverman is offline   Reply With Quote
Old 2011-02-14, 18:38   #21
Batalov
 
Batalov's Avatar
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

24·593 Posts
Default

It is hard to predict yet, but there's most probably two weeks to go here (or more); so let's get back to this question after one week?
Batalov is offline   Reply With Quote
Old 2011-02-14, 18:54   #22
xilman
Bamboozled!
 
xilman's Avatar
 
"π’‰Ίπ’ŒŒπ’‡·π’†·π’€­"
May 2003
Down not across

47·229 Posts
Default

Quote:
Originally Posted by Batalov View Post
It is hard to predict yet, but there's most probably two weeks to go here (or more); so let's get back to this question after one week?
If it aids the ETA calculation, something over 1.35M relations have already turned up here in around 1 day effective computation (effective because I changed to a much more efficient siever on the faster machine 21 hours ago despite having started 32 hours ago).

Paul

Last fiddled with by xilman on 2011-02-14 at 18:59
xilman is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Finishing expired LL test Damian PrimeNet 2 2017-11-16 00:36
1870L : Yield Rate R.D. Silverman Cunningham Tables 44 2010-10-21 14:32
Finishing mprime runs bill-p Software 1 2009-12-08 17:45
distributed.net completes OGR-25 ixfd64 Lounge 4 2008-11-22 01:59
LL assignments on slow PCs "die" shortly before finishing? Andi47 PrimeNet 1 2007-02-28 22:03

All times are UTC. The time now is 08:11.


Tue Jul 27 08:11:42 UTC 2021 up 4 days, 2:40, 0 users, load averages: 1.61, 1.57, 1.67

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.