mersenneforum.org  

Go Back   mersenneforum.org > Factoring Projects > Msieve

Reply
 
Thread Tools
Old 2010-11-12, 23:43   #34
jasonp
Tribal Bullet
 
jasonp's Avatar
 
Oct 2004

3,541 Posts
Default

The Teragrid cluster we've been using, with quad-core nodes, is going to be replaced at the end of the year with much more powerful nodes (12 cores each instead of 4 now). This is actually bad for us, because we get charged per-core for the CPU time we use, not per-node. So the new cluster will cost 3x as much in CPU time but will not be 3x faster for our application.

Last fiddled with by jasonp on 2010-11-12 at 23:44
jasonp is offline   Reply With Quote
Old 2010-11-16, 12:31   #35
Jeff Gilchrist
 
Jeff Gilchrist's Avatar
 
Jun 2003
Ottawa, Canada

100100101012 Posts
Default

Quote:
Originally Posted by frmky View Post
This is actually more complicated than it first appears. For Core 2 class Xeons using DDR2, I find lower runtimes using only four of the eight cores. The optimum seems to be using two MPI processes per node and two threads per process. I haven't had the opportunity to test it on Core i7 class Xeons or the DDR3-based Opterons.
Yes, so many things that can change the performance. I had to re-start the process about half way through and now they have switched, 69h left on the Infiniband and 81 h left on the gigabit from the same checkpoint. It all depends on the mix and on a busy cluster I don't have the choice of hand picking the ultimate layout.

Jeff.
Jeff Gilchrist is offline   Reply With Quote
Old 2010-12-07, 03:51   #36
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

11·347 Posts
Default Msieve QS Taking A Bit Long For A C96...

I've been running Aliqueit on one of my linux boxes and a recent (now getting old) qs procedure seems to be taking a bit long for a c96. At least, I kind of think it is taking a bit long - it's been running it for over 72 hours:
Code:
[Dec 03 2010, 20:53:00] c96: running qs (msieve)...
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sieving in progress (press Ctrl-C to pause)
81521 relations (20398 full + 61123 combined from 1207085 partial), need 81272
sieving complete, commencing postprocessing
This particular machine has performed qs on two c94s in 08:22:54 and 11:26:00 and a c97 in 16:11:29. Top seems to show msieve as still working along, at around 90% CPU and 0.5% memory. It is running at normal priority.

Is there any info I should gather prior to killing the current run?

In checking this out, I found that this was the first calculation of a new run of Aliqueit on a c105 in sequence 120760, if that is of any significance. Msieve was compiled from the repository, but seems to only be showing version 1.47. I will update and recompile after I resolve the current issue.
EdH is offline   Reply With Quote
Old 2010-12-07, 04:32   #37
jasonp
Tribal Bullet
 
jasonp's Avatar
 
Oct 2004

67258 Posts
Default

If it's stuck in the postprocessing, I'd suspect bad hardware or a buggy compiler. How big is the relation file? If it's under 1GB maybe I can download the relations and take a look.
jasonp is offline   Reply With Quote
Old 2010-12-07, 16:17   #38
EdH
 
EdH's Avatar
 
"Ed Hall"
Dec 2009
Adirondack Mtns

11×347 Posts
Default

Quote:
Originally Posted by jasonp View Post
If it's stuck in the postprocessing, I'd suspect bad hardware or a buggy compiler. How big is the relation file? If it's under 1GB maybe I can download the relations and take a look.
If you're interested, the file is available at msieve,dat. I'll leave it there for a couple days. It's 42.8MB. It just passed 86 hours and I'll probably just stop and restart to see what happens, unless you'd like anything else prior to interrupting it.

It is possible I don't have enough memory in this machine. I just checked and it has only 256MB. I now remember only having that much of the right type, when I set it up.
EdH is offline   Reply With Quote
Old 2011-02-09, 18:36   #39
em99010pepe
 
em99010pepe's Avatar
 
Sep 2004

2·5·283 Posts
Default Msieve v. 1.48

I got this error while running Msieve v. 1.48:
Code:
error: corrupt state, please restart from checkpoint
I restarted the LA. Let's see what happens in the next hours.

Carlos
em99010pepe is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Msieve 1.53 feedback xilman Msieve 149 2018-11-12 06:37
Msieve 1.50 feedback firejuggler Msieve 99 2013-02-17 11:53
Msieve 1.43 feedback Jeff Gilchrist Msieve 47 2009-11-24 15:53
Msieve 1.42 feedback Andi47 Msieve 167 2009-10-18 19:37
Msieve 1.41 Feedback Batalov Msieve 130 2009-06-09 16:01

All times are UTC. The time now is 00:48.


Sat Jul 17 00:48:16 UTC 2021 up 49 days, 22:35, 1 user, load averages: 2.00, 1.59, 1.42

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.