#331 | ||
|
May 2009
10110₂ Posts |
Quote:
Now there are multiple ways a cado-nfs database can have problems. It's very rare that the database becomes corrupt from the DBMS point of view, and I'm not really talking about database corruption in that sense (for which DBMS-level backups seem to be the way to go). But for example, when the number of failed WUs increases a lot, cado-nfs refuses to resume. Likewise if some of the counts diverge (I think it's in the 'sieving' table -- these counts shouldn't exist in the first place, IMO). There's no doc, not even a cookbook, as to what can be done in such cases. Lacking that, from the user point of view, a database that cado-nfs.py refuses to work with for this kind of reason is just as useful at the end of the day as a corrupt database...
|
||
|
|
|
|
|
#332 |
|
Jul 2019
32 Posts |
|
|
|
|
|
|
#333 |
|
"Curtis"
Feb 2005
Riverside, CA
4,861 Posts |
|
|
|
|
|
|
#334 |
|
"Curtis"
Feb 2005
Riverside, CA
4,861 Posts |
We had another hiccup this morning; one of the team caught it and kicked the server again.
I've just changed to I=15, as of Q=180.07M. Workunits are now 10k in length. I=16 produced 1304M raw relations from Q=8M to 180M, for an average yield of 7.58. I=15 is expected to average a yield of 2.0 from Q=180M to ~880M, which should supply the other 1400M relations. |
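For anyone who wants to check the arithmetic behind those figures, here is a minimal sketch. It assumes "yield" means raw relations divided by the width of the Q range sieved, which is how the numbers in the post work out; the variable names are only illustrative.

```python
# Figures quoted in the post, in millions (M).
i16_relations_m = 1304.0                   # raw relations found with I=16
i16_q_start_m, i16_q_end_m = 8.0, 180.0    # Q range sieved with I=16

# Assumption: yield = raw relations / width of the Q range.
i16_yield = i16_relations_m / (i16_q_end_m - i16_q_start_m)

# Projection for I=15 using the expected average yield from the post.
i15_yield = 2.0
i15_q_start_m, i15_q_end_m = 180.0, 880.0
i15_relations_m = i15_yield * (i15_q_end_m - i15_q_start_m)

print(f"I=16 average yield: {i16_yield:.2f}")
print(f"I=15 projected relations: {i15_relations_m:.0f}M")
print(f"Projected total: {i16_relations_m + i15_relations_m:.0f}M")
```

The I=15 range is about four times as wide as the I=16 range but is expected to contribute roughly the same number of relations, since the projected yield is much lower.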
|
|
|
|
|
#335 |
|
"Ed Hall"
Dec 2009
Adirondack Mtns
EE9₁₆ Posts |
With the change to I=15, I'm looking at possibly adding a couple of clients that would not be 24 hour machines. Since the server disregards and reassigns overdue WUs, suspension seems of no value. That leaves either interrupting with CTRL-C or letting a las session complete after suspension, at which point its result will just get discarded.
I may insert a break in the client script between sieving runs, triggered by a stop file (as I have done elsewhere), but is there a clean way to tell the client to stop at the end of a WU? Possibly a switch I haven't discovered?
Last fiddled with by EdH on 2019-07-21 at 19:25 Reason: missspelllin fixx |
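The stop-file idea can be sketched roughly as below. This is not code from the cado-nfs client; `run_one_wu` is a hypothetical callable standing in for whatever processes a single workunit, and the stop-file path is whatever you choose. The check happens only between workunits, so a WU in progress is never interrupted.

```python
from pathlib import Path
from typing import Callable, Optional


def run_until_stop_file(run_one_wu: Callable[[], None],
                        stop_file: Path,
                        max_runs: Optional[int] = None) -> int:
    """Process workunits one at a time until stop_file appears.

    run_one_wu is a hypothetical stand-in for one complete sieving run.
    Returns the number of workunits completed.
    """
    completed = 0
    # The stop file is only checked between workunits.
    while not stop_file.exists():
        if max_runs is not None and completed >= max_runs:
            break
        run_one_wu()
        completed += 1
    return completed
```

Creating the file (for example with `touch`) from another terminal or a cron job then ends the loop cleanly after the current workunit finishes.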
|
|
|
|
|
#336 | |
|
"Seth"
Apr 2019
443₈ Posts |
Quote:
|
|
|
|
|
|
|
#337 | |
|
Bamboozled!
"πΊππ·π·π"
May 2003
Down not across
10,753 Posts |
Quote:
|
|
|
|
|
|
|
#338 |
|
Banned
"Luigi"
Aug 2002
Team Italia
3²×5×107 Posts |
never mind...
Last fiddled with by ET_ on 2019-07-22 at 08:45 |
|
|
|
|
|
#339 | |
|
May 2009
2×11 Posts |
Quote:
Last fiddled with by thome on 2019-07-22 at 11:42 |
|
|
|
|
|
|
#340 |
|
"Ed Hall"
Dec 2009
Adirondack Mtns
EE9₁₆ Posts |
I am trying this "--single" right now. Thanks! If this works, I can build a short script around it.
My description was actually a bit poor. What I really wanted was a way to have the client run in its normal loop for many iterations, but break out at the end of a WU when suspend time gets close. I should be able to do just that. Unfortunately, each run with --single will create a separate instance for the cloudygo site to track, since each run will be seen as a different client. I will also have to look at the github version posted by SethTro. |
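A short wrapper along those lines might look like the sketch below. It is a rough outline under stated assumptions: the only thing taken from this thread is the "--single" switch; the client command line is passed in by the caller, and `wu_seconds` is your own estimate of how long one WU takes on that machine. The `now` and `run` parameters exist only so the loop can be tried out without launching anything.

```python
import subprocess
import time
from typing import Callable, Sequence


def run_single_wus_until(deadline: float,
                         client_cmd: Sequence[str],
                         wu_seconds: float,
                         now: Callable[[], float] = time.time,
                         run: Callable[..., object] = subprocess.run) -> int:
    """Launch the client once per WU until the next WU would overrun.

    deadline   -- suspend time as a Unix timestamp
    client_cmd -- full client command line ending in "--single"
                  (path and other options are the caller's; not shown here)
    wu_seconds -- estimated duration of one WU (an assumption you supply)
    Returns the number of client runs started.
    """
    runs = 0
    # Start another WU only if it is expected to finish before the deadline.
    while now() + wu_seconds < deadline:
        run(list(client_cmd), check=True)
        runs += 1
    return runs
```

Since every launch is a fresh client process, the per-run instance tracking mentioned above still applies to this approach.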
|
|
|
|
|
#341 |
|
Jul 2019
9₁₀ Posts |
Yay! We crossed 50% completion! Keep it up folks!
|
|
|
|