![]() |
|
|
#45 | |
|
May 2007
Kansas; USA
101×103 Posts |
Quote:
Wow, that was fast that we got down to only remaining stragglers. That makes sense now. Thanks for enlightening me. We still need to get the formatting fixed on that one web page. We also need to change "n remaining" to "pairs remaining". The option to run percentages of certain servers is outstanding! Whenever we reload for another test, Lennart's and my machines just start gobbling them up. Very cool! :-) Gary |
|
|
|
|
|
|
#46 |
|
May 2007
Kansas; USA
101×103 Posts |
When did the last run hand out its last pair? In other words, when did the server "dry" by my definition?
I'm wondering because there are still 350 stragglers remaining. Unless someone has shut off a machine, all stragglers should have been processed by now. Can that be checked? Thanks. One more thing. Is there the equivalent of a "JobMaxTime" in PRPnet? If so, what has G3000 been set to? Gary Last fiddled with by gd_barnes on 2009-08-04 at 20:54 |
|
|
|
|
|
#47 | |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
3×2,083 Posts |
Quote:
It's a little hard to tell you exactly when the server "dried" by your definition. I'm presuming that by that, you mean when did the server run out of its "main" stash of work until everything that's left was stragglers? If so, that will take a bit of digging to find out. |
|
|
|
|
|
|
#48 |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
141518 Posts |
I just checked the server, and it seems that the last time any test was handed out was at 22:17 GMT, 8/3. Within the next couple of hours Lennart's various machines returned their various results, but since then there's been no activity besides the server sending out "no available candidates on server" messages. I've changed the time limit to 6 hours; I see now that a large number (possibly all, I didn't do an exact count) of the stragglers have been expired and are being reassigned.
|
|
|
|
|
|
#49 | |
|
May 2007
Kansas; USA
101000101000112 Posts |
Quote:
This is important to figure out because in the first test, the same thing happened. I observed that the server was dry by my definition about 4-5 hours before you confirmed that it was really dry by your definition. If the pairs are coming back in a reasonable time frame, that difference should have been < 1 hour. If something is causing some pairs to "get stuck" for an extended period, we need to figure that out. To clarify again: Dry by my defintion: No new work is available to hand out. Some straggling pairs still need results to be returned. Dry by your definition: All pairs have been processed and returned. Let me come up with a better way to state this: Your definition is probably more accurate in a purely technical sense so how about we call my definition "nominally dried" and stick with calling your definition simply "dried". Gary Last fiddled with by gd_barnes on 2009-08-04 at 23:54 |
|
|
|
|
|
|
#50 | |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
624910 Posts |
Quote:
Regarding dried vs. nominally dried: okay, that works.
|
|
|
|
|
|
|
#51 | |
|
"Lennart"
Jun 2007
100011000002 Posts |
Quote:
[2009-08-03 17:49:38 GMT] crus: Returning work to server nplb-gb1.no-ip.org at port 3000 [2009-08-03 17:49:43 GMT] crus: ERROR: Workunit 124221*6^148285+1 not found on server [2009-08-03 17:49:43 GMT] crus: The client will delete this workunit Here you see that the candidates was deleted when we had the conection error. so if you had one day in delay file i don't get them again before those 24 hr. Lennart |
|
|
|
|
|
|
#52 | |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
141518 Posts |
Quote:
Lennart, could you provide debug.log excerpts from when your clients behaved like in your example? This might be just the key we're looking for. Meanwhile, I'll check and see if on the server, a test was barfed at the same time shown in your logs for your example. Edit: oh, never mind, looks like that particular example was from before I put the server on debug logging. Lennart, do you have a similar example from somewhere after 2009-08-03 18:20:04 GMT? Last fiddled with by mdettweiler on 2009-08-05 at 03:38 |
|
|
|
|
|
|
#53 |
|
May 2007
Kansas; USA
101×103 Posts |
Your last sentence notwithstanding, THAT is EXACTLY what I was afraid of and thought it might be!! (caps for emphasis not yelling) I had been seeing "result not accepted" and something about it being deleted in some of my files. Yet my one quad was definitely connected the entire time.
This clearly has to be a PRPnet bug -or- related to load on my server that either the server or the PRPnet software cannot handle. For some reason, it is not accepting some returned results even though it should be. It seems to think they are already done when in fact they are not. Whew, and I thought I was hallucinating about the huge difference between the nominal drying time and actual drying time. It seems I was not as all machines were connected at all times. Good luck both Max and Rogue figuring it out. That doesn't sound easy. If you need examples from my machine, let me know. You can thank me now or thank me later for observing the unusually large difference between the two drying times for machines that were connected the entire time. lol Gary Last fiddled with by gd_barnes on 2009-08-05 at 09:44 Reason: thank me now or later :-) |
|
|
|
|
|
#54 | |
|
"Mark"
Apr 2003
Between here and the
635210 Posts |
Quote:
|
|
|
|
|
|
|
#55 | |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
186916 Posts |
Quote:
)This means, of course, that if there's any data to be had on the client side of things, it's already sitting in one of Lennart's debug.log's and waiting for us to collect.
|
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| PRPNet server for personal use | johnadam74 | Software | 2 | 2016-01-01 15:58 |
| New SR5 PRPnet server online | ltd | Sierpinski/Riesel Base 5 | 15 | 2013-03-19 18:03 |
| First PSP PRPnet 4.0.6 server online | ltd | Prime Sierpinski Project | 9 | 2011-03-15 04:58 |
| PRPnet 3.1.3 stress-test server | mdettweiler | No Prime Left Behind | 40 | 2010-01-30 18:05 |
| First pass PRPNet server out of work? | opyrt | Prime Sierpinski Project | 6 | 2009-09-24 18:14 |