![]() |
|
|
#78 | |
|
May 2008
Wilmington, DE
22·23·31 Posts |
Quote:
Any ideas as to why I lost 1/5th of my candidates file during that barf? Over 1MB of data right in the middle of the file. |
|
|
|
|
|
|
#79 |
|
May 2008
Wilmington, DE
22·23·31 Posts |
Another stange message only after I upgraded to 2.3.4
[2009-08-07 20:12:12 GMT] Error sending <<<link rel="icon" type="image/ico" href="prpnet.ico">>> to localhost:7102 BTW, the barf's seems to have stopped. I still see 2 updates happening together, but no error messages. Last fiddled with by MyDogBuster on 2009-08-07 at 20:15 |
|
|
|
|
|
#80 | |
|
May 2008
Wilmington, DE
22·23·31 Posts |
Quote:
Last fiddled with by MyDogBuster on 2009-08-07 at 22:30 |
|
|
|
|
|
|
#81 | |
|
"Mark"
Apr 2003
Between here and the
24×397 Posts |
Quote:
I should consider allowing an administrator the ability to split the prpserver.candidates file so that it doesn't take as much memory and reducing the I/O. It could (theoretically) be striped across multiple files only reading in a subsequent file when the server is running dry. Thoughts? |
|
|
|
|
|
|
#82 | |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
3×2,083 Posts |
Quote:
Also, you're saying that it has to update prpserver.candidates with EVERY new line it outputs? That doesn't seem right. What's your "savefrequency=" set to in prpserver.ini? At the default of 5 minutes, I don't get any problems with that kind of bottlenecking. Mark, should I go ahead and upgrade the server to 2.2.4, or do you need more data with 2.2.3? Either way, considering as how Ian's reporting further problems I won't load all of 100K-150K but just 140K-150K as another test run. |
|
|
|
|
|
|
#83 | ||
|
May 2008
Wilmington, DE
22×23×31 Posts |
Quote:
I'll reset it to 3 minutes and try my testing again. I was not getting any blank records this time. Mark, thanks for considering changing the I/O to the file. Quote:
I'm currently testing a candidates file with 196K tests. The removal works just fine. Great feature. If we split it into 3 files then I would have to do some manual deleting. I guess doing more analysis before setting up a candidates file would have helped. But then again, stress testing should ferret out all problems. Maybe for future reference someone could set some reasonable limits on file sizes and stuff like that. All in all, great program(s) Mark. |
||
|
|
|
|
|
#84 |
|
May 2008
Wilmington, DE
54448 Posts |
Okay, setting the savefrequency to 5 did not get rid of the bottleneck problem. I still got the error messages but no blank records.
I checked the client log and it showed that all 20 tests were accepted. I waited the 5 minutes and then checked the candidates file for the errored tests and they were still marked as inprogress on that machine. That's an inconsistency. The client thinks all's okay with the upload, but the server didn't accept the bottlenecked ones. Last fiddled with by MyDogBuster on 2009-08-08 at 00:12 |
|
|
|
|
|
#85 |
|
"Mark"
Apr 2003
Between here and the
11000110100002 Posts |
Max, upgrade to 2.2.4. I don't think anything else is necessary at this point.
MyDogBuster, I don't understand your last statement. Are you saying that the all workunits failed to be reported to the server and the server still had them marked as inprogress? If so, that is not a problem as the server would still be waiting for valid test results. It is only a problem if the client dropped the workunits. If that is the case I would need to see debug logs from both. It is possible that a bug is still lurking in the client. savefrequency=0 would cause a huge amount of I/O. I think that setting it to an hour or more is reasonable as long as the server is stable. I also suggest setting maxworkunits to a value that allows clients to build up an hour or more of work before reporting. That would reduce the number of times clients need to communicate with the server to get and report work. It should also reduce bottlenecks. I max out on PrimeGrid projects and get 5+ hours of work each time. At this time the server is not multi-threaded. I'm not certain what happens if multiple clients try to connect at the same time. I presume that if one is connected, then the others have to wait. Multi-threading the server would not be easy, which is why I've been avoiding it. |
|
|
|
|
|
#86 | ||
|
May 2008
Wilmington, DE
22·23·31 Posts |
Quote:
I see this message on the server: [2009-08-07 04:09:06 GMT] Error sending <<ServerType: 1>> to localhost:7102 [2009-08-07 04:09:06 GMT] Error sending <<WorkUnit: 16641*24^39588+1 1249618146 16641 24 39588 1>> to localhost:7102 [2009-08-07 04:09:06 GMT] IMGunn1654@gmail.com (Sophie#3) at 192.168.2.100: Sent 16641*24^39588+1 [2009-08-07 04:09:06 GMT] Error sending <<ServerType: 1>> to localhost:7102 [2009-08-07 04:09:06 GMT] Error sending <<WorkUnit: 28701*24^39588+1 1249618146 28701 24 39588 1>> to localhost:7102 [2009-08-07 04:09:06 GMT] IMGunn1654@gmail.com (Sophie#3) at 192.168.2.100: Sent 28701*24^39588+1 [2009-08-07 04:09:06 GMT] Error sending <<ServerType: 1>> to localhost:7102 But I see this message on the client log. [2009-08-07 04:09:08 GMT] Base24: INFO: All 20 test results were accepted Quote:
Last fiddled with by MyDogBuster on 2009-08-08 at 01:17 |
||
|
|
|
|
|
#87 | |
|
A Sunny Moo
Aug 2007
USA (GMT-5)
3·2,083 Posts |
Quote:
Last fiddled with by mdettweiler on 2009-08-08 at 01:47 |
|
|
|
|
|
|
#88 | |
|
May 2008
Wilmington, DE
285210 Posts |
Quote:
|
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| PRPNet server for personal use | johnadam74 | Software | 2 | 2016-01-01 15:58 |
| New SR5 PRPnet server online | ltd | Sierpinski/Riesel Base 5 | 15 | 2013-03-19 18:03 |
| First PSP PRPnet 4.0.6 server online | ltd | Prime Sierpinski Project | 9 | 2011-03-15 04:58 |
| PRPnet 3.1.3 stress-test server | mdettweiler | No Prime Left Behind | 40 | 2010-01-30 18:05 |
| First pass PRPNet server out of work? | opyrt | Prime Sierpinski Project | 6 | 2009-09-24 18:14 |