mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > Prime Sierpinski Project

Reply
 
Thread Tools
Old 2009-11-03, 08:37   #1
opyrt
 
opyrt's Avatar
 
Apr 2008
Oslo, Norway

D916 Posts
Default PRPNet servers down?

Hi.

I'm unable to get any work from any of the PRPNet servers. Are they down, or did I manage to grab all the WUs again?
opyrt is offline   Reply With Quote
Old 2009-11-03, 12:16   #2
Mini-Geek
Account Deleted
 
Mini-Geek's Avatar
 
"Tim Sorbera"
Aug 2006
San Antonio, TX USA

17·251 Posts
Default

Well, both are saying there aren't any candidates left, so I'd guess that's what's happening.
Mini-Geek is offline   Reply With Quote
Old 2009-11-03, 12:28   #3
opyrt
 
opyrt's Avatar
 
Apr 2008
Oslo, Norway

3318 Posts
Default

There should have been several hundred available on both servers from what I understand. So I suspect a faulty client has been able to reserve all candidates. I just hope it's not my fault again... I had a faulty switch here, so my computers Morbo, Fry and Zapp were unable to write to their config/checkpoint/log files which are stored on a network share (but still intermittantly able to reach the internet). If prpclient behaves the same way as llrnet when this happens, it just continues to download candidates until the server is empty.

Hopefully ltd will find that that's not the case and someone else than me is to blame for once.
opyrt is offline   Reply With Quote
Old 2009-11-03, 12:53   #4
Joe O
 
Joe O's Avatar
 
Aug 2002

3×52×7 Posts
Default

[QUOTE=rogue;194619]I found a major bug in 2.4.3 that occurs when the server allows for more than 15 or so workunits at a time and the client grabs that many. This causes the client to crash when returning them.

Here is a consolidated list of changes from 2.4.3:
  • all: Fix a crash that occurs when large messages (> 1000 bytes) are received.
  • server: Prevent server from double-checking primes.
  • server: Output a message if unable to open of the .removed files and keep candidate in the main file until the .removed file can be opened. This addresses a potential crash in which the server presumes that the .removed file could be opened.
Joe O is offline   Reply With Quote
Old 2009-11-03, 14:44   #5
opyrt
 
opyrt's Avatar
 
Apr 2008
Oslo, Norway

7·31 Posts
Default

Quote:
Originally Posted by Joe O View Post
Quote:
Originally Posted by rogue View Post
I found a major bug in 2.4.3 that occurs when the server allows for more than 15 or so workunits at a time and the client grabs that many. This causes the client to crash when returning them.

Here is a consolidated list of changes from 2.4.3:
  • all: Fix a crash that occurs when large messages (> 1000 bytes) are received.
  • server: Prevent server from double-checking primes.
  • server: Output a message if unable to open of the .removed files and keep candidate in the main file until the .removed file can be opened. This addresses a potential crash in which the server presumes that the .removed file could be opened.
That could be it, but default the clients are set to only download 1 WU at the time.
opyrt is offline   Reply With Quote
Old 2009-11-03, 15:49   #6
Joe O
 
Joe O's Avatar
 
Aug 2002

3·52·7 Posts
Default

Try commenting out the double check line ie port 7101.
I can get to the port 7100 server but not the port 7101 server.

Code:
k*b^n+/-c Total N Min N Max N FT Done FT Done Thru Max FT Done  
79309*2^n+1 70 8397254 8499134 0 0 0  
79817*2^n+1 167 8328351 8499791 0 0 0  
90527*2^n+1 207 8324351 8499791 0 0 0  
152267*2^n+1 106 8364867 8499963 0 0 0  
156511*2^n+1 65 8346000 8498568 0 0 0  
168451*2^n+1 114 8373240 8499528 0 0 0  
222113*2^n+1 324 8380773 8499453 0 0 0  
225931*2^n+1 122 8320568 8499656 0 0 0  
237019*2^n+1 169 8388502 8499886 0 0 0
Joe O is offline   Reply With Quote
Old 2009-11-03, 17:05   #7
opyrt
 
opyrt's Avatar
 
Apr 2008
Oslo, Norway

21710 Posts
Default

Quote:
Originally Posted by Joe O View Post
Try commenting out the double check line ie port 7101.
I can get to the port 7100 server but not the port 7101 server.
I can't get any work from 7100 either... :-/
opyrt is offline   Reply With Quote
Old 2009-11-03, 22:12   #8
ltd
 
ltd's Avatar
 
Apr 2003

22·193 Posts
Default

The server are working but one client ran wild and reserved all tests from both machine within a very short time. I will load some tests into both queues.

For the DC server this will be low n tests from another k. Both activities will take some time.

Sorry that I did not react earlier but I had no time to watch the forum before.

By the way it was not one of your machine opyrt.

Last fiddled with by ltd on 2009-11-03 at 22:25
ltd is offline   Reply With Quote
Old 2009-11-03, 23:11   #9
ltd
 
ltd's Avatar
 
Apr 2003

30416 Posts
Default

Hopefully the machine should be up again in the next 10 minutes.
Took a little bit longer as the admin program refused to work for me for unknown reason. So I had to edit some files by hand. ( Hope I made no error)

Used the downtime to upgrade to revision 2.4.4 of the server also.
ltd is offline   Reply With Quote
Old 2009-11-03, 23:14   #10
opyrt
 
opyrt's Avatar
 
Apr 2008
Oslo, Norway

7×31 Posts
Default

Great job ltd, thanks for fixing it so fast!
opyrt is offline   Reply With Quote
Old 2009-11-03, 23:23   #11
ltd
 
ltd's Avatar
 
Apr 2003

22·193 Posts
Default

Both server are up again.
The main server has already handed out several tests and there were no requests from the runaway client. So I hope that there will be no empty queue anymore.

Sorry once more that I did not notice the problem earlier but today I totally ignored the forum.
ltd is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
LLRnet and PRPnet servers for automated LLR mdettweiler Twin Prime Search 230 2020-04-01 03:30
PRPnet servers for NPLB mdettweiler No Prime Left Behind 228 2018-12-26 04:50
PRPnet Servers for CRUS MyDogBuster Conjectures 'R Us 76 2018-03-09 19:05
Public PRPNet Servers rogue Open Projects 26 2013-01-16 01:33
SR5 PRPnet 2.4.7 Servers - Shutting Down Joe O Sierpinski/Riesel Base 5 6 2010-12-06 20:41

All times are UTC. The time now is 17:50.

Mon May 25 17:50:11 UTC 2020 up 61 days, 15:23, 1 user, load averages: 2.28, 2.33, 2.21

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.