mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   No Prime Left Behind (https://www.mersenneforum.org/forumdisplay.php?f=82)
-   -   Server outrages (https://www.mersenneforum.org/showthread.php?t=13840)

gd_barnes 2014-07-03 07:34

I got home about an hour ago. A power blip caused the problems. I got my home phone line working by recycling the modem. After recycling everything else that I can think of including disconnecting and reconnecting the main cable line from the wall, my internet works on my main Windows desktop machine but does not work on any of my other machines including, unfortunately, the server machine. All of the lights are on or flashing as they should be on both my router and modem so I don't know what it causing the connection issue.

I will keep playing around for a little while longer. If I can't come up with something within the hour, I'll call Time Warner Thurs. afternoon.

gd_barnes 2014-07-03 09:31

I'm at a complete loss. I think there is something flaky with the router because my laptop is unable to connect either and it is the only wireless machine in my place. I have started all of the servers in case everything finally connects while I am sleeping. They are all temporarily set to 7 days to allow everyone to return their work.

I don't know if Time Warner will be able to fix the problem over the phone but I'll try them this afternoon. I hope my router has not crapped out. I wouldn't have time to get a new one until Saturday and trying to configure a new one of those things for all of these servers will surely be a nightmare.

gd_barnes 2014-07-03 10:09

I guess I did enough unplugs and replugs to get it to work. I decided to try out the idea that my router might be bad so I tried plugging the server machine as well as a couple of others directly into the modem and bypassing the router. No luck. Still only my Windows desktop machine worked. So I replugged everything back into the router and suddenly everything connected! Go figure. I've had power outages do flaky things to the connections before but nothing to this extent.

Bottom line: All servers are now back up. I will leave a 7-day window on the LLRnet/PRPnet servers for a couple of days to allow everyone to return their work.

I'm very sorry about the extended outage.

AMDave 2014-07-03 10:38

Just the comms were out then.
It looks like the server itself was running fine the whole time, aside from the manual reboots.
I just received every hourly log file email since the start of the episode.
The server is fine ;)

(in other news: the NO-IP addresses are back on line too)

mdettweiler 2014-07-03 15:45

Confirmed that port 1400 is back online and has accepted my backlogged pairs. :smile:

AMDave 2014-07-05 10:24

Complete backup set offsite download completed for 2014-07-04. (4.9GB)

mdettweiler 2014-07-18 05:26

Server is down again - I wonder what it is this time...

gd_barnes 2014-07-18 05:41

Temporary internet blip here. I don't know what caused it. Everything appears OK now.

mdettweiler 2014-07-18 05:53

Confirmed everything is back on my end. For what it's worth, I think this is the second time (that I know of) your internet has blipped out today - one of my PRPnet clients fell back to a PrimeGrid server earlier at 10:43 PM CDT. That too appeared to be a short-term outage, since by the time I noticed it things were back online again.

mdettweiler 2014-07-20 06:38

Gary, I think there's still some lingering issues with your internet connection - maybe your router is flaking out (dying perhaps?). It had another "extended short term" outage just an hour ago, around 12:00 - 12:20 AM CDT. (That's the minimum time window I was able to confirm - it may have been out longer, but that's when 3 of my clients fell back to PrimeGrid servers. One of the clients is in another state than the other 2, so this is definitely not an issue on my end.)

AMDave 2014-07-20 11:53

I don't think so.
That's the exact time that the daily refresh script ran the weekly result table optimize task.
This task has been in place since "# HISTORY : AMDave 20120524 original"
That keeps mysql very busy for more than a few minutes due to the size of the results table.
If mysql is very very busy it can impact the prpnet database I/O, as I think it may have done here.
I will turn the weekly task off for a while I analyse and try to find a 'softer' solution.


All times are UTC. The time now is 22:11.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.