mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   No Prime Left Behind (https://www.mersenneforum.org/forumdisplay.php?f=82)
-   -   Server outrages (https://www.mersenneforum.org/showthread.php?t=13840)

MyDogBuster 2013-02-13 08:52

As usual, a great job Dave. :tu:

AMDave 2013-02-13 09:06

You would think so, but it just went down again within minutes of my last post.
Arg!
It's got to be a network issue.
More diagnostics required.

edit - it came back again, and as before, everything is running fine. I cannot see yet what is causing this.

AMDave 2013-02-13 10:32

1 Attachment(s)
still going up and down
right now its down

henryzz 2013-02-13 10:54

[QUOTE=AMDave;329292]still going up and down
right now its down[/QUOTE]

Are you able to reboot the PC and then get access again?

AMDave 2013-02-13 11:15

Nope. I can't log into it at the moment.
I don't think the server is the problem.
Ram is OK. (running at around 35% usage - using htop)
Disk is OK. (No bad blocks found, SMART test passed - using smartctl short test)
Processes appear to be OK. (using htop and testing IO, reads and writes)
Database is OK. (already completed full mysqlcheck on all databases)
Port servers are OK. (when the connection is up, the ports are serving and recieving ok)
Power is OK. (Server is still on, no down time)
Weather is OK. (no lightning etc)
I'm not going to reboot the server if there is nothing wrong with it.

Also the DynDNS is OK (the server reports its IP to both the DNS services - no problem there when the link is up)

It really looks like a network/ISP issue the more I eliminate things.
Unfortunately I don't have the ability to remotely reboot the router.
Also, I can't find the service status page for the ISP. It looks like they don't have one.

I'm going to take a break.

AMDave 2013-02-13 12:22

Just for completeness, I tested the NIC (in the current up-time)
The NIC is fine too.
Nothing exceptional in the netstat either.
Looks like local traffic (LAN) is fine.
Also looks like WAN traffic is fine - while it is working

there are 2 more networking tests I want to do but ...
blah - out it goes again.

Going to stalk some Zzzz's now.

AMDave 2013-02-13 12:35

caught another opening
checked another server on the LAN - all ok, LAN comms ok, local off-server copies of backups are current
outbound tracert show no unexpected external influences
pinning it down to the router now I think.

I'll be dreaming of a cat chewing on an CAT-5 cable tonight :P

gd_barnes 2013-02-13 12:44

Sorry guys. I'm out of town until Friday. I'll look at it when I get back. My machines are crunching away on my personal port so the server isn't down. Like Ian said, it's likely router or IP problems.

Lennart 2013-02-14 06:23

[QUOTE=gd_barnes;329306]Sorry guys. I'm out of town until Friday. I'll look at it when I get back. My machines are crunching away on my personal port so the server isn't down. Like Ian said, it's likely router or IP problems.[/QUOTE]


Seems to be working now.

Lennart

MyDogBuster 2013-02-14 18:36

Back to acting up again.

Edit: I'm starting to lead toward this being an ISP problem. Curious that it returns to normal around 3AM and starts
going flaky again around 9AM. That, or you have some kid downloading every porn movie in existence that lives in your neighborhood.

gd_barnes 2013-02-16 01:52

Major re-cycle has been completed. Everything seems to be running smoothly now.


All times are UTC. The time now is 13:54.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.