mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > No Prime Left Behind

Reply
Thread Tools
Old 2014-06-30, 03:28   #309
gd_barnes
 
gd_barnes's Avatar
 
May 2007
Kansas; USA

28AC16 Posts
Default

Wow. Bad timing. I'm out of town until Thursday morning. My home phone line is out too. I'll try contacting my internet provider to see if they see anything on their end.

When I get it reconnected, I'll set all of the servers to 2 weeks for a period of 2-3 days to allow everyone to return their work.

Sorry everyone.
gd_barnes is offline   Reply With Quote
Old 2014-06-30, 09:31   #310
AMDave
 
AMDave's Avatar
 
Jan 2006
deep in a while-loop

10100100102 Posts
Default

ITMT I restored the backups from 2014-06-13 on my DRP server. The restore is confirmed successful.

DRP URL for viewing only - http://nplb.no-ip.org/stats/index.php?content=port
To those not in the know: do not attempt to use DRP ports - like your brain-sucker you will starve ;)
The ports remain closed and untested on the DRP server and some - as yet untested - config would be required to make it active.

Have a safe trip, Gary. Hopefully all comes back up ok when you get home.

EDIT -
I got asked the last time I mentioned DRP:
DRP = Disaster Recovery Plan - http://en.wikipedia.org/wiki/Disaster_recovery_plan
Due to the differences between the current server and the DRP server I have not yet been able to upgrade the DRP plan to a BCP plan as the DRP config is not backward compatible to the current host. I could not implement the automatic fail-over as it could not fail-back. This may be resolved at some point in the future if necessary.
Our DRP plan is tested about twice per year to keep it valid and up to date. Although the full administrative functionality has not yet been fully tested under the DRP, there is a high level of confidence that the 'automagic wand' of linux admin SMACK-FU will 'Make it so.' - Yes. I like Picard quotes too ;)

Last fiddled with by AMDave on 2014-06-30 at 10:07
AMDave is offline   Reply With Quote
Old 2014-06-30, 16:34   #311
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

3×2,083 Posts
Default

Ah, thanks for getting that set up Dave. As you mentioned, the PRPnet/LLRnet ports are obviously not open on the DRP server - that would indeed be somewhat tricky to implement, since there's not really a "clean" way to communicate assigned pairs and results back to the original ports when they come back up.

This got me thinking...Dave, what did you have in mind for implementing the automatic fail-back? In this case, the most recent backup we had available to restore onto the DRP server was from over 2 weeks ago (the last monthly backup). That's great if the main server were to fail completely (i.e. if the hard drive bailed and we lost everything on it) - rolling back 2 weeks is certainly better than losing it entirely - but, in situations like the present one where the main server is expected to come back online soon, with no loss of data, the two servers would be completely out of sync if we attempted to run PRP/LLRnet ports on the DRP server int he meantime. For a (relatively) short downtime like we anticipate this one to be, by the time the main server came back up we'd still be processing "historical" work on the DRP server that's long been completed.

The only way I can think of to do this practically (i.e. so we're not spending days on end spinning our wheels on historical work) would be to:
  • Have the backup server running continuously, keeping its stats database updated daily from the master server's results files.
  • Likewise, download a snapshot of the PRPnet port directories and databases to the DRP server on a daily basis.
  • In case of main server failure, the DRP server would be activated, and clients could fall back to it. (This is easy for PRPnet since it allows backup servers to be configured; with LLRnet it would require manual intervention.)
  • When the main server comes back online, instead of bringing its own ports online immediately, it would redirect (port forward) to the DRP server, meanwhile getting its own stats database back in sync by pulling down daily and hourly files from the DRP server.
  • We'd then need to manually migrate the current port state back from the DRP server to the main server. This would involve shutting down the port, taking a mysqldump of the respective database and a tarball of the port directory, SCPing the files to jeepford, restoring the database and port directory, then restarting the server.
Aside from the logistical hassles of keeping the second server "in touch" with the main one on a daily basis to be ready for fallback (which may or may not be practical), the last one would seem to be the kicker - sometimes those mysqldumps can get kind of big. The raw size of the dump file is not so much the issue; they can be compressed quite effectively, but it can take quite a while (>5 minutes) just to make the dump. When you include the time to do the SCP, and to restore the backup on the main server, the port could realistically be offline for an hour (and it does have to be offline, because you need to transfer the PRPnet database and directory together to keep them in sync and make sure no results are lost). This would need to be done individually for each port, and would probably entail a fair amount of manual admin intervention to pull off smoothly. In this light, it may not be worthwhile to even attempt to have the ports run on the DRP server.

Just thinking out loud...obviously none of this is of immediate critical importance, but, since we were talking about DRP...
mdettweiler is offline   Reply With Quote
Old 2014-07-01, 06:04   #312
gd_barnes
 
gd_barnes's Avatar
 
May 2007
Kansas; USA

22×19×137 Posts
Default

Well, I had a friend go over to my house. There is definitely internet access there because my main desktop machine was able to connect just fine. The server machine was a different story. It rebooted itself for some reason. I had him log onto it but he could not get internet access from the machine itself even after doing some recyling of routers and modems. So I don't know why it cannot connect and why it rebooted itself. Max or Dave, you might try remote access to the machine now and see if you can get to it.

I'll be home around 3 AM CDT Thursday morning. I'll look at it right away when I get home. Hopefully it's as simple as messing with the cable in the wall or tigtening a loose connection.

Sorry again for the problems.
gd_barnes is offline   Reply With Quote
Old 2014-07-01, 09:39   #313
AMDave
 
AMDave's Avatar
 
Jan 2006
deep in a while-loop

2×7×47 Posts
Default

Negative from me unfortunately.
I had set up a 'call home' trace from the server via the log file emails, a while back, so that even if the IP address changed I could trace it back, but that message is not coming out.
The IP on the DNS provider has also not updated.
So must to conclude that the server cannot communicate outbound.
If I have the IP I can connect and fix things but not until then.
EDIT - yep. a loose CAT5 terminator would do it.

Last fiddled with by AMDave on 2014-07-01 at 09:40
AMDave is offline   Reply With Quote
Old 2014-07-01, 10:07   #314
odicin
 
Sep 2011
Potsdam, Germany

7716 Posts
Default

The backup website handled by no-ip is also unreachable now, because no-ip was takedown by MS: https://www.noip.com/blog/2014/06/30...soft-takedown/

Regards Odi
odicin is offline   Reply With Quote
Old 2014-07-01, 10:25   #315
AMDave
 
AMDave's Avatar
 
Jan 2006
deep in a while-loop

29216 Posts
Default

That is an interesting development.
We stopped using NO-IP as the secondary domain back on post #257 in this thread, circa 14 Aug 2013.
"Decision made - http://nplb-gb1.no-ip.org/ will not be renewed.
Since the DNS update for www.noprimeleftbehind.net has been fixed and has been more reliable, it will be the only address moving forward."

I test and observe that the DRP link via NO-IP is currently working and confirm that the NO-IP issue should not be affecting us. However, that doesn't mean that it couldn't:
"In the meantime, NO-IP / Vitalwerks have published their answer online:
Apparently, the Microsoft infrastructure is not able to handle the billions of queries from our customers. Millions of innocent users are experiencing outages to their services because of Microsoft’s attempt to remediate hostnames associated with a few bad actors.”
The un-seized dynamic dns ".net" domains may have become caught up in this filtering overload, although I am not clear on the 'how'.
So this is potentially a feasible hypothesis from Odi.

edit - the ".net" domain may well be caught up in this after further reading, even though we are using a different provider. I can't connect via the IP, so I still suspect something else.

edit -
ITMT (M$) are boasting about it - http://blogs.technet.com/b/microsoft...isruption.aspx
Reminds me of Judge Dredd ... "I AM the law!"
May the deities help us, because justice is out to lunch.
/me facepalms

Who the heck made M$ the Internet's policeman ... oh looky there. A judge did.
SlashDot thread - http://yro.slashdot.org/story/14/07/...tm_medium=feed

Last fiddled with by AMDave on 2014-07-01 at 11:13
AMDave is offline   Reply With Quote
Old 2014-07-01, 12:02   #316
AMDave
 
AMDave's Avatar
 
Jan 2006
deep in a while-loop

2×7×47 Posts
Default

update - the no-ip addresses to NPLB DRP site and also to the FDCPS project are no longer working.
They are hosted in Australia on a linux server.
{bad words}
If it were a car company, they'd have to issue a recall, but some US judge said sure, you can highjack the international highway so you can find and stop the defective vehicles you made and sold.
I have issues with the disregard for jurisdiction and for the incorrect placement of responsibility.

Last fiddled with by AMDave on 2014-07-01 at 12:20
AMDave is offline   Reply With Quote
Old 2014-07-01, 16:10   #317
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

3×2,083 Posts
Default

Yeah, this whole Microsoft/No-IP episode has been rather disappointing - Microsoft's other botnet takedowns prior to this had generally been against unequivocally shady providers, and they seem to be using the same tactics in this case, clueless to the fact that No-IP is actually a legit provider heavily used by good guys. The weird thing is that, over the last number of years I'd noticed Microsoft becoming more clueful as a company in general, but this proves a shining exception. It doesn't help that the judges involved are often quite clueless about the technological ramifications, and thus inevitably base a large part of these subpoena/court order decisions on the reputation of the party asking for it rather than the technical and situational merits of the case...definitely a recipe for disaster if there ever was one.

Interestingly, I can still access the DRP server via it's No-IP address, as well as one of my own boxes which also has an address at no-ip.org. I wonder if the "temporary infrastructure" bottlenecks are affecting international links more than the U.S. - that could be plausible, and would explain why it works for some but not others.

Last fiddled with by mdettweiler on 2014-07-01 at 16:12
mdettweiler is offline   Reply With Quote
Old 2014-07-01, 19:44   #318
odicin
 
Sep 2011
Potsdam, Germany

7·17 Posts
Default

Hmm... curious. I can't loop up the No-IP Adress with the DRP Site here from Germany. I tested it with different DNS-Servers from different providers.

Maybe I should take an american DNS ;)

Regards Odi
odicin is offline   Reply With Quote
Old 2014-07-02, 06:48   #319
gd_barnes
 
gd_barnes's Avatar
 
May 2007
Kansas; USA

22×19×137 Posts
Default

I just realized today that my home phone line is not working either. This may end up requiring a call to Time Warner after all. I'll try my best to get it working early Thursday morning after I get home. If I can't get my phone line and internet connection on the server machine working myself, it will have to wait until Thursday afternoon when I can put in the call.
gd_barnes is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
What exactly is sent to the server? paul0 NFS@Home 2 2015-03-12 23:00
Server bug -- new? Christenson Information & Answers 5 2011-07-12 21:44
Server Down? Grant Information & Answers 13 2008-11-24 19:37
New ECM-server available andi314 Factoring 3 2003-08-31 11:22
New Server Hardware and price quotes, Funding the server Angular PrimeNet 32 2002-12-09 01:12

All times are UTC. The time now is 13:53.


Mon Aug 2 13:53:07 UTC 2021 up 10 days, 8:22, 0 users, load averages: 2.01, 2.13, 2.05

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.