mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   No Prime Left Behind (https://www.mersenneforum.org/forumdisplay.php?f=82)
-   -   LLRnet servers for NPLB (https://www.mersenneforum.org/showthread.php?t=10042)

MyDogBuster 2009-03-28 23:36

A couple of problems.

E-mail notification not working. I had a prime over 3 hours ago and nothing.

Gary's server's seem to be down hard. Doesn't seem to be an ip address problem.

mdettweiler 2009-03-28 23:42

[QUOTE=MyDogBuster;167038]A couple of problems.

E-mail notification not working. I had a prime over 3 hours ago and nothing.

Gary's server's seem to be down hard. Doesn't seem to be an ip address problem.[/QUOTE]
Hmm...indeed. I just checked the No-IP web site and it seems that his IP address was last updated with them about 5 minutes ago, so that indicates that it might be just the server machine that's down, not Gary's whole network (since the dynamic DNS updates come from his Windows desktop, I think). I'm going to take a whack at connecting through the alternate SSH route I set up a while back into one of the other machines on his network and see if that works.

AMDave 2009-03-29 00:37

[QUOTE=MyDogBuster;167038]E-mail notification not working. I had a prime over 3 hours ago and nothing. [/QUOTE]
edited:
I see the stats server got a copy of your prime at 2009-03-28 13:16:37 (before GB went off air)
I am looking into it immediately

AMDave 2009-03-29 01:43

fixed.
Notification is on its way.

MyDogBuster 2009-03-29 01:49

[QUOTE]
fixed.
Notification is on its way.
[/QUOTE]

Got it. All's well in email land again. Another quick fix by our crack database team.

gd_barnes 2009-03-29 08:46

It's official. My main server machine is offline. It's probably yet another blown mobo. Unfortunately with the way Max designed the remote access thing, neither of us now have access to any of my machines until I get back next Thurs.

So you'll need to move your machines from ports GB4000 and GB8000 to something else until then. Sorry. I'd probably suggest ports IB5000 or IB9000, the latter if you don't mind non-top-5000 primes. I'd rather we avoid any more machines on IB8000 for now because I'd like to keep that progressing relatively slowly until we get the new n-range loaded in. After it's loaded in, then it's all hands on deck and we take off like a rocket with it if we want. Perhaps we'll run another rally on it in 3 weeks or so.

But that's not the worst part. There are two things worse:

1. My sieving on 10 cores is due to complete late afternoon Sunday. That means that I now won't have a file sieved to P=15T ready to load into port IB8000 until next Friday. Based on the speed of processing on that port, it looks like I'll need to load in the file sieved to P=12.5T instead. I'll probably just load a small range to tide us over; likely n=500K-505K. That should be more than sufficient. For n=505K-600K, we'll use the file sieved to P=15T, the 1st part of which can be loaded after I get back.

2. The above 10 cores plus the 4 cores on my apparently bad server machine will now sit idle for 6+ days unless I can step someone through connecting the 10 cores to LLRnet. I'll see if Sherri (ex) would be willing to do that, perhaps late Sunday or Monday. ...can't do anything about the other 4 cores until I see what the problem is.

It also means I can't check my manual CRUS ranges nor my NPLB double-check range for primes.

ARGH!! :censored::rant::furious:

Karsten, Max, and Ian, I'll eventually send you the piece-mealed sieve files for k=1400-2000/n=500K-1M for the different n-ranges that are sieved to the different depths. It will be rather annoying having 2-3 different sieve depths but if I send you the correct broken up files to match to, it shouldn't be too much hassle.


Gary

gd_barnes 2009-03-29 10:26

Well, I may have what might be some good news. I just looked in the server daily report and I see that I am now officially processing NOTHING on all ports!!

That tells me that I likely had a power outage and all of the machines are turned off or are on and just not running. Power outages have been a relatively rare occurrence at my place. Assuming that is the case, I should be able to just have someone go over to my place and turn all the machines on.

Max, with the machines turned on, would we need to do the "Run VNC server" thing on them for you to get access to Crunchford? Note to all: Crunchford is the name of my server machine. Assuming so, would I have access to all of them? In other words, could you remotely do the Run VNC server thing so that I could access them?

If it's not a power outage, then maybe my place burned down. If so, my insurance will cover them although they might be kind of low priority on my to-do list. lol


Gary

mdettweiler 2009-03-29 13:54

[QUOTE=gd_barnes;167098]Well, I may have what might be some good news. I just looked in the server daily report and I see that I am now officially processing NOTHING on all ports!!

That tells me that I likely had a power outage and all of the machines are turned off or are on and just not running. Power outages have been a relatively rare occurrence at my place. Assuming that is the case, I should be able to just have someone go over to my place and turn all the machines on.

Max, with the machines turned on, would we need to do the "Run VNC server" thing on them for you to get access to Crunchford? Note to all: Crunchford is the name of my server machine. Assuming so, would I have access to all of them? In other words, could you remotely do the Run VNC server thing so that I could access them?

If it's not a power outage, then maybe my place burned down. If so, my insurance will cover them although they might be kind of low priority on my to-do list. lol


Gary[/QUOTE]
Hmm...well, if they are indeed off due to a power outage, then yes, if someone went in, turned each one on in turn, and ran the "Run VNC Server" thing on each of them (but make sure it's only run ONCE on each of them! I've seen some of these things pile up with 3 or 4 distinct VNC servers running at a time because you pressed that button while the VNC server was still running :smile:), then indeed, we would be able to access them remotely and even get sr2sieve running again quite easily. :smile:

Max :smile:

Brucifer 2009-03-29 16:12

Look at the bright side in this Gary. Here you have an "EX" that you can sometimes talk into going over and resetting your computers. I have a present long term spouse that will absolutely positively not touch anything in my computer room. I have a daughter that I cannot talk into trying to learn the on/off stuff on the linux systems. Luckily I have a son that could screw up a pair of pliers and a straight slot screwdriver and he won't have anything to do with the computers either. :smile: So when my systems tank, they just sit until I get back. So like that Campbell's Soup commercial, you have "possibilities"! :smile:

IronBits 2009-03-29 17:22

There is an option in the BIOS to turn the computer back on when the power is restored.
I have always had all my computers using that option so when the power goes out, then comes back on, so do all my computers.
You may want to look into that.
It's in the Power Menu, APM configuration, Restore on Power AC loss [Enabled]
Default is [Disabled]

mdettweiler 2009-03-29 17:51

[QUOTE=IronBits;167134]There is an option in the BIOS to turn the computer back on when the power is restored.
I have always had all my computers using that option so when the power goes out, then comes back on, so do all my computers.
You may want to look into that.
It's in the Power Menu, APM configuration, Restore on Power AC loss [Enabled]
Default is [Disabled][/QUOTE]
Hmm...but doesn't that option only work for brief power outages? I believe my computer is set similarly, but if the power outage is for more than a few seconds then the computer will just stay off when the power comes back on.


All times are UTC. The time now is 23:00.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.