mersenneforum.org  

Old 2009-02-23, 23:43   #859
gd_barnes
 
 
May 2007
Kansas; USA

23622₈ Posts

Quote:
Originally Posted by mdettweiler
OH MAN! I just figured out what happened. When I tried logging on to the server's "terminal session" (as opposed to the session it gives you when you log on through VNC), lo and behold--there was a copy of the G8000 LLRnet server that had been running all along! No wonder port 8000 was already tied up by something--it was tied up by the G8000 LLRnet server which Gary had already restarted!

Gary, in the future, when you restart servers on crunchford could you possibly log into the machine via VNC, and then restart the server from there? Otherwise I can't see the restarted server's terminal window, and I think it's not even running. Thus I kept trying to restart it over on the VNC session, not realizing that it was already up and running on the console session and that that was what was hogging the port. (That would also explain why results have kept coming in for that port even after I thought the server went down.)

Okay, long story short: turns out the crashed server hadn't hung on the port after all. A new server was running the whole time!

I'll go and get the server moved over to the VNC session now so I can see it better, and get it into the while loop in case of crashes.

OK, sorry about that. But after I stopped and restarted the server, only one of my clients would run it. I even waited 10-15 mins. one time after stopping the server and then restarted it. The other clients still hung and were sleeping, even after multiple attempts to stop and restart each one of them after stopping and restarting the server. I'll go downstairs and see if they are running now.

You were seeing results because I had ONE client running against the server but the other 6 that I was running against it wouldn't connect.

What's the deal with the VNC thing anyway? I've noticed the same thing that you have. If you have a terminal window up from directly messing with the machine, it won't show in VNC. When you have a terminal window up in VNC, it won't show when you are directly messing with the machine. That seems like a bug to me. It doesn't make sense because I pulled up the task manager (Linux version) and verified exactly what was running on the machine. I then killed the server, waited an appropriate amount of time and restarted it. I tried this twice to unhang my clients. Shouldn't that have killed any terminal window in VNC or non-VNC?

Is there a way around this confusing VNC (remote access) vs. non-VNC problem? English laymen's terms please.

Also, why do you have to keep running this "while" loop? Is that because of my constant IP address changes? If so, why didn't it work this time? It was well over an hour after the crash or IP address change before I tried starting and stopping it.
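Going by the quoted post, the "while" loop is there to restart the server after a crash. A minimal sketch of that idea follows, written in Python purely for illustration. The real loop on the server is presumably a shell script that this thread doesn't show, and both the function name and the `./llrserver` command in the final comment are placeholders.

```python
import subprocess
import sys
import time


def run_with_restarts(cmd, max_restarts=None, delay=5.0):
    """Run cmd; whenever it exits, wait `delay` seconds and start it again.

    Returns the number of times the command was launched. With
    max_restarts=None the loop never ends, like a shell `while true` loop.
    """
    launches = 0
    while True:
        launches += 1
        # Blocks here until the process exits, cleanly or by crashing.
        result = subprocess.run(cmd)
        print(f"{cmd[0]} exited with code {result.returncode}", file=sys.stderr)
        if max_restarts is not None and launches > max_restarts:
            return launches
        # Short pause so a server that dies instantly doesn't spin the CPU.
        time.sleep(delay)


# Hypothetical usage (placeholder command, loops forever):
# run_with_restarts(["./llrserver"])
```

A loop like this only helps when the server process actually exits. It does nothing about clients that can no longer find the server's address.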

Gawd, this server stuff is confusing. My question is: Why did port 8000 have the problem and port 4000 didn't? Also, why should it be so difficult to kill the server from the task manager, wait 10-15 mins., and then restart it? These servers shouldn't be rocket science but they are from my perspective.
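One way to tell "the port is still hung" apart from "another copy of the server already has the port" is to try binding to it. The following is a rough sketch, assuming a plain TCP server on IPv4; `port_is_free` is an illustrative helper, not part of LLRnet.

```python
import socket


def port_is_free(port, host="0.0.0.0"):
    """Return True if a new TCP server could bind to (host, port) right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        try:
            probe.bind((host, port))
        except OSError:
            # Any OSError counts as "not free": usually the address is
            # already in use, but a permission error lands here too.
            return False
        return True


# Hypothetical usage for the port discussed above:
# print(port_is_free(8000))
```

If this reports the port as taken, a process listing will usually show which program is holding it.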

I think in the future, I'm not going to do any kind of attempt at stopping and restarting of the servers. I just end up creating more problems than there were originally. If there's a problem, I'll just move my machines to something else and you can fix them the next day.

Personally, I think having the servers on my machines has turned out to be a bad idea. My internet connection is quite stable but keeps changing addresses. I've had a mobo crash one time but I think I've gotten that problem resolved. Knock on wood. Ian has been very patient with port GB4000 and diligently does the flushdns thingy when the IP address changes. Others, I'm sure, won't be so patient.


Gary

Last fiddled with by gd_barnes on 2009-02-23 at 23:55
Old 2009-02-23, 23:59   #860
gd_barnes
 
 
May 2007
Kansas; USA

2×5×1,013 Posts

Port 3000 has now dried out and the range is complete. It has been removed from the 1st post of this thread.


Gary
Old 2009-02-24, 00:34   #861
IronBits
I ♥ BOINC!
 
 
Oct 2002
Glendale, AZ. (USA)

3×7×53 Posts

ps aux --forest is your friend.
Old 2009-02-24, 02:06   #862
AMDave
 
 
Jan 2006
deep in a while-loop

2×7×47 Posts

ps -ef | grep llr
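
In plain terms: `ps -ef` lists every process on the machine, whichever session (VNC or console) started it, and `grep llr` keeps only the lines mentioning "llr". The sketch below does the same filtering in Python. The sample process name `llrserver` and the helper names are assumptions for illustration only.

```python
import subprocess


def filter_processes(ps_output, needle):
    """Return the lines of `ps` output that mention needle (case-insensitive)."""
    lines = ps_output.splitlines()
    # lines[0] is the column header (UID PID PPID ...), so skip it.
    return [line for line in lines[1:] if needle.lower() in line.lower()]


def find_running(needle):
    """Run `ps -ef` and return the matching process lines."""
    result = subprocess.run(
        ["ps", "-ef"], capture_output=True, text=True, check=True
    )
    return filter_processes(result.stdout, needle)


# Hypothetical usage: one line per process whose command mentions "llr".
# for line in find_running("llr"):
#     print(line)
```

If a server shows up in this list, it is running, even when no terminal window for it is visible in the current session.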
Old 2009-02-24, 02:27   #863
gd_barnes
 
 
May 2007
Kansas; USA

2·5·1,013 Posts

Are you guys answering anything in my post because I can't tell?

As stated above:

Quote:
English laymen's terms please.

Gary

Last fiddled with by gd_barnes on 2009-02-24 at 02:27
Old 2009-02-24, 03:02   #864
gd_barnes
 
 
May 2007
Kansas; USA

2×5×1,013 Posts

Quote:
Originally Posted by mdettweiler
In fact, we *do* use fixed IPs for all of Gary's dedicated crunch machines, one of which is running all the servers.

Gary, as David said--whenever you have a client within your network running one of the nplb-gb1.no-ip.org servers, you can configure the client to say 192.168.2.100 instead. That's the *internal* IP address of the server and it will be a much more direct connection that will never cut off when any No-IP things have to change. (Note that you can't do this on your laptop, since then you wouldn't be able to connect to the server when away from your home network.)
OK, thanks guys. I did that. Hopefully that will resolve the reconnect problem when my public IP address changes.
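
To illustrate why the internal address helps, here is a small sketch that tries a list of server addresses in order and reports the first one that accepts a connection. It is not how the LLRnet client itself is configured (that is a setting in the client's config file), and pairing port 8000 with both addresses in the usage comment is an assumption.

```python
import socket


def first_reachable(candidates, timeout=3.0):
    """Return the first (host, port) pair that accepts a TCP connection, else None."""
    for host, port in candidates:
        try:
            # create_connection resolves the hostname and connects; the
            # `with` block closes the socket again straight away.
            with socket.create_connection((host, port), timeout=timeout):
                return (host, port)
        except OSError:
            # Refused, timed out, or the name did not resolve: try the next one.
            continue
    return None


# Hypothetical usage from inside the home network: internal address first,
# public No-IP name as the fallback.
# first_reachable([("192.168.2.100", 8000), ("nplb-gb1.no-ip.org", 8000)])
```

The internal address never goes through No-IP, so a change of public IP address cannot affect it.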

I'm still wondering about some of the issues that came up per my last post.

Everything seems to be working fine now. Thanks for the help.


Gary
Old 2009-02-24, 03:19   #865
gd_barnes
 
 
May 2007
Kansas; USA

2·5·1,013 Posts

About 15 mins. ago, I just got inundated with 400+ Emails of old primes found. Can you guys look into that quickly? Thanks.
Old 2009-02-24, 03:25   #866
Lennart
 
 
"Lennart"
Jun 2007

2⁵·5·7 Posts
SPAM :)

What the f***k are you doing ?????????


Creating ads ?

/Lennart
Old 2009-02-24, 03:40   #867
Brucifer
 
Brucifer's Avatar
 
Dec 2005

313 Posts

You guys been hacked or something? I just got 31 notifications for primes found under ports that haven't been run for ages, and aren't running now, plus others that are but I don't have systems crunching on those ports. ???????????????????????????????????

SUM TING WONG
Old 2009-02-24, 03:48   #868
PCZ
 
 
Jun 2006
Chertsey Surrey UK

2·179 Posts

Holy Spam Batman !!!
Old 2009-02-24, 03:58   #869
AMDave
 
 
Jan 2006
deep in a while-loop

292₁₆ Posts

No no no.
Everything is ok.

It was me.

Whoo.
When I fail, I fail spectacularly.

I forgot to update the mail_sent flag on the prime_list table when I re-activated the mail_notification script on the new server.

I completely missed it.
It was not on my checklist.

409 emails went out - that's the difference between the snapshot I took when I started the database migration and the current status of the old database.


I do apologise to Gary, and Max and everyone who just got their notifications again.

Please delete all of the emails from nplb_stats received in the last hour.
The table is up to date and there are no more coming.
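
For anyone curious how a missed flag turns into hundreds of emails, here is a minimal sketch. It uses Python's built-in `sqlite3` as a stand-in for the real stats database, whose engine and schema aren't shown in this thread. Only the `prime_list` table and `mail_sent` flag come from the post above; the `id` and `prime` columns and both function names are assumptions.

```python
import sqlite3


def pending_notifications(conn):
    """Return (id, prime) rows whose notification email has not gone out yet."""
    return conn.execute(
        "SELECT id, prime FROM prime_list WHERE mail_sent = 0 ORDER BY id"
    ).fetchall()


def mark_all_sent(conn):
    """Flag every unsent row as already notified; return how many rows changed."""
    cur = conn.execute("UPDATE prime_list SET mail_sent = 1 WHERE mail_sent = 0")
    conn.commit()
    return cur.rowcount


# Hypothetical migration step: run mark_all_sent(conn) on the new server
# BEFORE re-enabling the notification script, so that rows copied over with
# mail_sent = 0 are not mailed a second time.
```

A notification script that mails everything returned by `pending_notifications` will resend every row whose flag was left at 0.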


Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.