mersenneforum.org  

2008-11-22, 11:56   #56
gd_barnes
May 2007
Kansas; USA

3³×5×7×11 Posts

Quote:
Originally Posted by mdettweiler
Okay, cool, that might just do the trick. However, you may want to do, say, 1/4 on C443 and 3/4 on G4000....I believe Carlos had mentioned once before that since he's got a Windows server he didn't want too terribly many cores on his server.

OK. How about 32 cores on port 4000 and 10 (including 2 slower) on port 443? 2 slower cores are already running on port 443.

Darn, why didn't this happen on port 5000 at the end of the 3rd drive? Was Lennart not running a Proxy server then?

BTW, I was able to connect to port 400, and it cleared out one pair each on my 2 slower cores (Windows desktop) after I uncommented the --once=1 line. Now it has cleared its WuCache on port 400 and is running port 443.

I need to go downstairs and check the quads now to see if they've begun processing port 400 again. If so, I'll stop them, clear out the cache, and move them to ports 443 and 4000.


Gary

Last fiddled with by gd_barnes on 2008-11-22 at 11:56
2008-11-22, 11:57   #57
mdettweiler
A Sunny Moo
Aug 2007
USA (GMT-5)

3·2,083 Posts

Quote:
Originally Posted by IronBits
Max Connections was set at 60, and it handled everything thrown at it, until the proxy showed up

Hmm...maybe you could try 500? 1000? I don't know--is there a theoretical upper limit for this?
2008-11-22, 11:59   #58
em99010pepe
Sep 2004

2·5·283 Posts

Forget tuning the timings; it's a proxy bug. All cores must be spread across the other servers.

Gary, 10 is OK for C443. Just checked: 2 already got work, as you said.

Carlos

Last fiddled with by em99010pepe on 2008-11-22 at 12:00
2008-11-22, 12:00   #59
IronBits
I ♥ BOINC!
Oct 2002
Glendale, AZ. (USA)

2131₈ Posts

The limit has nothing to do with it: the proxy runs through the sockets at a rate of 10 per second until it runs out of sockets.
It makes no difference if I set it to 1,000; the end result is always the same: it runs out of sockets and dies.
Until he turns that thing off, we are screwed... I think I could go into my router and ignore his IP.
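
A back-of-the-envelope sketch of why raising the limit only delays the failure. It assumes the proxy takes sockets at a steady 10 per second, as described above, and that the server frees none of them in the meantime; both are simplifications, and the function name is made up for illustration.

```python
def seconds_until_exhausted(max_connections: int, sockets_per_second: float) -> float:
    """Time until every connection slot is taken, assuming no slot is released."""
    if max_connections < 0 or sockets_per_second <= 0:
        raise ValueError("need a non-negative limit and a positive rate")
    return max_connections / sockets_per_second


# Compare a few candidate Max Connections settings at 10 sockets per second.
for limit in (60, 500, 1000):
    print(f"limit {limit}: exhausted after {seconds_until_exhausted(limit, 10):.0f} s")
```

Under these assumptions a larger limit buys seconds or minutes, not a fix, which matches the observation that the end result is always the same.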
2008-11-22, 12:04   #60
mdettweiler
A Sunny Moo
Aug 2007
USA (GMT-5)

1869₁₆ Posts

Quote:
Originally Posted by gd_barnes
OK. How about 32 cores on port 4000 and 10 (including 2 slower) on port 443? 2 slower cores are already running on port 443.

Darn, why didn't this happen on port 5000 at the end of the 3rd drive? Was Lennart not running a Proxy server then?

BTW, I was able to connect to port 400, and it cleared out one pair each on my 2 slower cores (Windows desktop) after I uncommented the --once=1 line. Now it has cleared its WuCache on port 400 and is running port 443.

I need to go downstairs and check the quads now to see if they've begun processing port 400 again. If so, I'll stop them, clear out the cache, and move them to ports 443 and 4000.


Gary

Yes, I think this is the first time Lennart is using a proxy. The main reason he wanted to set up a proxy was that it was getting too hard to manually stop/start 60 or 70 individual LLRnet clients.

Hey, wait a minute, I just had a thought! How about Lennart simply set his proxy to cache, say, 50 k/n pairs at one time--well below any of our servers' maxConnections limit--and thus, theoretically, his proxy should be no worse than *one* caching LLRnet client set to 50 k/n pairs at once (which is not a problem)? Yes, his proxy would be connecting to the server more often, but that's fine, as long as it's in more manageable, individual chunks--as is evidenced by how the servers survive him just fine when he connects without a proxy.
2008-11-22, 12:09   #61
IronBits
I ♥ BOINC!
Oct 2002
Glendale, AZ. (USA)

459₁₆ Posts

Beyond and I went through all of this before to find out what the problem was and it came down to the proxy and server not being able to talk properly.
You can't fix this one, Max.
What he could do is run multiple proxy servers, with no more than 25 clients per proxy.
I just watched his proxy eat 100 sockets in under 2 seconds.
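
As an illustration of the "no more than 25 clients per proxy" idea, here is a minimal sketch that splits a list of clients into groups of at most 25. The client names and the function are hypothetical; the figure of 70 clients is taken from the "60 or 70 individual LLRnet clients" mentioned earlier in the thread.

```python
def split_into_proxies(clients, max_per_proxy=25):
    """Split clients into consecutive groups of at most max_per_proxy each."""
    if max_per_proxy < 1:
        raise ValueError("max_per_proxy must be at least 1")
    return [clients[i:i + max_per_proxy]
            for i in range(0, len(clients), max_per_proxy)]


# Hypothetical client labels; only the count matters here.
clients = [f"client-{n:02d}" for n in range(1, 71)]
for index, group in enumerate(split_into_proxies(clients), start=1):
    print(f"proxy {index}: {len(group)} clients")
```

With 70 clients this gives three proxies (25, 25 and 20 clients). Whether that is actually enough to keep the server alive is something only a test against the real server would show.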
2008-11-22, 12:10   #62
IronBits
I ♥ BOINC!
Oct 2002
Glendale, AZ. (USA)

3·7·53 Posts

You can also see another problem the proxy causes by looking at the rejected.txt file.
http://nplb.ironbits.net/rejected_400.txt

I need to get some sleep; I've been up all night...
If he doesn't kill his proxy server, my server will not be able to function until he does...

Last fiddled with by IronBits on 2008-11-22 at 12:17
2008-11-22, 12:16   #63
mdettweiler
A Sunny Moo
Aug 2007
USA (GMT-5)

14151₈ Posts

Quote:
Originally Posted by IronBits
Beyond and I went through all of this before to find out what the problem was and it came down to the proxy and server not being able to talk properly.
You can't fix this one, Max.
What he could do is run multiple proxy servers, with no more than 25 clients per proxy.
I just watched his proxy eat 100 sockets in under 2 seconds.

Hmm...I see. That's why I was thinking that if he set his proxy to cache, say, 50 k/n pairs at a time (instead of 1000 or whatever he's got), then it would communicate more often, but in smaller bits, thus giving the server a chance to clean up its sockets before it connects again.

Anyway, it might help (as a band-aid fix, at least) or it might not. I suggested it over at the PrimeGrid chat room thingie; hopefully Lennart will see it shortly and give it a try. If that doesn't help, as a last resort he can go back to managing his cores manually, without a proxy.
2008-11-22, 12:20   #64
gd_barnes
May 2007
Kansas; USA

10100010011011₂ Posts

Yep, all of my quads were sleeping too. Argh! I just now killed a couple of them, changed them to once=1 to clear out the current pair on port 400, and restarted those. No luck...can't even clear out the single pair in the cache.

I know where Lennart is coming from. I now have to change 42 total cores over, and the way I have things set up, I try to keep different servers in different folders and make sure they have different GUI port numbers so they don't step on each other when I'm trying to clear one out while starting on the other. So it's not quite the quick process of copying and then cutting and pasting. This promises to be an hour-long task or more. If I had 68 cores or more, I'd likely spend 2 hours.
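
The bookkeeping described above (one folder per client, a distinct GUI port for each) could be planned with a small script before touching any machine. This is only a sketch: the folder naming scheme and the starting GUI port of 7100 are invented for illustration, and the script does not read or write any LLRnet configuration. The 32/10 split is the one proposed earlier for ports 4000 and 443.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ClientSlot:
    folder: str       # hypothetical folder name for this client
    server_port: int  # server port the client should talk to
    gui_port: int     # GUI port, unique across all planned clients


def plan_clients(cores_per_server: Dict[int, int],
                 first_gui_port: int = 7100) -> List[ClientSlot]:
    """Assign every core a folder and a GUI port that no other core uses."""
    slots: List[ClientSlot] = []
    gui_port = first_gui_port
    for server_port, cores in cores_per_server.items():
        for core in range(1, cores + 1):
            folder = f"port{server_port}/core{core:02d}"
            slots.append(ClientSlot(folder, server_port, gui_port))
            gui_port += 1
    return slots


plan = plan_clients({4000: 32, 443: 10})
print(len(plan), "clients planned")
print(plan[0])
print(plan[-1])
```

Because the GUI port counter never resets between servers, two clients can never end up on the same GUI port, which is the "don't step on each other" requirement.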

So there is definitely a need for a proxy server. If there is some compromise where the heavy hitters only cache 25 pairs at a time for their proxy server, then let's work with them on setting up that kind of thing.


Gary

Last fiddled with by gd_barnes on 2008-11-22 at 12:21
2008-11-22, 12:29   #65
IronBits
I ♥ BOINC!
Oct 2002
Glendale, AZ. (USA)

3·7·53 Posts

Lennart, it is not your fault! It's the communication between the proxy and the server.

My server will be down and out for a while, because I have to get some sleep.
I'll check on the situation when I get up from my nap, and hopefully it will be over by then.

Lennart, you can put 25 fast cores on a proxy to see if that helps...
2008-11-22, 12:36   #66
mdettweiler
A Sunny Moo
Aug 2007
USA (GMT-5)

1869₁₆ Posts

Quote:
Originally Posted by IronBits
Lennart, it is not your fault! It's the communication between the proxy and the server.

My server will be down and out for a while, because I have to get some sleep.
I'll check on the situation when I get up from my nap, and hopefully it will be over by then.

Lennart, you can put 25 fast cores on a proxy to see if that helps...

Hey, I just had another idea: maybe somebody (i.e. somebody with at least a modicum of Lua programming knowledge) could write a small piece of code to insert in place of whatever makes up the LLRnet proxy server's "send a dump of results to the server" code, which essentially does the following:

for(i = 0; i < *number of results in tosend-proxy.txt*; i += 50) {
send the next 50 results to server;
wait 10 seconds;
}

Essentially the above pseudo-pseudocode transmogrified into Lua.
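
For anyone who wants to see the throttling idea run, here is a minimal sketch in Python rather than Lua. It is not LLRnet code: `send` stands in for whatever the proxy actually does to upload results, and the function names, the batch size of 50 and the 10-second pause are just the numbers from the pseudocode above.

```python
import time
from typing import Callable, Iterator, List, Sequence


def batches(items: Sequence[str], size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def send_in_batches(results: Sequence[str],
                    send: Callable[[Sequence[str]], None],
                    batch_size: int = 50,
                    pause_seconds: float = 10.0,
                    sleep: Callable[[float], None] = time.sleep) -> int:
    """Send results in small batches, pausing between batches (not after the last)."""
    sent = 0
    for index, batch in enumerate(batches(results, batch_size)):
        if index > 0:
            sleep(pause_seconds)  # give the server time to release its sockets
        send(batch)
        sent += 1
    return sent


# Demo with a stand-in sender and no real waiting.
demo: List[str] = [f"result {n}" for n in range(120)]
send_in_batches(demo, send=lambda b: print(f"sending {len(b)} results"),
                pause_seconds=0)
```

The pause sits between batches only, so a dump that fits in one batch goes out immediately. Whether 50 results and 10 seconds are the right values would have to be found by experiment against a real server.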

Anyway, gotta run...

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.