mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > No Prime Left Behind

Reply
 
Thread Tools
Old 2012-02-22, 15:03   #56
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

3×2,083 Posts
Default

Quote:
Originally Posted by odicin View Post
Not only at this time, I try to remember around some months ago (before avx) a similar problem with the older 3.8.4 llr and some AMD CPU's at PG. In fact the problem was the used gnum version so Primegrid updated all there stock apps to 3.8.6 llr which uses a newer gnum version.

Rebirthers AVX build of llr uses the early beta of gnum 27.3, the latest llr from jean penne still use 26.5. I suppose this is the problem there.

You can track down the problem with rebirthers non-avx x64 llr build. This one also use gnum 27.3 instead of 26.5.

Regards Odi
Hmm...so the non-AVX x64 builds are based on gwnum 27.3 as well? How does that work? Is there a compile flag or something that can enable/disable AVX for a given (v27.x) gwnum version?
mdettweiler is offline   Reply With Quote
Old 2012-02-22, 15:33   #57
Lennart
 
Lennart's Avatar
 
"Lennart"
Jun 2007

25·5·7 Posts
Default

I have big problem to reach the server ! Only 50-60% is conecting but there are no problem to reach other server on PG.

Are there some problem or some manage ongoing ?


Lennart

Last fiddled with by Lennart on 2012-02-22 at 15:33
Lennart is offline   Reply With Quote
Old 2012-02-22, 15:36   #58
odicin
 
Sep 2011
Potsdam, Germany

24×7 Posts
Default

Correct. You can swap to another gnum lib, because it's not a part of llr itself. But I don't know how exactly it works, better you ask this rebirther or rouge.

Regards Odi
odicin is offline   Reply With Quote
Old 2012-02-22, 15:41   #59
rogue
 
rogue's Avatar
 
"Mark"
Apr 2003
Between here and the

5,701 Posts
Default

Quote:
Originally Posted by odicin View Post
Correct. You can swap to another gnum lib, because it's not a part of llr itself. But I don't know how exactly it works, better you ask this rebirther or rouge.

Regards Odi
gwnum is linked into the executable.
rogue is offline   Reply With Quote
Old 2012-02-22, 15:42   #60
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

186916 Posts
Default

Quote:
Originally Posted by Lennart View Post
I have big problem to reach the server ! Only 50-60% is conecting but there are no problem to reach other server on PG.

Are there some problem or some manage ongoing ?


Lennart
Yeah, I've been monitoring this. mysqld is (as you might imagine) under fairly heavy load, this being the highest amount of activity we've ever supported on a PRPnet server. Gary had three PRPnet clients running on "spare" cores of the server machine--well, under normal load they would be spare. I shut down one of them to give MySQL more headroom, and will keep an eye on things to shut down more if needed.

Edit: Okay, it looks like it's not quite enough--I'm still seeing fallbacks to other servers on my own clients. I went ahead and shut down all the PRPnet clients running on the server--to be safe we should probably leave it completely free to do its server job during the rally.

Last fiddled with by mdettweiler on 2012-02-22 at 15:47
mdettweiler is offline   Reply With Quote
Old 2012-02-22, 15:44   #61
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

624910 Posts
Default

Quote:
Originally Posted by rogue View Post
gwnum is linked into the executable.
I guess what I was wondering, though, was whether it's possible to build both AVX and non-AVX versions of LLR using gwnum v27.3? I had understood that the non-AVX builds were based on v26.x, but odicin mentioned Rebirther's non-AVX x64 build being based on v27 so I wanted to make sure I had that correctly.
mdettweiler is offline   Reply With Quote
Old 2012-02-22, 16:18   #62
Lennart
 
Lennart's Avatar
 
"Lennart"
Jun 2007

46016 Posts
Default

Quote:
Originally Posted by mdettweiler View Post
Yeah, I've been monitoring this. mysqld is (as you might imagine) under fairly heavy load, this being the highest amount of activity we've ever supported on a PRPnet server. Gary had three PRPnet clients running on "spare" cores of the server machine--well, under normal load they would be spare. I shut down one of them to give MySQL more headroom, and will keep an eye on things to shut down more if needed.

Edit: Okay, it looks like it's not quite enough--I'm still seeing fallbacks to other servers on my own clients. I went ahead and shut down all the PRPnet clients running on the server--to be safe we should probably leave it completely free to do its server job during the rally.
Have you optimized the tables ? Close the server and all deamons and restart mysql. Then start prpnet servers again.

Lennart
Lennart is offline   Reply With Quote
Old 2012-02-22, 16:56   #63
rebirther
 
rebirther's Avatar
 
Sep 2011
Germany

2,357 Posts
Default

Quote:
Originally Posted by mdettweiler View Post
I guess what I was wondering, though, was whether it's possible to build both AVX and non-AVX versions of LLR using gwnum v27.3? I had understood that the non-AVX builds were based on v26.x, but odicin mentioned Rebirther's non-AVX x64 build being based on v27 so I wanted to make sure I had that correctly.
The non avx version is built with gwnum 26.6. AVX with latest 27.3.

Last fiddled with by rebirther on 2012-02-22 at 16:56
rebirther is offline   Reply With Quote
Old 2012-02-22, 17:56   #64
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

3×2,083 Posts
Default

Quote:
Originally Posted by Lennart View Post
Have you optimized the tables ? Close the server and all deamons and restart mysql. Then start prpnet servers again.

Lennart
Hmm, okay...I'll try to do that in 20 minutes or so when the next stats refresh minutes. (BTW: what do you mean by optimize the tables? I don't know too much about MySQL so I don't know how to do that; does simply restarting mysqld do it automatically?)

Notice to all: all noprimeleftbehind.net PRPnet servers will be going down briefly at about 12:20 PM server time. The plan is to have them back online within 5-10 minutes at the max, and this should take care of the issues we've been seeing lately.
mdettweiler is offline   Reply With Quote
Old 2012-02-22, 18:25   #65
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

3·2,083 Posts
Default

Okay, I restarted mysqld and all PRPnet servers are back online again. Hopefully that will do the trick.

Edit: looking at the stats from the last few hours, it seems that shutting down all the PRPnet clients running on the server did help, at least some--I notice a marked increase in the number of pairs/hour returned. Between that and restarting mysqld, we should be good to go now.

Edit #2: Ah yes, I'm seeing much less CPU usage from mysqld now--on the order of 15-50% CPU (on the Linux scale where 100%=1 core), versus 150-200% before the mysqld restart.

Last fiddled with by mdettweiler on 2012-02-22 at 18:28
mdettweiler is offline   Reply With Quote
Old 2012-02-22, 19:45   #66
gd_barnes
 
gd_barnes's Avatar
 
May 2007
Kansas; USA

34·53 Posts
Default

Is there some sort of "memory leak" so to speak of the CPU usage that causes this to "build up"?
gd_barnes is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
PRPnet rally Apr. 18th-25th gd_barnes No Prime Left Behind 17 2012-04-26 11:54
LLRnet/PRPnet rally Oct. 27th-Nov. 3rd mdettweiler No Prime Left Behind 33 2010-12-24 19:16
LLRnet/PRPnet rally June 4th-6th gd_barnes No Prime Left Behind 61 2010-07-30 17:28
Rally Jan. 23rd-25th gd_barnes No Prime Left Behind 89 2009-01-25 22:59
LLRnet server rally port 300 May 23rd-25th gd_barnes No Prime Left Behind 172 2008-06-04 19:21

All times are UTC. The time now is 14:24.

Thu May 28 14:24:14 UTC 2020 up 64 days, 11:57, 2 users, load averages: 1.30, 1.40, 1.46

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.