mersenneforum.org  

Old 2012-10-29, 00:17   #89
swl551
 
Aug 2012
New Hampshire

2³·101 Posts
Stalled Process detection now available

1.6 proto.3 is now available.

New in this build:
1. Detection of stalled processes, with an email alert when one is found
2. Minimize to System Tray



Stalled process detection works by tracking the newest checkpoint file created by mfaktO/C. If that .ckp file is unchanged for 3 update cycles, the process is considered stalled and an email is sent.
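
For anyone curious how this kind of check can work, here is a rough Python sketch of the idea described above. It is not the actual MISFIT code. The names `newest_checkpoint` and `StallTracker` are made up for illustration, and it assumes that "unchanged" means the newest .ckp file has the same name, modification time, and size as on the previous cycle. The first observation only sets the baseline.

```python
from pathlib import Path
from typing import Optional, Tuple

STALL_CYCLES = 3  # from the post: unchanged for 3 update cycles

Snapshot = Tuple[str, float, int]  # (file name, mtime, size) -- illustrative choice


def newest_checkpoint(folder: str) -> Optional[Snapshot]:
    """Return a snapshot of the most recently modified .ckp file, or None."""
    files = list(Path(folder).glob("*.ckp"))
    if not files:
        return None
    newest = max(files, key=lambda p: p.stat().st_mtime)
    st = newest.stat()
    return (newest.name, st.st_mtime, st.st_size)


class StallTracker:
    """Counts consecutive update cycles in which the snapshot did not change."""

    def __init__(self, cycles: int = STALL_CYCLES) -> None:
        self.cycles = cycles
        self.last: Optional[Snapshot] = None
        self.unchanged = 0

    def update(self, snapshot: Optional[Snapshot]) -> bool:
        """Record one cycle; return True once the snapshot has stayed
        unchanged for `cycles` consecutive cycles."""
        if snapshot is not None and snapshot == self.last:
            self.unchanged += 1
        else:
            # New or missing checkpoint: reset the baseline (assumption:
            # a missing checkpoint is not treated as a stall).
            self.last = snapshot
            self.unchanged = 0
        return self.unchanged >= self.cycles
```

Each "update cycle" would call `tracker.update(newest_checkpoint(folder))` and send the email when it returns True.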

How to test: Stop one of your mfaktO/C instances and click the "Update Stats" button 3 times, and you will get the email. (Test the emailer configuration first!)

The "Auto Update" interval controls how quickly detection occurs. I recommend a 15-minute interval.


Please send me any comments or bug reports. I really speed coded today due to the upcoming storm at my current location. Expecting NO power tomorrow and coastal flooding..... This could be my last.........

Get from SkyDrive http://sdrv.ms/QsaP9Y

Last fiddled with by swl551 on 2012-10-29 at 00:20
Old 2012-10-29, 00:54   #90
flashjh
 
"Jerry"
Nov 2011
Vancouver, WA

1,123 Posts

Thanks for working on this.

Last fiddled with by flashjh on 2012-10-29 at 00:55
Old 2012-10-29, 01:33   #91
kracker
 
"Mr. Meeseeks"
Jan 2012
California, USA

2³×271 Posts

Thanks! Just so you know, the system tray icon is there, but the window doesn't minimize to it.
Old 2012-10-29, 01:44   #92
swl551
 
Aug 2012
New Hampshire

2³·101 Posts

Quote:
Originally Posted by kracker
Thanks! Just so you know, the system tray icon is there, but the window doesn't minimize to it.
I don't understand. Please clarify.
Old 2012-10-29, 02:40   #93
kracker
 
"Mr. Meeseeks"
Jan 2012
California, USA

2³·271 Posts

Quote:
Originally Posted by swl551
I don't understand. Please clarify.

OK, what I mean is that the only difference is that there is an icon. Minimizing doesn't make the window "disappear to the system tray", so, for example, when I press Alt+Tab it's still there in the taskbar.
Old 2012-10-29, 03:12   #94
swl551
 
Aug 2012
New Hampshire

808₁₀ Posts

Quote:
Originally Posted by kracker
OK, what I mean is that the only difference is that there is an icon. Minimizing doesn't make the window "disappear to the system tray", so, for example, when I press Alt+Tab it's still there in the taskbar.

Did you enable minimize to system tray in the MISC settings?
Old 2012-10-29, 03:26   #95
kracker
 
"Mr. Meeseeks"
Jan 2012
California, USA

2³·271 Posts

Quote:
Originally Posted by swl551
Did you enable minimize to system tray in the MISC settings?

I didn't.

Works, thanks
Old 2012-10-29, 03:40   #96
swl551
 
Aug 2012
New Hampshire

2³×101 Posts

Quote:
Originally Posted by kracker
I didn't.

Works, thanks

Well, at least it was an easy fix!!!!
Old 2012-10-29, 16:02   #97
swl551
 
Aug 2012
New Hampshire

2³×101 Posts
1.6.5 (beta) now available. (contains bug fix)

Fixes a bug where a false "stalled process" alarm is sent after a rebalance occurs.

I improved the determination of a "stalled process" to include a test of the .ckp file's age. The file must be at least 5 minutes stale and unchanged after 3 test cycles. Using the file's age prevents false alarms regardless of how rapidly the "update stats" event occurs.
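
To illustrate the combined rule, here is a small Python sketch. It is only a guess at the logic and not the MISFIT source. The function name `is_stalled` and its parameters are hypothetical, and it assumes "stale" means the time since the .ckp file's last modification.

```python
import time
from typing import Optional

MIN_STALE_SECONDS = 5 * 60  # from the post: at least 5 minutes stale
REQUIRED_CYCLES = 3         # from the post: unchanged after 3 test cycles


def is_stalled(checkpoint_mtime: float,
               unchanged_cycles: int,
               now: Optional[float] = None,
               min_stale_seconds: float = MIN_STALE_SECONDS,
               required_cycles: int = REQUIRED_CYCLES) -> bool:
    """Both conditions must hold: the file is old enough AND it has been
    seen unchanged on enough consecutive test cycles."""
    if now is None:
        now = time.time()
    age = now - checkpoint_mtime
    return age >= min_stale_seconds and unchanged_cycles >= required_cycles
```

With the age condition, three quick clicks of "Update Stats" within a minute no longer count as a stall, because the file is not yet 5 minutes old.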


Get from SkyDrive http://sdrv.ms/QsaP9Y

Last fiddled with by swl551 on 2012-10-29 at 16:02
Old 2012-10-30, 03:57   #98
flashjh
 
"Jerry"
Nov 2011
Vancouver, WA

2143₈ Posts

Quote:
Originally Posted by swl551
Fixes a bug where a false "stalled process" alarm is sent after a rebalance occurs.

I improved the determination of a "stalled process" to include a test of the .ckp file's age. The file must be at least 5 minutes stale and unchanged after 3 test cycles. Using the file's age prevents false alarms regardless of how rapidly the "update stats" event occurs.


Get from SkyDrive http://sdrv.ms/QsaP9Y
Testing is going well. Everything is working.
Old 2012-10-30, 09:06   #99
swl551
 
Aug 2012
New Hampshire

328₁₆ Posts
More stalled detection tweaks.

I moved stalled process detection to its own timer, which runs at a fixed 10-minute interval. Decoupling it from the Update Stats button makes detection more reliable, since the button can be clicked at irregular intervals and would trigger unnecessary process checks.

So to test stalled process detection, you now have to stop an instance of mfaktO/C and let mfaktXapp run for about 30 minutes while it performs 3 tests 10 minutes apart. Clicking "Update Stats" no longer forces the test.
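
A dedicated timer like the one described can be sketched in Python as below. This is an illustration only, not how mfaktXapp implements it. The class name `PeriodicCheck` is made up, and the check runs on a background thread with a fixed delay between runs.

```python
import threading
from typing import Callable

CHECK_INTERVAL_SECONDS = 10 * 60  # from the post: fixed 10-minute interval


class PeriodicCheck:
    """Runs `check` repeatedly on its own thread, independent of any UI button."""

    def __init__(self, interval_seconds: float, check: Callable[[], None]) -> None:
        self._interval = interval_seconds
        self._check = check
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def start(self) -> None:
        self._thread.start()

    def stop(self) -> None:
        """Signal the loop to end and wait for the thread to finish."""
        self._stop.set()
        self._thread.join()

    def _run(self) -> None:
        # Event.wait returns False on timeout and True once stop() was called,
        # so the check fires after every full interval until stopped.
        while not self._stop.wait(self._interval):
            self._check()
```

Usage would look like `PeriodicCheck(CHECK_INTERVAL_SECONDS, run_stall_check).start()`, where `run_stall_check` is a placeholder for whatever function performs one detection cycle.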

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.