mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2021-10-12, 09:36   #474
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

24×613 Posts
Default

Not sure if this was reported or maybe even fixed in the last versions, I still have few computers using v30.3, and sometimes, for whatever reasons, they can't connect to the server (it may be network/rights related, my IT guys get paranoid sometimes, which is not a bad thing). The worktodo is therefore exhausted and the computers are waiting to get work for days (usually, over the weekend, when I can't attend them).

What I found out repeatedly is that in such case the computers can't connect to the server ever, even if P95 is restarted, but they will connect to the server if the spool file is deleted (moved to another folder), even if that is done during P95 runs. Putting the spool file back - error, can't connect to the server. Taking it out, no issue, connect, get new assignments, put it back, can't connect (but the work is progressing normal, and proof files are stacked up locally - especially for PRPCF assignments, which take little time to finish).

First time (second time, third time) we assumed that the spool file got malformed or it suffered some damage, so we just deleted it and continue from there. We tried first to recover unreported stuff from it, using a hex editor (which was quite successful). But the issue re-appeared few more times, therefore we decided to zip such file and keep it.

The file will crash the P95 connection if we unzip it in P95 folder, regardless of computer (i.e. if we put it on another computer, that will not be able to connect to the server and get and/or report work either).

@George: do you need it? (maybe to track what happens, etc), the zip is 7360 bytes (i.e. not big).

Last fiddled with by LaurV on 2021-10-12 at 09:41
LaurV is offline   Reply With Quote
Old 2021-10-12, 13:32   #475
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

171E16 Posts
Default

If the worker window estimates 31 days to go on a 50M fft CERT, why does the client tell the PrimeNet server it has one day to go?
If it has a month of high priority 50M fft CERT work to do, why does it interrupt that to run unneeded-for-a-month-at-least 3360K and 3456K benchmarks?
Will v30.7bx address these?
Are there settings I can apply to address them in v30.6b4?
Attached Thumbnails
Click image for larger version

Name:	wrong eta sent to server.png
Views:	36
Size:	203.1 KB
ID:	25916  
kriesel is offline   Reply With Quote
Old 2021-10-12, 14:23   #476
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

2·11·349 Posts
Default

Quote:
Originally Posted by LaurV View Post
The file will crash the P95 connection if we unzip it in P95 folder, regardless of computer (i.e. if we put it on another computer, that will not be able to connect to the server and get and/or report work either).

@George: do you need it? (maybe to track what happens, etc), the zip is 7360 bytes (i.e. not big).
Sure. PM me and I will look into it.
Prime95 is offline   Reply With Quote
Old 2021-10-12, 14:40   #477
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

2×11×349 Posts
Default

Quote:
Originally Posted by kriesel View Post
If the worker window estimates 31 days to go on a 50M fft CERT, why does the client tell the PrimeNet server it has one day to go?
If it has a month of high priority 50M fft CERT work to do, why does it interrupt that to run unneeded-for-a-month-at-least 3360K and 3456K benchmarks?
Will v30.7bx address these?
Are there settings I can apply to address them in v30.6b4?
30.7b5 will send the estimated completion date as shown in Test/Status (which in your case is much sooner than 31 days). Auto-bench, test/status, and server estimated completion dates will all assume CERT work executes before other work types.

For now, in 30.6b4 you can turn auto-bench off.
Prime95 is offline   Reply With Quote
Old 2021-10-12, 16:08   #478
ixfd64
Bemusing Prompter
 
ixfd64's Avatar
 
"Danny"
Dec 2002
California

32·269 Posts
Default

Quote:
Originally Posted by Prime95 View Post
FYI2: Brent-Suyama is no more.
I noticed it's not mentioned in undoc.txt anymore. I'm guessing it's been completely removed from Prime95?
ixfd64 is offline   Reply With Quote
Old 2021-10-12, 16:22   #479
Viliam Furik
 
Viliam Furik's Avatar
 
"Viliam Furík"
Jul 2018
Martin, Slovakia

10110000102 Posts
Default

Quote:
Originally Posted by Prime95 View Post
30.7b5 will send the estimated completion date as shown in Test/Status (which in your case is much sooner than 31 days). Auto-bench, test/status, and server estimated completion dates will all assume CERT work executes before other work types.

For now, in 30.6b4 you can turn auto-bench off.
But that's not the correct completion date. The 31-day estimate by the worker is the correct one.
Viliam Furik is offline   Reply With Quote
Old 2021-10-12, 18:39   #480
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

171E16 Posts
Default

Quote:
Originally Posted by Prime95 View Post
30.7b5 will send the estimated completion date as shown in Test/Status (which in your case is much sooner than 31 days). Auto-bench, test/status, and server estimated completion dates will all assume CERT work executes before other work types.

For now, in 30.6b4 you can turn auto-bench off.
Thanks. Looking forward to b5 or 6.
From prime.log:

Code:
[Fri Oct  8 09:13:18 2021 - ver 30.6]
Updating computer information on the server
Sending expected completion date for M843112609: Oct  8 2021
...
[Tue Oct 12 08:23:44 2021 - ver 30.6]
Updating computer information on the server
Sending expected completion date for M63367621: Oct 16 2021
Sending expected completion date for M843112609: Oct 12 2021
Oct 12 ~1:15 pm local, downed briefly to update to v30.7b4 (can't download v30.7b5 yet)
otherwise it's been running 24/7, and is now ~12.57% complete.
So linear extrapolation from ~4.17 days to 12.57%, 12.57/4.17 * 87.43 remaining ~ 29.0 days more, Nov 10.

I note during adding to prime.txt,
AutoBench=0
that v30.6b4 had apparently flipped my manual prime.txt setting from
WorkPreference=155
to
WorkPreference=151
without my knowledge. Reset that while in the editor.

Upon resumption of the big CERT with V30.7b4, test/status claims completion late on Oct 15, ~3.3 days. Better than claiming same-day or next-day, but still seems ~8.8x too soon.
And what it reports to the server is next-day.
Code:
[Tue Oct 12 13:37:07 2021 - ver 30.7]
Exchanging program options with server
Updating computer information on the server
Sending expected completion date for M63367621: Oct 17 2021
Sending expected completion date for M843112609: Oct 13 2021

Last fiddled with by kriesel on 2021-10-12 at 18:43
kriesel is offline   Reply With Quote
Old 2021-10-26, 08:30   #481
kruoli
 
kruoli's Avatar
 
"Oliver"
Sep 2017
Porta Westfalica, DE

10111111102 Posts
Default

Quote:
Originally Posted by kruoli View Post
It is completely stuck, every hour it states:
Code:
[Worker #3 Oct 25 18:26] Restarting worker to do priority work.
[Worker #3 Oct 25 18:26] Resuming.
[Worker #3 Oct 25 18:26] No work to do at the present time.  Waiting.
I release this reservation.
What could have caused that certification to be unable to begin?

This was 30.6b3, Windows 7, Intel i7 3630QM. CPU-hours was set to 8.

Additional information:
Quote:
Originally Posted by kruoli View Post
I have CertWork=1, upload and download limits to really high values, CertWorker is set to the according worker etc. […] Prime95 shows no network activity.
kruoli is offline   Reply With Quote
Reply

Thread Tools


All times are UTC. The time now is 10:24.


Wed Dec 1 10:24:57 UTC 2021 up 131 days, 4:53, 1 user, load averages: 1.03, 1.16, 1.16

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.