mersenneforum.org  

Old 2005-08-22, 11:33   #1
rudi_m
 
Jul 2005

2×7×13 Posts
SUM(INPUTS) != SUM(OUTPUTS)

Hi,
Today I found the following error in my logs:

.............
[Aug 17 22:12] Iteration: 1500000 / 34746359 [4.31%]. Per iteration time: 0.079 sec.
Iteration: 1504331/34746359, ERROR: SUM(INPUTS) != SUM(OUTPUTS), 1.351257879789209e+16 != 1.335852066589149e+16
Possible hardware failure, consult the readme.txt file.
Continuing from last save file.
Waiting five minutes before restarting.
Resuming primality test of M34746359 at iteration 1504257 [4.32%]
[Aug 17 23:27] Iteration: 1550000 / 34746359 [4.46%]. Per iteration time: 0.083 sec
..............

It's the only error that has occurred on this PIV machine, and all the standard stress tests have passed without problems so far.

My question is:
Does it make sense to let this LL test keep running, or should I restart it?
I wonder why it resumed the test from just 74 iterations before the error occurred (1504331 - 1504257). Is it likely that the error occurred during those last iterations at all, or wouldn't it be better to resume from the last (30 minutes older) backup file instead?

Old 2005-08-22, 12:29   #2
S80780
 
Jan 2003
far from M40

53 Posts

Hi, rudi_m!

As can be seen from the excerpt you posted, P95 continued from the last save file which contained the interim result of iteration 1504257:
Quote:
Originally Posted by rudi_m
Continuing from last save file.
Waiting five minutes before restarting.
Resuming primality test of M34746359 at iteration 1504257 [4.32%]
If the failure doesn't recur, everything should be fine.

To clarify your last question: if the "minutes between diskwrites" option is set to "30", it means that every 30 minutes after starting/continuing a task, a save file is stored, overwriting the former save file for this task.
So if a failure occurs, you lose at most 30 minutes, if the save file isn't corrupted.
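To put numbers on that, here is a small sketch using only the figures from the log in the first post. The 30-minute interval is the example value mentioned above and not necessarily the actual setting on that machine, and the variable names are made up for illustration.

```python
# Figures copied from the log excerpt in the first post.
error_iteration = 1504331
resume_iteration = 1504257
sec_per_iteration = 0.079

# Assumed setting: 30 minutes between disk writes, as in the example above.
minutes_between_writes = 30

# Work that had to be redone after this particular error.
iterations_redone = error_iteration - resume_iteration
seconds_redone = iterations_redone * sec_per_iteration

# Upper bound on lost work if the error hits just before the next save.
worst_case_iterations = int(minutes_between_writes * 60 / sec_per_iteration)

print(iterations_redone)  # 74
print(round(seconds_redone, 1), "seconds redone")
print(worst_case_iterations, "iterations at most per save interval")
```

So in this case only a few seconds of work were repeated, well under the worst case for a 30-minute interval.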

HTH

Benjamin
Old 2005-08-22, 15:33   #3
cheesehead
 
"Richard B. Woods"
Aug 2002
Wisconsin USA

7692₁₀ Posts

Quote:
Originally Posted by rudi_m
Is it likely that the error occurred during those last iterations at all, or wouldn't it be better to resume from the last (30 minutes older) backup file instead?
The program actually does the check for sum(outputs) = sum(inputs) at the end of every iteration. It issues the error message immediately when the difference exceeds a threshold. Going back even just one iteration before the erroneous one reaches a point at which the calculation was okay. The last savefile is always at least one iteration before the erroneous one.
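To illustrate the kind of per-iteration check described here, below is a rough sketch. It rests on an assumption that is not stated in this thread: that the check relies on the identity that, for a cyclic self-convolution (the core of an FFT-based squaring), the sum of the outputs equals the square of the sum of the inputs. This is not Prime95's code; the function names, the tolerance and the test data are invented for the example.

```python
def cyclic_square(a):
    """Cyclic self-convolution: c[k] sums a[i] * a[j] over all (i + j) % n == k."""
    n = len(a)
    c = [0] * n
    for i in range(n):
        for j in range(n):
            c[(i + j) % n] += a[i] * a[j]
    return c


def sums_agree(inputs, outputs, rel_tol=1e-9):
    """True if sum(outputs) matches sum(inputs) squared within rel_tol (assumed threshold)."""
    expected = float(sum(inputs)) ** 2
    actual = float(sum(outputs))
    return abs(expected - actual) <= rel_tol * max(abs(expected), 1.0)


inputs = [3, -1, 4, 1, -5, 9, 2, -6]
good = cyclic_square(inputs)

bad = list(good)
bad[3] += 1000  # simulate a hardware glitch corrupting one output word

print(sums_agree(inputs, good))  # True
print(sums_agree(inputs, bad))   # False
```

Because a check like this can run after every iteration, a mismatch points at the iteration that just finished, which is why any save file written earlier holds a state from before the bad iteration.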

Old 2005-08-22, 15:42   #4
rudi_m
 
Jul 2005

2·7·13 Posts

Quote:
Originally Posted by S80780
If the failure doesn't recur, everything should be fine.
OK, thanks. I hope the computer just had a bad day and is fine again now :)
Quote:
To clarify your last question: if the "minutes between diskwrites" option is set to "30", it means that every 30 minutes after starting/continuing a task, a save file is stored, overwriting the former save file for this task.
So if a failure occurs, you lose at most 30 minutes, if the save file isn't corrupted.
OK, I know, but before overwriting, it makes a second backup file
(TwoBackupFiles=1 in prime.ini).
So I thought it would be safer to resume from the older backup.

-rw-r--r-- 1 rudi users 8388630 Aug 22 17:23 pY746359
-rw-r--r-- 1 rudi users 8388630 Aug 22 16:53 qY746359
(the 2nd file is the 30 minutes older backup)
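For anyone who wants to check which save file is the older one without reading the listing by eye, here is a minimal sketch that sorts the files by modification time. It only inspects timestamps and says nothing about how the program itself chooses a file; the helper name and the demo timestamps are invented, and the demo runs in a temporary directory with empty stand-in files.

```python
import os
import tempfile
from pathlib import Path


def save_files_oldest_first(directory, exponent_suffix):
    """Return files whose names end with exponent_suffix, oldest mtime first."""
    candidates = [p for p in Path(directory).iterdir()
                  if p.is_file() and p.name.endswith(exponent_suffix)]
    return sorted(candidates, key=lambda p: p.stat().st_mtime)


# Demo in a throwaway directory, reusing the two file names from the listing above.
with tempfile.TemporaryDirectory() as tmp:
    # The second file gets an mtime 1800 seconds (30 minutes) earlier.
    for name, mtime in (("pY746359", 2_000_000_000), ("qY746359", 1_999_998_200)):
        path = Path(tmp) / name
        path.write_bytes(b"")
        os.utime(path, (mtime, mtime))
    ordered = [p.name for p in save_files_oldest_first(tmp, "746359")]

print(ordered)  # ['qY746359', 'pY746359']
```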

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.