mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   PrimeNet (https://www.mersenneforum.org/forumdisplay.php?f=11)
-   -   M40, what went wrong? (https://www.mersenneforum.org/showthread.php?t=672)

jeff8765 2003-06-14 03:26

I do not really know, but I think that it was just a freak accident and that there is no inhernet flaw in Prime 95. I think that we will probably just continue as we were before M40 was reported.

Prime95 2003-06-14 03:35

[quote="jocelynl"]We don't have any technical details. It would be nice to know what type of hardware was used and overclock if not.[/quote]

P4 1.6 GHz Willamette, not overclocked

Paulie 2003-06-14 06:22

Ah, v17. Lost a few tests, but hey, I'm still here. Thanks for the ride, it's bound to get only better in the next 10 years. ;) ;)

I only win at Roullete. :D :D :D

Paulie 2003-06-14 06:27

We keep looking for M40. :D :D :D

alienz 2003-06-14 08:34

:surprised:ops: :surprised:ops: prime 40 must be getting close, will it be this month???

trif 2003-06-14 09:35

I think it is obvious that [url=http://www.mersenneforum.org/viewtopic.php?p=2459&highlight=#2459]this[/url] is to blame for the whole "M40" situation. It's obvious the universe was having a hard time coping with this paradox, and it snapped in a very strange way.

I beg you Cheesehead, don't push things this far again or who knows what could happen. :shock:

Prime95 2003-06-14 19:15

Re: M40, what went wrong?
 
[quote="ewmayer"]Is it prime95 that does the integer conversion, or the compiler? If the former, why not just do the compare-with-zero in floating-point form.[/quote]

It was the C compiler. I've written a very easy assembly routine to check for NaN and infinity. This way I can be sure it will work on Linux, FreeBSD, OS/2, etc.

In 23.5 prime95 will check for NaN and infinity before checking for zero.

Prime95 2003-06-14 19:40

Re: M40, what went wrong?
 
[quote="ewmayer"]It seems EXTREMELY unlikely to me that a run would have valid (non-NaN) data every iteration, then go awry on the very last step, the rounding-and-carry-propagation following the final IFFT. Perhaps something slipped through the on-the-fly NaN checking. Or perhaps there were in fact no NaNs at all in the run in question, but the shift count related to the subtract-2 step (or some other datum used in the carry step) getting corrupted caused the residue to get zeroed in the carry step without triggering any roundoff warnings. Actually, since prime95 only does RO checking every so often (I believe every 100th iteration or so), it's possible there may in fact have been a suspiciously large RO error in the crucial carry step that got missed isn't it?[/quote]

Several facts to consider:

Every iteration prime95 sums all the input FFT values and all the output FFT values. If sum_input^2 != sum_outputs, then you get a SUM(INPUTS) != SUM(OUTPUTS) error. The output sum is also checked for NaN or infinity. If the sum is one of these values, then you get an ILLEGAL SUMOUT error. If carry-propogation & round-off generates NaNs, then that should be picked up on the next LL iteration.

The last 50 iterations of every LL test do the extra round-off error checking.

My records show 6 LL residue results of 0000000000000002. Thus, there obviously is some way a hardware error can zero the FFT data. I just don't see an easy way for this to happen. Any ideas?

Would a check each iteration for zero or two be a good idea?

smh 2003-06-14 19:51

[quote]Would a check each iteration for zero or two be a good idea?[/quote]

Depending on how much time that costs.

Maybe check it once every million iterations, or even better, prior to writing a new save file. If the residue at that point is 0 or 2, you can go back to the last save file.

asdf 2003-06-15 02:55

Is it possible for an LL test running accurately to go to 0, 1, or 2 before completion?

Prime95 2003-06-15 04:29

[quote="asdf"]Is it possible for an LL test running accurately to go to 0, 1, or 2 before completion?[/quote]

No. And a check every iteration would be real cheap.


All times are UTC. The time now is 04:35.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.