mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2012-04-30, 13:04   #56
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Quote:
Originally Posted by BrianOC View Post
Run 8 theards and on SUM(INPUTS) erorr checking and Round off error checking in both.
When you say both, you mean in both 27.6 and 27.4, or do you mean in both 27.6.2 and 27.6.3? (And that's fine about your English, we can (mostly) understand enough, or at least I can )
Dubslow is offline   Reply With Quote
Old 2012-04-30, 14:56   #57
BrianOC
 
Apr 2012

11112 Posts
Default

Quote:
Originally Posted by Dubslow View Post
When you say both, you mean in both 27.6 and 27.4, or do you mean in both 27.6.2 and 27.6.3? (And that's fine about your English, we can (mostly) understand enough, or at least I can )
Test 3. (27.6.3) Crash again same time 2hour30 minutes about.

Image: http://i48.tinypic.com/2lt2bmo.png

passed 10 run 11 crashed.
[Mon Apr 30 14:18:05 2012] start, [Mon Apr 30 16:36:04 2012] crash

I decided for the time being using 27.4 stable

Last fiddled with by BrianOC on 2012-04-30 at 15:04
BrianOC is offline   Reply With Quote
Old 2012-04-30, 15:15   #58
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Quote:
Originally Posted by BrianOC View Post
Test 3. (27.6.3) Crash again same time 2hour30 minutes about.

Image: http://i48.tinypic.com/2lt2bmo.png

passed 10 run 11 crashed.
[Mon Apr 30 14:18:05 2012] start, [Mon Apr 30 16:36:04 2012] crash

I decided for the time being using 27.4 stable
Hmm... now crashes as well as errors. I really don't know. George? I'll probably start another blend torture test and let it run for a few hours, but I haven't had any crashes doing production work (though I'm in Linux of course, not Windows).
Dubslow is offline   Reply With Quote
Old 2012-04-30, 19:22   #59
BrianOC
 
Apr 2012

3·5 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Hmm... now crashes as well as errors. I really don't know. George? I'll probably start another blend torture test and let it run for a few hours, but I haven't had any crashes doing production work (though I'm in Linux of course, not Windows).

Blend no crash in 6h44 min. Small FTT error always the same time/place.

Last fiddled with by BrianOC on 2012-04-30 at 19:27
BrianOC is offline   Reply With Quote
Old 2012-04-30, 21:30   #60
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Quote:
Originally Posted by BrianOC View Post
Blend no crash in 6h44 min. Small FTT error always the same time/place.
Yeah, I just ran blend for five hours, but have now started Small FFT testing. That time is roughly 2.5 hours into the test, right?
Dubslow is offline   Reply With Quote
Old 2012-04-30, 21:37   #61
BrianOC
 
Apr 2012

11112 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Yeah, I just ran blend for five hours, but have now started Small FFT testing. That time is roughly 2.5 hours into the test, right?
yes
BrianOC is offline   Reply With Quote
Old 2012-04-30, 22:52   #62
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default Umm... definitely a bug

I'm pretty sure a seg fault is code error
Code:
[Worker #1 Apr 30 17:44:52] Test 27, 160000 Lucas-Lehmer iterations of M630783 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #3 Apr 30 17:45:06] Test 26, 160000 Lucas-Lehmer iterations of M638977 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #2 Apr 30 17:45:14] Test 28, 160000 Lucas-Lehmer iterations of M622593 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #1 Apr 30 17:45:30] Test 28, 160000 Lucas-Lehmer iterations of M622593 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #3 Apr 30 17:45:43] Test 27, 160000 Lucas-Lehmer iterations of M630783 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #2 Apr 30 17:45:50] Test 29, 160000 Lucas-Lehmer iterations of M614399 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #1 Apr 30 17:46:07] Test 29, 160000 Lucas-Lehmer iterations of M614399 using AVX FFT length 50K, Pass1=640, Pass2=80.
[Worker #3 Apr 30 17:46:20] Test 28, 160000 Lucas-Lehmer iterations of M622593 using AVX FFT length 50K, Pass1=640, Pass2=80.
Segmentation fault
Again, all single threaded, MPrime-64, 2600K; it's build 3, but when I run -v I get 27.6.8.

Edit: @Jwb52z: Yeah, I'm starting to doubt it, but I haven't yet encountered any problems on my desktop doing production work, and it has continued to turn in good DCs, most recently around 10-12 hours ago; my laptop ran into the first round off bug, but admittedly has been borked the last few days.

Last fiddled with by Dubslow on 2012-04-30 at 22:58
Dubslow is offline   Reply With Quote
Old 2012-04-30, 22:56   #63
Jwb52z
 
Jwb52z's Avatar
 
Sep 2002

17·47 Posts
Default

These last few posts almost make me glad I haven't started using 27.6 yet.
Jwb52z is offline   Reply With Quote
Old 2012-04-30, 23:15   #64
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

7,537 Posts
Default

Quote:
Originally Posted by Dubslow View Post
it's build 3, but when I run -v I get 27.6.8.
The ".8" refers to the OS port # -- a pretty meaningless number for end-users. I'll update mprime to output the more human-readable string that prime95 uses and mprime uses when talking to the server.
Prime95 is offline   Reply With Quote
Old 2012-04-30, 23:19   #65
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

165618 Posts
Default

Quote:
Originally Posted by BrianOC and others View Post
Blend no crash in 6h44 min. Small FTT error always the same time/place.
At first blush it looks like a problem in the 16K FFT (actually the carry propagation code) when the exponent gets too small (only 10.5 bits per FFT word). This would never happen in the real world unless one explicitly requests using a longer FFT length than the default.

Nevertheless, it is a bug and I'll track it down. Thanks for finding it.
Prime95 is offline   Reply With Quote
Old 2012-05-01, 10:48   #66
BrianOC
 
Apr 2012

178 Posts
Default

Quote:
Originally Posted by Prime95 View Post
At first blush it looks like a problem in the 16K FFT (actually the carry propagation code) when the exponent gets too small (only 10.5 bits per FFT word). This would never happen in the real world unless one explicitly requests using a longer FFT length than the default.

Nevertheless, it is a bug and I'll track it down. Thanks for finding it.
BrianOC is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Prime95 version 27.3 Prime95 Software 148 2012-03-18 19:24
Prime95 version 26.3 Prime95 Software 76 2010-12-11 00:11
Prime95 version 25.5 Prime95 PrimeNet 369 2008-02-26 05:21
Prime95 version 25.4 Prime95 PrimeNet 143 2007-09-24 21:01
When the next prime95 version ? pacionet Software 74 2006-12-07 20:30

All times are UTC. The time now is 06:35.


Mon Aug 2 06:35:55 UTC 2021 up 10 days, 1:04, 0 users, load averages: 1.24, 1.25, 1.21

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.