![]() |
|
|
#1 |
|
Sep 2003
2·5·7·37 Posts |
There are are several categories of exponents that could qualify for a second-wave release of first-time exponents:
1) Repeat the release of exponents according to the earlier criteria of error-prone machines with bad/(bad+good) >= 0.333 and bad >= 2. The previous release imposed a maximum of no more than 4 exponents per machine, so some exponents still remain (some machines have a large number), and presumably there may be a few others from newly-detected error-prone machines. Perhaps the limit of 4 exponents per machine should be raised, or perhaps the threshold of 0.333 could be lowered to 0.250. 2) Repeat the release of exponents according to "harmful" errorcodes (errorcode not 00000000, AB00AB00, or 00XX0000), which statistically indicates a 55-60% error rate. Within this category, there are about 300 exponents that show up under the Double Checking Avail column of http://www.mersenne.org/primenet/ (starting at 12.5M) which have expired in the last month alone. These were already re-released as first-time checks (many in June and July), got automatically converted to double-checks in September by the server sync, and have since expired. A few more such exponents expire every day. 3) Make use of uv3 information, where uv3 represents unverified exponents that need a triple-or-higher check. In general, whenever a machine has returned a lot of results that need triple checks, this is an indication that the machine is probably error-prone. Most machines have a 0% error rate, so if a machine has returned a number of results that require a triple check, it's much more likely that that machine has made N errors rather than N other different, independent machines have each made one error. However, a few machines such as GW/P4-1400 seem to specialize in doing double-checks of exponents for which the first result contained a "harmful" error code. Thus, they have accumulated a lot of triple-check-required exponents, but only because they have deliberately sought probably-erroneous results to double check. We can take advantage of the fact that HRF3.TXT now contains error-code information, and look for results that require a triple check AND have a harmful error code. Call these uv3h. So we can use an additional set of criteria to identify error-prone machines: (bad + uv3h) / (bad + uv3 + good) >= 0.333 and (bad + uv3h) >= 2 This prevents machines like GW/P4-1400 from being misidentified as error-prone, because although they have a lot of results that require a triple-check, few or none of them were returned with a harmful error code. |
|
|
|
|
|
#2 |
|
Aug 2002
2×3×53 Posts |
Is there any way to have a Double Check done on every machine that is still producing??
This would give an immediate verification of good machines versus bad. Any machine failing this Double Check would have another done quickly to confirm it error-prone or not. I would think that this would help find error-prone machines and confirm to crunchers that their machines are good. |
|
|
|
|
|
#3 | |
|
Sep 2003
2×5×7×37 Posts |
Quote:
I started a new thread to discuss this. |
|
|
|
|
|
|
#4 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
17·487 Posts |
GP2 - a.k.a the data miner,
Send me or post the list of exponents that have the bad error codes and I'll set them to be released (your case 2) Also, if you think you have enough data to back up your theories regarding identifying computers with at least a 33% error rate, then remove the 4 exponent restriction and send me a list of exponents to test. As to case 3, let's release the case 1 and case 2 exponents before casting a wider net for bad results. |
|
|
|
|
|
#5 | |
|
Sep 2003
2·5·7·37 Posts |
Quote:
The uv3 stuff (case 3) is more speculative at this point, I don't yet have the data to back it up although it seems very plausible. OK, here goes. First of all, recall the definitions: bad: count of verified-bad results returned by the machine. good: count of verified-good results returned by the machine. uv2: count of unverified results returned by the machine that need a 2nd check (no double check has been done). uv3: count of unverified results returned by the machine that need a 3rd-or-higher check (one or more unmatching double checks have been done). Based on the just-released Nov 9 2003 data files, the attached file shows the error-prone machines that fit the criteria bad/(bad+good) >= 0.333 bad >= 2 and the file shows only those machines with u2 > 0 For brevity, we show only the 230 error-prone machines that have uv2 > 0, not the full set of 1145 error-prone machines. Those remaining machines might contribute exponents only in the rarer case where triple checks are required because both original results are presumed not good. NOTE: the user names in this list of machines are "mapped" names. They don't necessarily correspond to the user name that was used when the result was returned to PrimeNet. Note uv2 is the count of how many exponents from that machine contribute to the release of exponents (excluding the rarer case of triple checks where both original results are not presumed good). We see that most machines contribute only a small number of exponents, but a few contribute dozens. |
|
|
|
|
|
|
#6 |
|
Sep 2003
1010000111102 Posts |
There are 1220 exponents that satisfy the criteria of case 1 (note we have lifted the restriction of only 4 exponents per machine) for exponents that need a 2nd check, limited to the range 10.5M - 19M.
Of these 381 are already assigned (many of these in the last release) and 46 are currently cleared, leaving 793 exponents. Note: if you sum up the uv2 column of the previous attachment, it adds up to 1436. The discrepancy is because there are 87 exponent that would have qualified but are below the 10.5M cutoff, and 129 exponents that would have qualified but are above the 19M cutoff. For tracking purposes here is the complete list of 1220. Note: this is not the set of exponents to release, since it includes already-assigned exponents. That will follow in the next few posts. Last fiddled with by GP2 on 2003-11-09 at 20:49 |
|
|
|
|
|
#7 |
|
Sep 2003
2·5·7·37 Posts |
There are 3240 exponents that meet the criteria of case 2 (harmful errorcodes) for exponents that need a 2nd check, limited to the range 10.5M - 19M.
Of these, 2753 are already assigned and 177 are cleared, leaving 310. Those 310 are attached here (note there may be overlap with the previous file attachment). Last fiddled with by GP2 on 2003-11-09 at 20:38 |
|
|
|
|
|
#8 |
|
Sep 2003
1010000111102 Posts |
There are 187 exponents that need a triple-or-higher check because all previous results are presumed not good by the criteria of case 1 and case 2.
Of these, 97 are already assigned and 41 are cleared, leaving 49. For tracking purposes, the complete set of 187 is attached here. Last fiddled with by GP2 on 2003-11-09 at 20:39 |
|
|
|
|
|
#9 |
|
Sep 2003
2·5·7·37 Posts |
Finally, when all is said and done and the above are combined, and everything is filtered against status.txt and cleared.txt as of Nov 9 2003 19:00 UTC, we are left with 1144 exponents.
This consists of 793 double-check exponents from case 1 (error-prone machines), 310 double-check exponents from case 2 (harmful error codes), and 49 triple checks. There's an overlap of 8 exponents between the first and second categories. This is the set of exponents I'm proposing for release: Last fiddled with by GP2 on 2003-11-09 at 20:35 |
|
|
|
|
|
#10 |
|
Sep 2003
2·5·7·37 Posts |
Hmm, I realized I haven't posted any thread tracking this November release of exponents, although there is a Tracking the October 2003 release of error-prone-machine exponents.
So this message will be a placeholder until I do that and link to the new thread. |
|
|
|
|
|
#11 |
|
Sep 2003
2·5·7·37 Posts |
I'd like to revisit the issue of "case 3", making use of uv3 and uv3h information to identify error-prone machines.
Starting with the October 9 2003 release of "weekly" data files, HRF3.TXT started showing error codes. Enough time has passed since then so that some of those entries in HRF3.TXT have now made it to LUCAS_V.TXT and BAD (they've been verified as either good or bad results). So we can do some analysis. Recall that uv3 (uv stands for "unverified") counts the number of results in HRF3.TXT for which: 1) a triple-check (or higher) is required -- only non-matching results (at least two or more) have been returned for that exponent) and uv3h counts the number of results in HRF3.TXT which satisfy both 1) and: 2) a "harmful" error code (other than 00000000, AB0000AB00, 00XX0000) Note that in general, a result with a "harmful" error code has about a 55%-60% chance of turning out to be bad. Now we might ask, what about a uv3h result -- a result with both a "harmful" error code and a triple-check required? Comparing October 9 2003 and December 31 2003 data files, we see that a uv3h result has a 92.5% chance of being bad. Namely, out of 400 uv3h results in the October 9 2003 version of HRF3.TXT that have been verified by December 31 2003, 370 turned out to be bad, 29 turned out to be good, and 1 was an apparently discarded duplicate. For more details, see the attached file. This means we can confidently use uv3h information to detect error-prone machines. Consider the following entries in usercomp_stats.txt (this can be downloaded from http://opteron.mersenneforum.org/dat...comp_stats.zip). Code:
uv3h uv3 uv2 bad good user ID/computer ID ---- --- --- --- ---- ------------------- 14 14 8 1 3 CHS,c925bba2a 11 11 17 0 2 DarekM,DAREKM12 12 12 18 1 1 K_R,katie11 11 11 15 0 1 S110459,xeon1 13 13 14 0 1 S18163,Whizkey 11 11 6 0 0 S47997,Zappa 13 13 13 0 13 Straker,Moonbase 11 11 18 0 0 Woodburner,woodbury2a 11 11 17 0 1 balu,balu 12 12 5 0 0 bourit,info11 14 14 15 0 0 cowanm,babbage7b 11 11 34 0 2 ddarknell,Atlas 11 11 14 0 0 fzttk8,C70BB7B21 12 12 41 0 3 gczajkow,il09P4 11 11 86 0 1 imad,celia 15 15 10 0 1 maathome,AMD3 13 13 13 0 9 stigmv,Frodo So, to summarize, if we do another release of early double-checking exponents some time this month, I think we can confidently include case 3 (see the first post in this thread for explanation of cases 1 to 3). The more thoroughly we do early double-checking of error-prone results, the more confident we will be that there is no missing Mersenne prime below the current M40. |
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Tracking the November 2003 release of error-prone exponents | nfortino | Data | 2 | 2005-07-25 13:46 |
| Tracking the October 2003 release of error-prone-machine exponents | GP2 | Data | 10 | 2005-07-25 13:14 |
| Release of Exponents | nitro | Lone Mersenne Hunters | 3 | 2004-01-02 06:41 |
| How can i get Prime95 to not release exponents? | E_tron | Software | 5 | 2003-12-19 02:59 |
| Exponents to re-release for first-time testing: "harmful" errorcodes | GP2 | Data | 4 | 2003-10-18 22:08 |