mersenneforum.org  

Old 2019-01-21, 19:51   #166
Chuck
 

I switched to build 8 in the middle of a double check test. The test finished successfully and matches the first test, but the residue is not displayed in my work results details. Instead, n/a is displayed.

You can also see this on the exponent status page.
[Attached image: gimps.jpg]
Old 2019-01-21, 22:07   #167
Prime95

Quote:
Originally Posted by Chuck
I switched to build 8 in the middle of a double check test. The test finished successfully and matches the first test, but the residue is not displayed in my work results details. Instead, n/a is displayed.

You can also see this on the exponent status page.

I've got Aaron working on it. I didn't think the new JSON results would cause any problems since the old text is also sent to the server.
Old 2019-01-21, 22:09   #168
Prime95

Quote:
Originally Posted by ATH
[Work thread Jan 17 17:51:17] Iteration: 54083596/88479649, Possible error: round off (0.4366392078) > 0.42188

That error is not significant enough to change the crossovers. Your previous reports had several errors, some as high as 0.45+ and 0.46+.

BTW, build 8 will not do any round off checking during a Gerbicz PRP (build 6 did when near the limit of an FFT size)
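
For anyone tallying such warnings over a long run, here is a minimal sketch that pulls the numbers out of a line in the format quoted above. The function name `parse_roundoff` and the regular expression are illustrative assumptions based only on that one quoted line; other Prime95/mprime messages may be formatted differently.

```python
import re

# Pattern written against the single warning line quoted in this thread;
# it is an assumption that other round-off warnings share this layout.
ROUNDOFF_RE = re.compile(
    r"Iteration: (?P<iter>\d+)/(?P<total>\d+), "
    r"Possible error: round off \((?P<err>[0-9.]+)\) > (?P<limit>[0-9.]+)"
)


def parse_roundoff(line):
    """Return (iteration, total, error, limit), or None if the line does not match."""
    m = ROUNDOFF_RE.search(line)
    if m is None:
        return None
    return int(m["iter"]), int(m["total"]), float(m["err"]), float(m["limit"])


line = ("[Work thread Jan 17 17:51:17] Iteration: 54083596/88479649, "
        "Possible error: round off (0.4366392078) > 0.42188")
iteration, total, err, limit = parse_roundoff(line)
# Report how far the test had progressed and by how much the limit was exceeded.
print(f"{100 * iteration / total:.1f}% done, excess over limit: {err - limit:.5f}")
```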
Old 2019-01-22, 04:14   #169
Madpoo

Quote:
Originally Posted by Prime95
I've got Aaron working on it. I didn't think the new JSON results would cause any problems since the old text is also sent to the server.

I think I have the new JSON results being parsed okay for the history section now. I may be missing the proper parsing for PRP results, but I haven't seen any live examples of those pass through from the new P95 build yet. It may be different from what gpuowl is doing for its JSON, so it may or may not show up correctly. I'll deal with it when the time comes.
Old 2019-01-22, 11:24   #170
ATH

The FixedHardwareUID=1 is not working in 29.5b8 or maybe I am doing it wrong.

I removed FixedHardwareUID=1 and the line HardwareGUID= from prime.txt and then started mprime again for about 1 minute.
Then I stopped it again and added FixedHardwareUID=1 back to prime.txt and started it again, but some hours later I got:

[Comm thread Jan 22 03:57:02] Updating computer information on the server
[Comm thread Jan 22 03:57:02] PrimeNet error 33: CPU identity mismatch
[Comm thread Jan 22 03:57:02] CPU identity mismatch: g=481073DBD354B3EA38D6C9286ADA4D03 hg=daf2d7cfa4eefcf9c6f2696915f78d9f wg=
[Comm thread Jan 22 03:57:02] Updating computer information on the server
[Comm thread Jan 22 03:57:02] Exchanging program options with server

and it created a new computer account again for most of my instances.

Is it because several instances cannot use the same ComputerGUID= value in local.txt at the same time?
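
To compare what each instance reports, the mismatch line can be split into its fields. This is a minimal sketch; the function name `parse_guid_mismatch` is hypothetical, and reading g, hg and wg as the ComputerGUID, HardwareGUID and WindowsGUID values is an assumption drawn from the option names in this thread, not from PrimeNet documentation.

```python
import re


def parse_guid_mismatch(line):
    """Extract the g=, hg= and wg= fields from a 'CPU identity mismatch' log line.

    Returns a dict; a field that is present but empty (like wg= on Linux)
    maps to an empty string.
    """
    # \b keeps the bare "g" alternative from matching inside "hg" or "wg".
    return dict(re.findall(r"\b(g|hg|wg)=([0-9A-Fa-f]*)", line))


line = ("[Comm thread Jan 22 03:57:02] CPU identity mismatch: "
        "g=481073DBD354B3EA38D6C9286ADA4D03 "
        "hg=daf2d7cfa4eefcf9c6f2696915f78d9f wg=")
fields = parse_guid_mismatch(line)
for name in ("g", "hg", "wg"):
    print(name, "=", fields[name] or "(empty)")
```

Running this over the logs of several instances would show whether they all report the same g value while differing in hg.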

Last fiddled with by ATH on 2019-01-22 at 11:25
Old 2019-01-22, 17:54   #171
GP2
 

Quote:
Originally Posted by ATH
The FixedHardwareUID=1 is not working in 29.5b8 or maybe I am doing it wrong.

I changed my scripts to try a new way and it seems to solve the problem.

When my scripts launch a new instance, they install a prime.txt file containing only FixedHardwareUID=1. There is no HardwareGUID= line and no WindowsGUID= line. They also install a local.txt file containing only ComputerID=c5.large (for example, or whatever the AWS instance type is); there is no ComputerGUID= line.

After the new instance launches, mprime itself creates a HardwareGUID= line in prime.txt and fills in a value, and it creates a WindowsGUID= line in prime.txt and leaves it blank (it's Linux, after all). It also creates a ComputerGUID= line in local.txt and fills in a value.

When an old instance terminates and a new instance launches later and takes over the existing working directory, the script never overwrites the prime.txt file, and it only overwrites the local.txt file if the instance type is different from the one in the ComputerID line. The latter only happens if I manually moved the working directory to a different parent directory, for instance if c5.xlarge spot instances temporarily became cheaper than c5.large spot instances.

With this setup, I find it no longer keeps creating new entries in https://www.mersenne.org/cpus/

This setup is changed from what I described in my How-to guide, but it seems to work, so I need to update the guide. Unfortunately AWS also changed their configuration screens, so a bunch of other edits to the guide also need to be made.
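
The launch logic described above can be sketched roughly as follows. This is not the actual script from the guide; the function name `prepare_workdir` and its arguments are hypothetical, and only the file names and option names come from this thread.

```python
from pathlib import Path


def prepare_workdir(workdir, instance_type):
    """Seed or reuse an mprime working directory for a newly launched instance."""
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    prime_txt = workdir / "prime.txt"
    local_txt = workdir / "local.txt"

    # Never overwrite an existing prime.txt: mprime has already added its
    # HardwareGUID= and WindowsGUID= lines there.
    if not prime_txt.exists():
        prime_txt.write_text("FixedHardwareUID=1\n")

    # Replace local.txt only when the recorded instance type differs, so an
    # unchanged directory keeps the ComputerGUID= line mprime generated.
    wanted = f"ComputerID={instance_type}"
    current = local_txt.read_text().splitlines() if local_txt.exists() else []
    if wanted not in current:
        local_txt.write_text(wanted + "\n")
```

Calling `prepare_workdir("/some/workdir", "c5.large")` twice in a row leaves both files untouched the second time; calling it again with "c5.xlarge" rewrites only local.txt.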
Old 2019-01-22, 18:13   #172
GP2
 

Quote:
Originally Posted by GP2
I changed my scripts to try a new way and it seems to solve the problem.

However, this still isn't ideal, because a better long-term solution would probably be just to switch to AWS Batch and run each exponent as a separate batch job in a container. But then the problem of proliferating CPUs reappears... a new CPU would get created for every single exponent you test.

Another problem is that when mprime runs out of worktodo lines, it just idles. For a batch job, it should just terminate.

It ought to be possible to create a modified mprime that is more cloud-native. Have it write its savefiles to S3 buckets instead of to a filesystem. Avoid trying to identify which physical machine ran a particular exponent; that doesn't make sense in a new world of virtual machines and containers. And so forth.
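
Until mprime itself can exit when it runs out of work, a batch wrapper could make that decision from outside. Below is a minimal sketch of the check only. The function name `pending_work` is hypothetical, treating lines that start with "[" as worker section headers is an assumption about the work file layout, and the sample entry is illustrative rather than a real assignment.

```python
def pending_work(text):
    """Return the lines of a worktodo file that still describe work to do."""
    lines = []
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and "[Worker #N]"-style section headers (assumed layout).
        if not line or line.startswith("["):
            continue
        lines.append(line)
    return lines


# Illustrative file contents; the entry format shown is an assumption.
sample = "[Worker #1]\nPRP=1,2,78106811,-1\n\n"
print(len(pending_work(sample)), "work line(s) pending")
```

A wrapper could run this on the work file after each result is reported and stop the job once the returned list is empty.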
Old 2019-01-22, 19:26   #173
Chuck
 

Quote:
Originally Posted by Madpoo
I think I have the new JSON results being parsed okay for the history section now. I may be missing the proper parsing for PRP results, but I haven't seen any live examples of those pass through from the new P95 build yet. It may be different from what gpuowl is doing for its JSON, so it may or may not show up correctly. I'll deal with it when the time comes.

I've got a PRP double check running now.
Old 2019-01-22, 19:37   #174
ATH

Quote:
Originally Posted by Madpoo
I think I have the new JSON results being parsed okay for the history section now. I may be missing the proper parsing for PRP results, but I haven't seen any live examples of those pass through from the new P95 build yet. It may be different from what gpuowl is doing for its JSON, so it may or may not show up correctly. I'll deal with it when the time comes.

This PRPDC will finish in ~25 hours with 29.5b8: https://mersenne.org/M78106811

It was started with 29.5b6 but I assume that does not matter.

Here are 2 examples of PRPCF and PRPCFDC that were fully done with 29.5b8:
https://mersenne.org/M8786537
https://mersenne.org/M6915737

Last fiddled with by ATH on 2019-01-22 at 20:22
Old 2019-01-24, 06:44   #175
Prime95

@ATH, GP2: I just changed FixedHardwareUID to not send any HardwareGUID info to the server. This seems to work just fine in my limited testing. I think that will address both of your scenarios.

Just generate a ComputerGUID and use it in as many places as you like.
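
One way to generate such a value is sketched below. It assumes that any random string of 32 hexadecimal digits is acceptable, which matches the length of the g= value in the log earlier in the thread but is not confirmed here; the function name `new_computer_guid` is hypothetical.

```python
import uuid


def new_computer_guid():
    """Return 32 random hexadecimal digits, uppercase like the g= value in the log."""
    # uuid4().hex is the UUID's 128 bits written as 32 hex digits without dashes.
    return uuid.uuid4().hex.upper()


guid = new_computer_guid()
print(f"ComputerGUID={guid}")
```

The printed line could then be placed in local.txt on every instance that should share one computer entry.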

Last fiddled with by Prime95 on 2019-01-24 at 06:48
Old 2019-01-24, 06:49   #176
kriesel
 

I started testing 29.5b8 x64 for Windows and noticed that PRP or LL worker windows don't indicate what sort of calculation is being done in the worker title bar updates. This is the same behavior as in earlier builds; see https://www.mersenneforum.org/showpo...&postcount=158
It's really handy to have the computation type displayed when it's P-1 or ECM, and indicating PRP or LL would be a very welcome addition.

Last fiddled with by kriesel on 2019-01-24 at 06:53

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.