mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2015-08-20, 16:54   #23
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

1011011100102 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
I have noticed that a ton of CPU cycles in GIMPS get wasted on half completed assignments by users that abandon, even amongst some of my coworkers who I convinced to run prime95 on their machines... Until they stopped. In 2015 with fast and very present Internet wouldn't it be possible to actively checkpoint to PrimeNet so other users could pick up where an exponent was left off? Just wait till the residue for a specific iteration is small and upload a checkpoint :)
Yes, it would be possible. The checkpoint files aren't that big -- a few MB a piece. The problem is one of resources: someone to code the necessary changes and paying for the TBs of storage.
Mark Rose is offline   Reply With Quote
Old 2015-08-20, 17:00   #24
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
That said, if I can get under full steam (~24GhzDay/Day) three weeks of work should nearly clear the DCTF pile to current release levels. Of course that all depends on adequately addressing the power, cooling, and supporting infrastructure issues....
There is about 9 PHz days of DCTF work. Not all of it is being held by GPU72. Even at 30 THz-d/d, that's still going to take 10 months... but it would be pretty awesome to have that cleared in a year!
Mark Rose is offline   Reply With Quote
Old 2015-08-20, 17:01   #25
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

19×397 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
I have noticed that a ton of CPU cycles in GIMPS get wasted on half completed assignments by users that abandon, even amongst some of my coworkers who I convinced to run prime95 on their machines... Until they stopped. In 2015 with fast and very present Internet wouldn't it be possible to actively checkpoint to PrimeNet so other users could pick up where an exponent was left off?
There is a downside to implementing this. The quality of the result is only as good as the worst computer to work on the LL test. If an overclocker puts a few million low quality iterations in, then a highly reliably machine may waste tens of millions iterations finishing the LL test.
Prime95 is offline   Reply With Quote
Old 2015-08-20, 17:01   #26
VictordeHolland
 
VictordeHolland's Avatar
 
"Victor de Hollander"
Aug 2011
the Netherlands

23×3×72 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
Just wait till the residue for a specific iteration is small and upload a checkpoint :)
You don't have to wait for a specific iteration, you can save the residue at every iteration, but to limit I/O writes it is usually done only every 5-15 minutes.
VictordeHolland is offline   Reply With Quote
Old 2015-08-20, 17:03   #27
airsquirrels
 
airsquirrels's Avatar
 
"David"
Jul 2015
Ohio

51710 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
Yes, it would be possible. The checkpoint files aren't that big -- a few MB a piece. The problem is one of resources: someone to code the necessary changes and paying for the TBs of storage.
Hmm, well one problem at a time I suppose. It would also add a pretty significant benefit in that double checks could fail as soon as they don't match a checkpoint instead of needing a complete run. Last I heard a significant percentage still fail, so that's a not so insignificant amount of resources. Ideally that would lead to less DC backlog and also the ability for those 'slower' computers to contribute to the LL by advancing exponents the same way we do with TF, one iteration level at a time.

I know I'm new here so I don't want to overreach, but I'm just as happy to contribute code and storage when the time comes.
airsquirrels is offline   Reply With Quote
Old 2015-08-20, 17:08   #28
airsquirrels
 
airsquirrels's Avatar
 
"David"
Jul 2015
Ohio

10000001012 Posts
Default

Quote:
Originally Posted by Prime95 View Post
There is a downside to implementing this. The quality of the result is only as good as the worst computer to work on the LL test. If an overclocker puts a few million low quality iterations in, then a highly reliably machine may waste tens of millions iterations finishing the LL test.
One (perhaps unpopular) way to mitigate this would be real time DC, each iteration group assigned to two users. The benefit of catching and correcting errors early would probably save enough resources to be worth it. How many resources are wasted by the first low quality computer completing weeks of work for an exponent when it could be detected as having made a mistake within the first few days.

I imagine it would also be easier to quickly flag and quarantine bad actors in that case
airsquirrels is offline   Reply With Quote
Old 2015-08-20, 17:09   #29
airsquirrels
 
airsquirrels's Avatar
 
"David"
Jul 2015
Ohio

11×47 Posts
Default

Quote:
Originally Posted by VictordeHolland View Post
You don't have to wait for a specific iteration, you can save the residue at every iteration, but to limit I/O writes it is usually done only every 5-15 minutes.
Sorry for the triple reply, my thought on waiting was that at some points the residue is going to be much smaller than others, saving storage and bandwidth. Perhaps that is a premature optimization.

Last fiddled with by airsquirrels on 2015-08-20 at 17:17
airsquirrels is offline   Reply With Quote
Old 2015-08-20, 17:21   #30
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2·67·73 Posts
Default

Quote:
Originally Posted by Mark Rose View Post
There is about 9 PHz days of DCTF work. Not all of it is being held by GPU72. Even at 30 THz-d/d, that's still going to take 10 months... but it would be pretty awesome to have that cleared in a year!
771.7 THz Days by my calculations.

This includes everything still to be working, not just that held by GPU72.

That being said, it's been interesting watching the Estimated Days to Complete Trial Factoring for all Candidates drop precipitously in the DCTF table the last few days!
chalsall is offline   Reply With Quote
Old 2015-08-20, 17:30   #31
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×67×73 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
Sorry for the triple reply, my thought on waiting was that at some points the residue is going to be much smaller than others, saving storage and bandwidth.
Likely also wrong.

My understanding is the true residue is about as close to true noise as you can get. Read: uncompressable.

(Unless, of course, the candidate is a MP, at which point at the very last step it's *very* compressible!)
chalsall is offline   Reply With Quote
Old 2015-08-20, 17:36   #32
Mark Rose
 
Mark Rose's Avatar
 
"/X\(‘-‘)/X\"
Jan 2013

2×5×293 Posts
Default

Quote:
Originally Posted by chalsall View Post
771.7 THz Days by my calculations.
http://imgur.com/szouqSx

Quote:
That being said, it's been interesting watching the Estimated Days to Complete Trial Factoring for all Candidates drop precipitously in the DCTF table the last few days!
Yeah, it was 1400 not long ago!
Mark Rose is offline   Reply With Quote
Old 2015-08-20, 17:43   #33
airsquirrels
 
airsquirrels's Avatar
 
"David"
Jul 2015
Ohio

11·47 Posts
Default

Quote:
Originally Posted by chalsall View Post
Likely also wrong.

My understanding is the true residue is about as close to true noise as you can get. Read: uncompressable.

(Unless, of course, the candidate is a MP, at which point at the very last step it's *very* compressible!)
Well, there is certainly some debate as to how truly chaotic the residue sequence is, but for practical purposes and until we advance the state of the theory there we can treat it as a truly random number between 1 and 2^p-2. Any specific iteration (10,000, etc.) is going to be essentially random, but there will be points in the sequence where the residue is much closer to 1, and thus would take less bits to store. The more I think about this the less it is probably worth the effort to consider optimizing for those opportunities to checkpoint cheaply.
airsquirrels is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Large Gaps >500,000 mart_r Prime Gap Searches 119 2017-08-21 12:48
48-bit large primes! jasonp Msieve 24 2010-06-01 19:14
a^n mod m (with large n) Romulas Math 3 2010-05-08 20:11
Is this a relatively large number? MavsFan Math 3 2003-12-12 02:23
New Server Hardware and price quotes, Funding the server Angular PrimeNet 32 2002-12-09 01:12

All times are UTC. The time now is 07:15.


Fri Aug 6 07:15:22 UTC 2021 up 14 days, 1:44, 1 user, load averages: 3.50, 2.94, 2.76

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.