mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > PrimeNet > GPU to 72

Reply
Thread Tools
Old 2020-01-31, 15:37   #4610
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by chalsall View Post
That isn't happening. And this is GIMPS, not GIMFS.
I took your post https://www.mersenneforum.org/showpo...postcount=4550 to mean, first time testing is outrunning the project's ability to optimally TF and P-1, and you were asking for input on how to cope. Was that wrong?

If Ben Delo chose to, he could probably cause double checking to complete to M57885161 by Christmas without reducing his rate of first primality test completion by half. And reduce the DC backlog by years, and ease the pressure on TF and P-1 in the bargain. Come to think of it, his industrial grade kit would probably be quite good at P-1, which is relatively lacking in error detection and correction as an algorithm as implemented, and apparently falling behind.

He could also drop a few grand on RTX2080s, set up MISFIT, pay the utility bill for them, and improve the situation in TF a bit.
Quote:
Originally Posted by chalsall View Post
Lastly, while I appreciate that ~3 THzD/D is impressive, please note that for the last month GPU72's participants have averaged a total of ~300 THzD/D.

I would argue that it's better to keep the disciplined targeted firepower working the way it is now. And, again, work in the 10xMs (ideally to 76 or 77) is needed right now.
Soon to be 6ThzD/d, and later ~8. But of course, a small component of the whole is just that; a small percentage of the total. And my little effort is not GPU72, and it is disciplined and as targeted as GPU72 laying claim to massive amounts of exponents at the wavefront allows it to be. I do appreciate the description of the juggling act for the various categories. Simple it ain't.
kriesel is offline   Reply With Quote
Old 2020-01-31, 16:07   #4611
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

124538 Posts
Default

Quote:
Originally Posted by Uncwilly View Post
Why are you not using Misfit? Or a home brewed script to hand this?

Would 2 months be ok with you? or 3 or 4?
Assignment recycling is important. Old TF assignments should be recycled ahead of the first time LL wave in enough time that they can all get done.

Ben may discourage some (since he depresses the chance of them finding a prime). But, overall there is more total throughput. And total number of users does fall in the months after the spike around a new prime discovery.
My understanding is MISFIT only does TF. MISFIT as I recall requires MS Forms and .net on every system it's installed on. (Not so easy for linux users.) Nothing does CUDAPm1. No script that's been released does all the gpu apps I run.

I'm running a mix of everything I feel forwards the wavefronts toward discovery. TF, P-1, LLDC, PRP DC, PRP first test, LL first test, software QA, runtime scaling and limits probing, tuning tests and documentation, etc.

I have a homebrew script that does the big 6 gpu apps, to the extent of analyzing logs, determining active or dormant/hung/stopped status, gathering new results into one file per system, computing remaining worktodo in days of gpu throughput per app instance, etc. I haven't added get-work automation to the script yet; that's probably next or near it. I recently added a user-triggered-only self-update-from-file-share function (which is the 1% of it that is not OS-independent, only implemented for Win32 so far). I work on extending it now and then. It's perl compiled to an executable with no requirement for installing anything else on a system it runs on. It's been almost 2 years in the works and feels like nearing ready for release (months). I alternate among writing code, debugging, testing, and documenting, and many other unrelated tasks. Sometimes I write the documentation first and code to that.

I'd probably be fine with 3 months TF expiration. And not assigning first-tests until optimally TF and P-1 complete. Let cpu first-test fall back to DC assignments and P-1. Not really my call though.

Last fiddled with by kriesel on 2020-01-31 at 16:37
kriesel is offline   Reply With Quote
Old 2020-01-31, 16:34   #4612
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

10101001010112 Posts
Default

Quote:
Originally Posted by chalsall View Post
This is a policy decision made by George, further exasperated by the fact that TF and P-1 assignments are not constrained by the LL/DC assignment rules.
The issues are exacerbated, and so the participants may be exasperated.
Rules can be changed.
kriesel is offline   Reply With Quote
Old 2020-01-31, 16:38   #4613
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
Jun 2011
Thailand

2×3×1,609 Posts
Default

We are with axn here, if the man has resources and likes to LL or PRP, then he should LL or PRP. You won't tell me what to do with my rig, and the goal of the project is finding primes. Why the hack do we have the same discussion two times every year?
LaurV is online now   Reply With Quote
Old 2020-01-31, 17:13   #4614
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

9,767 Posts
Default

Quote:
Originally Posted by kriesel View Post
I took your post https://www.mersenneforum.org/showpo...postcount=4550 to mean, first time testing is outrunning the project's ability to optimally TF and P-1, and you were asking for input on how to cope. Was that wrong?
It was accurate at the time. However, some "big guns" stepped up to help readjust to the new realities. Oliver alone dumped 6.2 PHzD of work (!) in the last month. LaurV brought ~8 THzD/D of compute back to bear to help chase ahead of Cats 3 and 4 to 75 bits. And several others have stepped up their game as well.

I'll have some time to run the numbers again this weekend to see how we're looking, but there's a chance we'll be able to keep going to 77 for Cats 2 and below for the foreseeable future.

Last fiddled with by chalsall on 2020-01-31 at 17:14 Reason: Tweek.
chalsall is offline   Reply With Quote
Old 2020-01-31, 17:25   #4615
storm5510
Random Account
 
storm5510's Avatar
 
Aug 2009

111101001002 Posts
Default

Quote:
Originally Posted by kriesel View Post
...I'm personally running over 30 manually queued and reported gpu application instances...
I can see where you might want to keep all these going. I only have one now, which I use sparingly. When I do run it, I reduce it to 80% capacity. It runs cooler this way, and there is nearly no impact on throughput. Still over 1,000 Ghz-d/day.
storm5510 is offline   Reply With Quote
Old 2020-01-31, 17:34   #4616
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

9,767 Posts
Default

Quote:
Originally Posted by kriesel View Post
My understanding is MISFIT only does TF. MISFIT as I recall requires MS Forms and .net on every system it's installed on. (Not so easy for linux users.) Nothing does CUDAPm1. No script that's been released does all the gpu apps I run.
Have you looked at Mark Rose's primetools codebase?

I don't use it currently myself, but I have in the past. Reliable.

If it doesn't do everything you want already, it would probably be a good base to build upon.
chalsall is offline   Reply With Quote
Old 2020-01-31, 20:31   #4617
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

5,419 Posts
Default

Quote:
Originally Posted by chalsall View Post
It was accurate at the time. However, some "big guns" stepped up to help readjust to the new realities. Oliver alone dumped 6.2 PHzD of work (!) in the last month. LaurV brought ~8 THzD/D of compute back to bear to help chase ahead of Cats 3 and 4 to 75 bits. And several others have stepped up their game as well.

I'll have some time to run the numbers again this weekend to see how we're looking, but there's a chance we'll be able to keep going to 77 for Cats 2 and below for the foreseeable future.
Wow, excellent, good to see the group respond effectively to the challenge, thanks for the update.

Quote:
Originally Posted by chalsall View Post
Have you looked at Mark Rose's primetools codebase?
...
If it doesn't do everything you want already, it would probably be a good base to build upon.
Yes, I've looked at it, and summarized its feature set in a table with the other similar stuff I could find. My impression is its feature set is a small subset of what I'm aiming for. Also I don't know Python. Reprogramming vintage wetware is slow.

Quote:
Originally Posted by LaurV View Post
You won't tell me what to do with my rig... Why the hack do we have the same discussion two times every year?
Wouldn't dream of trying, LaurV. But some things need to be discussed now and then. If nothing else, it improves someone's understanding of how things are. In this case, definitely including mine. Things changed a lot this month since Chalsall's Jan 5 post.

Last fiddled with by kriesel on 2020-01-31 at 20:31
kriesel is offline   Reply With Quote
Old 2020-02-01, 00:28   #4618
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23×271 Posts
Default

Quote:
Originally Posted by LaurV View Post
We are with axn here, if the man has resources and likes to LL or PRP, then he should LL or PRP. You won't tell me what to do with my rig, and the goal of the project is finding primes. Why the hack do we have the same discussion two times every year?
+1
It's his firepower... he can do whatever the hell he wants with it, which at the moment seems to be prime hunting... If he wants advice or opinions about something, he'll ask for it most likely.

Something I try to live by: Just because it's your opinion doesn't automatically make it fact.
kracker is offline   Reply With Quote
Old 2020-02-01, 14:47   #4619
kriesel
 
kriesel's Avatar
 
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest

152B16 Posts
Default LL vs. LLDC time gap trend over time

At one time DC got done within the hardware lifetime of the system that did the first test.
Flaky systems got identified and corrective action could be taken regarding all their output in a timely manner (early triple check), and also the user could be notified their still-producing system was unreliable and any hardware issues addressed and output reliability improved.

Over the past 20 years, the gap between first test and double check has grown drastically, from under two years, to nearly a decade, longer than typical hardware lifetime.

Recent trends are toward more primality testing, less DC, so the gap will worsen (is worsening).

I think the required DC rate should be increased to get this large and growing lag under control.

Method used to gauge gap was to view a 1000-range of exponent beginning at the indicated base value, and find the longest time gap within that sample interval.

base LL year DC time gap years
3M (most date data missing)
4M 1998 1.5 (some first-test dates missing)
5M 1998 2.2
10M 2000 3.7
30M 2005 6.9
50M 2010 8.6
>50M tbd

Last fiddled with by kriesel on 2020-02-01 at 14:47
kriesel is offline   Reply With Quote
Old 2020-02-01, 14:54   #4620
axn
 
axn's Avatar
 
Jun 2003

508210 Posts
Default

Quote:
Originally Posted by kriesel View Post
Flaky systems got identified and corrective action could be taken regarding all their output in a timely manner (early triple check), and also the user could be notified their still-producing system was unreliable and any hardware issues addressed and output reliability improved.
You've made an excellent argument to stop LL and only hand out PRP.
axn is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Status Primeinator Operation Billion Digits 5 2011-12-06 02:35
62 bit status 1997rj7 Lone Mersenne Hunters 27 2008-09-29 13:52
OBD Status Uncwilly Operation Billion Digits 22 2005-10-25 14:05
1-2M LLR status paulunderwood 3*2^n-1 Search 2 2005-03-13 17:03
Status of 26.0M - 26.5M 1997rj7 Lone Mersenne Hunters 25 2004-06-18 16:46

All times are UTC. The time now is 08:26.


Mon Aug 2 08:26:24 UTC 2021 up 10 days, 2:55, 0 users, load averages: 2.15, 2.05, 1.85

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.