mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > PrimeNet

Reply
 
Thread Tools
Old 2014-10-07, 18:48   #166
blip
 
blip's Avatar
 
Jan 2014

2×73 Posts
Default

Quote:
Originally Posted by Madpoo View Post
I *think* that's the only page where I added the full-width borders to each sub-section... I don't know why exactly, but I think it helps the sections stand out better... they act as kind of a proxy for a table title, for example, since the tables are full width as well.
White space gets you a looong way towards making sections stand out. There is a fair amount of expertise around that in (old school, physical) typesetting and typography. Mathematicians get a feeling for that as soon as they start using TeX/LaTeX.

Quote:
Originally Posted by Madpoo View Post
Eww... I'll have to disagree with you there. :) I like tables with borders unless the table is just being used as a layout shortcut. Am I just old skool on that? Maybe?
Well, old school can be seen in pre-internet printed books. A good guide to table design can be found here.

Quote:
Originally Posted by Madpoo View Post
It doesn't *add* anything, but it's kind of cool and, no pun intended, it does make the graph pop out, visually. Now I"m thinking the bar chart for work types should also be 3D. LOL
Ockhams's razor applies here as well: If you don't need it to make a point - don't use it! On the other hand: If there is some important/interesting point to be made, make sure it pops out when you prepare the visualisation.
blip is offline   Reply With Quote
Old 2014-10-07, 19:13   #167
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

100110001101102 Posts
Default

Quote:
Originally Posted by Madpoo View Post
I honestly did try to keep some things as-is, especially in the manual result/manual assignment pages, if I had a good reason to think GPU72 in particular might be keying off certain things, but even then, it was just a guess on my part. For GPU72 at least, I figured it was using URLs to get work and read the output which I didn't change at all, and for checking things in, the submit button is moved now but it shouldn't care if it's just submitting the form through a script.
Thank's again for your work Madpoo. Just popping in for a moment, then have to get back to "real work" (sorry for the high-bandwidth below)...

1. The only thing which broke any of GPU72's spiders was the removal of the "You are visitor number" string (which I was stupidly using to ensure the full dataset had been brought down. A simple fix).

2. So you know, GPU72 only reserves and unreserves work, it never submits work (except via the Proxy, but that's entirely Worker/Client to Primenet).

3. With regards to exposing data via XML, that would be really (really) cool! Personally it would help me if the data contained on the thresholds page was made available thusly ASAP. Perhaps use this as a test? ( )

3.1. I haven't yet had time to write a parsing spider for that data, and it affects the GPU72 "Manual Reservation Spider" (which reclaims candidates at below 74 when we can handle it, and releases candidates at 73 iff needed).

3.1.1. Somewhat embarrasingly, I thus have to manual view the above page every few days, and update the value for the "Cat 4 cut-off" in my database ("Never send a human to do a machine's job...").

Kindest regards to all.
chalsall is offline   Reply With Quote
Old 2014-10-07, 20:55   #168
Madpoo
Serpentine Vermin Jar
 
Madpoo's Avatar
 
Jul 2014

CF116 Posts
Default

Quote:
Originally Posted by chalsall View Post
Thank's again for your work Madpoo. Just popping in for a moment, then have to get back to "real work" (sorry for the high-bandwidth below)...

3. With regards to exposing data via XML, that would be really (really) cool! Personally it would help me if the data contained on the thresholds page was made available thusly ASAP. Perhaps use this as a test? ( )
...
We'll probably have to figure out how you'd want the data (and to make sure I actually understand which parts you're looking at).

For example, today's #'s represented one way (and I'm just doing a FOR XML query in the db itself) would be:

<DailyLimits type="1" date="2014-10-06">
<num_exponents>1500</num_exponents>
<limit_exponent>33474272</limit_exponent>
</DailyLimits>

There are 6 limit types... limit type 1 there is that first double-check section... the num_exponents and limit_exponent are maybe what you're interested in?

Format aside (I haven't explored the relationships of that data to anything else), is that the kind of info you're after?
Madpoo is offline   Reply With Quote
Old 2014-10-07, 21:13   #169
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×67×73 Posts
Default

Quote:
Originally Posted by Madpoo View Post
Format aside (I haven't explored the relationships of that data to anything else), is that the kind of info you're after?
Yes.
chalsall is offline   Reply With Quote
Old 2014-10-07, 21:33   #170
Madpoo
Serpentine Vermin Jar
 
Madpoo's Avatar
 
Jul 2014

3,313 Posts
Default

Quote:
Originally Posted by chalsall View Post
Yes.
Okay, give this a try... I just ran this manually, so it's not set to auto-update yet or anything. If the format looks cool we can have it update at the same time that the data itself is being updated (it runs on it's own schedule @ 23:20 UTC currently).

http://www.mersenne.org/thresholds/thresholds.xml

The types 1, 2, 3, 11, 12, 13 correlate to:
DC thresholds (1, 2 and 3)
LL thresholds (11, 12 and 13)

I could add a text field to briefly describe each thing, like:
1: description="DC completed in < 60 days"
2: description="DC completed in 61-100 days"
3: description="DC completed in 101-240 days"
..
11: description="LL completed in < 90 days"
12: description="LL completed in 91-150 days"
13: description="LL completed in 151-270 days"

etc. But that's just describing what the website already says, and that's probably a better place for anyone wanting a better understanding. If George ever changes those, he wouldn't have to update the query that spits those out as well. :)
Madpoo is offline   Reply With Quote
Old 2014-10-07, 21:34   #171
snme2pm1
 
"Graham uses ISO 8601"
Mar 2014
AU, Sydney

3638 Posts
Default

Quote:
Originally Posted by Madpoo View Post
According to the source of those graphs, it's a measure of the GHz-hours for the past 24-hours and 90-days.

It should probably be labeled on there
That doesn't sound right. Is it wise to believe everything you read!
I understood the measure to be relative to registered equipment deemed capability, up to 100%.
However for many folk lodging GPU work results the red needle is persistently banging hard against the upper stop with figures in the thousand and more percent, though nowhere is there any hint as to what full scale 100 means.
The recently added title "GHz-Hours for 24 hours and 90 days" doesn't seem to serve well.
snme2pm1 is offline   Reply With Quote
Old 2014-10-07, 22:01   #172
Madpoo
Serpentine Vermin Jar
 
Madpoo's Avatar
 
Jul 2014

3,313 Posts
Default

Quote:
Originally Posted by snme2pm1 View Post
That doesn't sound right. Is it wise to believe everything you read!
I understood the measure to be relative to registered equipment deemed capability, up to 100%.
However for many folk lodging GPU work results the red needle is persistently banging hard against the upper stop with figures in the thousand and more percent, though nowhere is there any hint as to what full scale 100 means.
The recently added title "GHz-Hours for 24 hours and 90 days" doesn't seem to serve well.
Oh...well, I don't know what the heck it is apparently. :)

I skim read and saw like "ghzhrs_last90d" in the code that generates the graph... upon closer reflection, there's more. That's just the dividend to the thing. If it were *just* that, I'd be right. :)

The divisor there comes from a SQL function that I'd have to explore and see where it gets that # from. If there's a chance for that to exceed 100% then it's not working as designed and there's no use labeling it right now anyway.

I removed the (wrong) text I put there for now.
Madpoo is offline   Reply With Quote
Old 2014-10-07, 22:10   #173
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

23×149 Posts
Default

Quote:
Originally Posted by Madpoo View Post
If there's a chance for that to exceed 100% then it's not working as designed and there's no use labeling it right now anyway.
It most certainly can exceed 100%.
Primenet (thinks it) knows your expected daily throughput based on all the registered machines you have and past performance.
It gets totally thrown by manual results from GPUs because it knows nothing about them (they're not registered CPUs). Not to mention that GPUs tend to put out a lot more GHz-days than CPUs.
James Heinrich is offline   Reply With Quote
Old 2014-10-07, 22:38   #174
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×67×73 Posts
Default

Quote:
Originally Posted by Madpoo View Post
Okay, give this a try... I just ran this manually, so it's not set to auto-update yet or anything. If the format looks cool we can have it update at the same time that the data itself is being updated (it runs on it's own schedule @ 23:20 UTC currently).
Why, that's rather sweet darling (or is it sir?).

Either way, thanks. That can possibly work.
chalsall is offline   Reply With Quote
Old 2014-10-07, 23:01   #175
snme2pm1
 
"Graham uses ISO 8601"
Mar 2014
AU, Sydney

35 Posts
Default Time and Space

Quote:
Originally Posted by Madpoo View Post
I'm updating the dates as I find them... I hadn't considered doing that as part of the update but now that you've pointed it out, it's kind of obvious.
It's great that you're ready to tackle those inconsistencies, and that some places have already been brought closer to iso. Thankyou.
Lot's more to pick through...

Quote:
Originally Posted by Madpoo View Post
as far as putting the UTC timestamp somewhere... maybe you're the one who asked previously, but I never could figure out a good spot for it. It's probably not super helpful to most people,
Must have been someone else.
There may be another subtle purpose for that timestamp that has been overlooked.
It provides an indicator of timezone that tends to make repeat of such indication redundant at every date/time. Indeed some pages like Account->Results, http://www.mersenne.org/results/, can have lots of times, but the timezone is not mentioned at every table row, nor need it be in a heading row if such is made plain by other means.

I note that mersenneforum.org for an unauthenticated user states at page bottom something like All times are GMT. The time now is 20:57., and for an authenticated user something like All times are GMT +11. The time now is 08:01.
I am not suggesting that mersenne.org similarly vary it's output to the local time of an authenticated user, but the point I'm making is that having clearly established the timezone once on the page it then becomes redundant to anywhere else duplicate that signal. Potential for space saving in some headings (especially tables) also.
snme2pm1 is offline   Reply With Quote
Old 2014-10-07, 23:22   #176
Madpoo
Serpentine Vermin Jar
 
Madpoo's Avatar
 
Jul 2014

63618 Posts
Default

Quote:
Originally Posted by snme2pm1 View Post
...though nowhere is there any hint as to what full scale 100 means...
Here's where that # comes from (the divisor in that percentage):
Quote:
potential ghz hours in a 24-hour period = sum( p4_cpu_speed_mhz * cpu_processor_cores ) * 24.0 / 1680.0

GHz_days are based on a single core of a hypothetical 1GHz Core 2 Duo. The Core 2 architecture is worth 1.68 times the P4 architecture. Thus to convert from p4_cpu_speed_mhz to potential_ghz_days divide the P4 MHz by 1.68 to convert to Core 2 architecture and divide by 1000 to convert to GHz.
So let's say I have a system with 12 physical cores at 3.066GHz. My particular P4 cpu speed in MHz for that system is 7073 MHz per core. The "P4 CPU Speed MHz" comes from that particular system's "rolling average" which, right now, is 2307 * the actual core speed 3066 MHz. And since that rolling index is a measure compared to 1000 for a P4, we divide by 1000. Altogether now, 3066*(2307/1000) = 7073.

You can find your own rolling average in your machine's local.txt file in the Prime95 directory, e.g. "RollingAverage=2307"

So my "potential" GHz-Hrs / day equals the sum of all my individual machines indexed speed*# of cores, * 24 hours, divided by the magic fudge factor 1.68, and divided by 1000 to go from MHz to GHz because machines are faster than they used to be. :)

For the one machine in my account, I get (7073*12) * 24 / 1680 = 1212 GHzHrs/day. (It's a bit more in reality because I also have my v4 migrated info showing up in there, but it's a pittance).
=============================

Clearly this all wouldn't hold true for GPUs; they're definitely going to be returning far more results than that "potential GHz" would suggest, if GPUs are even counted in the database the same way in the first place, like CPUs are.

It seems like some apples/oranges going on, and it may be more interesting to see some kind of 30/60/90 day rolling averages or something or just a chart of GHz-hours over the course of a week or two like I mentioned. That wouldn't require any guessing or pre-knowledge of the systems in use, it's more results oriented.

If you had a farm of just CPU work, it could be useful to see the % of output you get compared to what it could be... if you have machines turned off at night or weekends, then your total throughput would be reflected as, for example, 50% of total capacity. But that's only true for the long term 90-day view... the 24-hour dial might not be as interesting if you only have a few systems/cores doing work.

On the other hand, Curtis' account must be pretty interesting since there would always be something going on, results being checked in pretty regularly, or the larger team accounts.

I hope that helps everyone. I know *I* just learned a lot about all of that by digging through and figuring out what's what.
Madpoo is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Database design xilman Astronomy 1 2017-04-30 22:25
Theoretical Experiment Design c10ck3r Homework Help 7 2015-02-03 08:54
Digital Logic Design henryzz Puzzles 9 2014-12-04 20:56
new intel design tha Hardware 5 2007-04-19 11:38
design factoring algorithms koders333 Factoring 14 2006-01-25 14:08

All times are UTC. The time now is 04:42.


Fri Aug 6 04:42:11 UTC 2021 up 13 days, 23:11, 1 user, load averages: 1.65, 2.40, 3.44

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.