mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2016-01-20, 22:09   #441
pegnose
 
pegnose's Avatar
 
Jan 2016

10100012 Posts
Default

Quote:
Originally Posted by chalsall View Post
Please trust me, I understand. When I was responsible for building out the first wireless WAN here in Barbados the manufacturer had a "once a month" bug. Of course, after deploying ~100 radios, the problem manifested ~3 times a day...
Oh nose. What was it?

Quote:
Originally Posted by chalsall View Post
Are the other three drives SSDs or HDs? Please know that "spinning rust storage" can have rather extreme random spikes in their draw (both OS driven, and independent).
Its one Samsung 840 Pro 256 GB SSD and two WD Green 3TB.

If I understand my PSU correct, the 4 rails are 1) for CPU and mobo, 2) for drives and all other devices, 3) PCIe 1, 4) PCIe 2. If I am correct, there is no danger here. Unfortunately there is no info on that on their website.

Quote:
Originally Posted by chalsall View Post
Don't guess. Test.
Many people with many different PSUs have thre freeze. There is no reason to believe it is the problem.

Quote:
Originally Posted by chalsall View Post
I would suggest (if you and/or others can) to first remove any kit you can (for example, HDs, GPUs, RAM), and rerun the tests you've used to produce the observed crashing (even if not deterministically -- currently you're doing statistical testing). Swap out MBs and PSUs. Make sure your mains power is good.
That is exactly what I did. As soon as ASUS support pointed me to my memory not exactly being compatible, I adjusted settings and took all componentes out except CPU and one ram module. I even unplugged all the fans. Ok, the DVD drive was sill on, but I had to run memtest86 somehow. ;)

Finally I arrived at memtest86, Prime95, and idle state, running for a whole day without any issue - with correct memory timings, more power for DRAM, and some compat settings that possibly are pure placebo.

Quote:
Originally Posted by chalsall View Post
This is not to say this is not a CPU issue, but you don't /know/ it is yet.

I hope that makes sense and helps.
To bring you up to speed: only few of us were able to imporve their situation by RMAing the CPU. My impression is that many think it is a platform/bios/microcode (whatever of those) issue - me included. I don't think that anything is broken, particularly as I had two crash-free months in the beginning. Here is 'our' main discussion thread:

http://www.tomshardware.co.uk/forum/....html#17356820

Some have the Skylake freeze during idle, others during load, one guy while video streaming. Some could improve by disabling c-states, some with memory timings and voltages, some even with CPU core voltage, if I remember correctly. It is diffuse, obscure, and some other nice words. ;)

It could be very different problems, but interestingly, all these different issues were not completely cured on any machine (not that I read of), but only got better to a more or less substantial extent.
pegnose is offline   Reply With Quote
Old 2016-01-20, 22:18   #442
pegnose
 
pegnose's Avatar
 
Jan 2016

34 Posts
Default

Quote:
Originally Posted by kladner View Post
chalsall:
pegnose:
I'm pretty sure, from the OP's post, that all the drives are SSDs. Twenty watts for three spinners also seems pretty unlikely.
Samsung 840 pro: 3.21 W in use
WD Green 3TB: 4.1 W in use (x2)
plus a substantial buffer, or let it be 30 W altogether; on one PSU rail, what are we talking about?!
pegnose is offline   Reply With Quote
Old 2016-01-20, 22:23   #443
pegnose
 
pegnose's Avatar
 
Jan 2016

34 Posts
Default

Quote:
Originally Posted by chalsall View Post
Please also note this: "My last freeze (and the first AFTER I thought I was finally good) now was with high load on CPU and HDD".
Yes, I stated this intentionally. But I rather meant data transfer. My software Raid 1 was resyncing. Of course, this also means power, but spikes? Plus: even the BeQuiet support told me that I would rather have to fear black-outs in such cases.

I really apreciate your effort!! But I am afraid we are on the wrong track. First thing I do is get different (brand) compatible memory. I.e. after I flash the new 1402 bios update and get the next freeze (if any, haha).

Last fiddled with by pegnose on 2016-01-20 at 22:25
pegnose is offline   Reply With Quote
Old 2016-01-20, 22:25   #444
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

230668 Posts
Default

Quote:
Originally Posted by pegnose View Post
It is diffuse, obscure, and some other nice words. ;)

It could be very different problems, but interestingly, all these different issues were not completely cured on any machine (not that I read of), but only got better to a more or less substantial extent.
OK. But, you guys are changing many different variables all at the same time, with little cross correlation nor testable results.

This is not how the Scientific Method works.

To use an analogy, this is worse than shooting a shotgun in the dark hoping to find your keys....
chalsall is online now   Reply With Quote
Old 2016-01-20, 22:29   #445
pegnose
 
pegnose's Avatar
 
Jan 2016

10100012 Posts
Default

Quote:
Originally Posted by chalsall View Post
OK. But, you guys are changing many different variables all at the same time, with little cross correlation nor testable results.

This is not how the Scientific Method works.

To use an analogy, this is worse than shooting a shotgun in the dark hoping to find your keys....
You mean shooting the shotgun at the streetlight you are standing below... ,)
pegnose is offline   Reply With Quote
Old 2016-01-20, 22:31   #446
pegnose
 
pegnose's Avatar
 
Jan 2016

34 Posts
Default

And what is wrong with getting advised memory? I say: check one component at a time and make 100% sure it is ok. I am not done with memory, yet. If I get new memory from a different brand that is in my HCL, and I still have the issue, I move on to the next component.
pegnose is offline   Reply With Quote
Old 2016-01-20, 22:33   #447
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2·67·73 Posts
Default

Quote:
Originally Posted by pegnose View Post
I really apreciate your effort!! But I am afraid we are on the wrong track. First thing I do is get different (brand) compatible memory. I.e. after I flash the new 1402 bios update and get the next freeze (if any, haha).
Have you, personally, tried a different motherboard supplier with all your other components, including the CPU?

I learnt the hard way to eliminate *all* variables....
chalsall is online now   Reply With Quote
Old 2016-01-20, 22:34   #448
pegnose
 
pegnose's Avatar
 
Jan 2016

34 Posts
Default

Quote:
Originally Posted by chalsall View Post
Have you, personally, tried a different motherboard supplier with all your other components, including the CPU?

I learnt the hard way to eliminate *all* variables....
No. As I said: I started out with memory, and I am not yet done with it.

But, of course, part of the problem is - and this is more or less the same with all of us: I bought my mobo more than 6 months ago. What will my vendor say if I want to return it without being able to proove that it is broken and that it WAS broken from the beginning (after 6 mo that is necessary), PLUS that I want a different one in return? I should be happy if he deems my worthy of even the shortest response.

AND on the other hand: as I said, I don't believe that something is broken (or sort of broken with all ASUS Z170 boards). If my other componentes are fine, ASUS support has to deal with it. I will make them. Oh, I will.

EDIT: You HEAR me, ASUS?!?

Last fiddled with by pegnose on 2016-01-20 at 22:39
pegnose is offline   Reply With Quote
Old 2016-01-20, 22:45   #449
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2·67·73 Posts
Default HDD Diet: Power Consumption and Heat Dissipation

Just to put this out there...

Almost eleven years old, and yet still relevant....

Last fiddled with by chalsall on 2016-01-20 at 22:48 Reason: s/twelve/eleven/
chalsall is online now   Reply With Quote
Old 2016-01-20, 22:51   #450
pegnose
 
pegnose's Avatar
 
Jan 2016

34 Posts
Default

Quote:
Originally Posted by chalsall View Post
Just to put this out there...

Almost eleven years old, and yet still relevant....
You are right, that is important.

In my case, I have a good cooling solution (and case ;). My HDDs are resyncing for 16 h now, and they are 31°C and 32°C.


EDIT: Nighty night.

Last fiddled with by pegnose on 2016-01-20 at 23:02
pegnose is offline   Reply With Quote
Old 2016-01-20, 23:07   #451
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×67×73 Posts
Default

Quote:
Originally Posted by pegnose View Post
In my case, I have a good cooling solution (and case ;). My HDDs are resyncing for 16 h now, and they are 31°C and 32°C.
You might have missed the point of the article...

Measuring the temperature of the components involved is an averaged and high-latency measurement of the power consumed.

Taking an instantaneous power consumption measurement is a lot more difficult (particularly when Direct Current rather than Alternating Current is involved).
chalsall is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Skylake vs Kabylake ET_ Hardware 17 2017-05-24 16:19
Skylake and RAM scaling mackerel Hardware 34 2016-03-03 19:14
So does skylake-nonXeon actually get us anything? fivemack Hardware 36 2015-09-08 01:42
Skylake processor tha Hardware 7 2015-03-05 23:49
Skylake AVX-512 clarke Software 15 2015-03-04 21:48

All times are UTC. The time now is 20:15.


Fri Aug 6 20:15:37 UTC 2021 up 14 days, 14:44, 1 user, load averages: 3.44, 3.07, 3.05

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.