![]() |
|
|
#12 |
|
Aug 2002
Termonfeckin, IE
22×691 Posts |
Usual crap. Eliminate the variables one at a time:
1. Memory 2. Heat 3. Overclock if any stressing either of the above or the FSB. 4. Motherboard 5. CPU Unfortunately, there is no short cut. |
|
|
|
|
|
#13 | |
|
(loop (#_fork))
Feb 2006
Cambridge, England
2×132×19 Posts |
Quote:
I'm not sure what else I can do without spending money; there's a slight temptation to buy a chunkier cooler for the CPU, but the temperatures I'm measuring on the CPU on the unreliable machine are no different from the ones I measure on a similar system (same CPU, different motherboard, less memory) that works. |
|
|
|
|
|
|
#14 |
|
Oct 2004
Austria
248210 Posts |
As I have heared, memtest does not find every memory issue.
Does a Prime95 torture test fail on this computer? 1.) Try for example to build the computer with only one of the memory sticks and then run the torture test for 24 or 48 hours (or run a msieve postprocessing which fits into the memory), then change the memory sticks and run the torture test (or threaded msieve linear algebra) again, etc., until you have found the faulty memory. (If it fails with (nearly) all memory sticks, it is most propably not a memory issue.) You say that threaded linear algebra fails after 24-48 hours, so propably running a memory test (either memtest or prime95 torture test or something else) only overnight might not be sufficient to detect a memory issue. You can also put the memory sticks into a known reliable computer and test if it fails now. 2.) What are your CPU- and motherboard temperatures (idle and stressed)? 3.) should be no issue as you say that the box is not OC'ed. 4.) I don't know how tho test this other than eliminating variables 1, 2, 3 and 5 5.) You can for example put the cpu onto an other motherboard and test if torture test or threaded msieve fails. Last fiddled with by Andi47 on 2008-01-23 at 13:54 |
|
|
|
|
|
#15 |
|
Aug 2002
Termonfeckin, IE
1010110011002 Posts |
Like Andi47 said above...
There is no OC so that is ruled out. Do tell us what the temps on both machines are. Also, try and swap CPUs between the reliable and unreliable machine so that you can tell if the CPU itself is at fault. |
|
|
|
|
|
#16 |
|
Sep 2002
Database er0rr
72538 Posts |
Additionally, the PSU (power supply unit) ought to be checked out. With all that memory, plus devices, the PSU might just not be up to it. I suggest you swap PSUs and re-check.
Last fiddled with by paulunderwood on 2008-01-24 at 14:13 |
|
|
|
|
|
#17 |
|
Oct 2004
Austria
46628 Posts |
@Fivemack: What are the Voltages (VCore, etc.) of your PC?
Last fiddled with by Andi47 on 2008-01-24 at 17:09 |
|
|
|
|
|
#18 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
2·132·19 Posts |
I've looked at all the BIOS options, and found something about memory timing which was set to 'Turbo'; I reset it to a more conservative level, and have managed to run one thread of msieve for 96 hours without hitting issues.
That seems to make sense to explain the symptoms - it is likely that 'Turbo' is an option which doesn't take account of the rather large memory loading in an 8GB system. |
|
|
|
|
|
#19 | |
|
Oct 2004
Austria
2×17×73 Posts |
Quote:
Has one-threaded msieve also failed with the Turbo-option on? |
|
|
|
|
|
|
#20 |
|
Aug 2002
Termonfeckin, IE
1010110011002 Posts |
While Turbo may be part of the problem, I would advise you to dig deeper and figure out which component the Turbo setting was stressing. That component is probably vulnerable (unless Turbo was really screwing some settings) and more likely to fail soon.
|
|
|
|
|
|
#21 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
2×132×19 Posts |
Yes, single-threaded msieve had failed in the past with Turbo on.
On the other hand, a four-threaded msieve with Turbo off has just failed. The particularly tiresome bit in all this is that the fast machine is the NFS server to a small farm of headless machines, and so they can't get anything done while I'm fiddling with the fast machine, and having acquired the farm I feel peculiarly distressed when it's not working efficiently. |
|
|
|
|
|
#22 |
|
(loop (#_fork))
Feb 2006
Cambridge, England
2×132×19 Posts |
Output from 'sensors' on the unreliable machine
Core temperatures +72, +69, +70, +71 C Voltages 1.18V, 1.74V, 3.38V, 3.02V, 1.38V, 0V, 0.08V, 3.07V, 3.10V fan 1991RPM case temperatures +42, +57, -2 (I presume this last one is an error) Output from 'sensors' on the reliable machine running four threads of linear algebra: Core temperatures +59C, +52C, +58C, +52C Maybe I should go and buy some thermal grease and a 500W PSU; it's quite possible that the CPU fan isn't well-installed on the unreliable machine. |
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Good air-cooler good enough for overclocked i7-5820K | RienS | Hardware | 17 | 2014-11-18 22:58 |
| GPU placement on motherboard | TheMawn | GPU Computing | 7 | 2013-08-17 04:43 |
| Motherboard for AMD FX8350 | fivemack | Hardware | 5 | 2012-10-23 20:56 |
| Help choosing motherboard please. | Flatlander | GPU Computing | 4 | 2011-01-26 08:15 |
| Motherboard Selection Help | jugbugs | Hardware | 13 | 2004-06-04 15:59 |