mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2019-12-19, 20:47   #45
Xyzzy
 
Xyzzy's Avatar
 
Aug 2002

7·1,237 Posts
Default

We don't know!



When you are doing big memory calculations for weeks/months isn't a bit of extra reliability worth the cost?
Xyzzy is offline   Reply With Quote
Old 2019-12-20, 01:15   #46
axn
 
axn's Avatar
 
Jun 2003

23·683 Posts
Default

Quote:
Originally Posted by Xyzzy View Post
PPS - ATX or mATX AKA 64GB or 128GB?
Future proof = 128GB.
axn is offline   Reply With Quote
Old 2019-12-20, 01:53   #47
masser
 
masser's Avatar
 
Jul 2003
Behind BB

2·7·11·13 Posts
Default

Quote:
Originally Posted by Xyzzy View Post
We don't know!



When you are doing big memory calculations for weeks/months isn't a bit of extra reliability worth the cost?
I don't know either, but here's a link on the topic.
masser is online now   Reply With Quote
Old 2019-12-20, 02:22   #48
Xyzzy
 
Xyzzy's Avatar
 
Aug 2002

7·1,237 Posts
Default

From that link's comment section:

Quote:
I think this depends on the kind of computation you’re doing. Statistics I’ve heard on soft-errors of this kind are on the order of 1 bit error per terabit-year. If you’re building a cluster that’s going to crunch on a single problem for months at a time, where one bit error during the calculation will destroy months of work, and where multiple terabytes of RAM are in use on that calculation, ECC is absolutely worth it. That’s why getting ECC memory support into nVidia’s GPUs was such a big deal for projects like the Titan supercomputer, which has 693.5TiB of RAM running 24/7.

For web server applications like this where the worst thing that happens is one page loads incorrectly, I wouldn’t worry about it so much.
Xyzzy is offline   Reply With Quote
Old 2019-12-20, 13:15   #49
Xyzzy
 
Xyzzy's Avatar
 
Aug 2002

7·1,237 Posts
Default

The CPU arrived yesterday.

The packaging is ridiculous and wasteful.

Note that this is just the CPU. There isn't a fan in that package!

Attached Thumbnails
Click image for larger version

Name:	packaging.jpg
Views:	286
Size:	187.9 KB
ID:	21455  
Xyzzy is offline   Reply With Quote
Old 2019-12-26, 00:45   #50
Xyzzy
 
Xyzzy's Avatar
 
Aug 2002

7·1,237 Posts
Default

We ordered some more parts today. We decided to go with the ATX motherboard.

Have:
1 CPU: AMD Threadripper 1920X 3.5 GHz 12-Core Processor ($199.99 @ Amazon)

Ordered:
1 CPU Cooler: Noctua NH-U14S TR4-SP3 82.52 CFM CPU Cooler ($79.90 @ Amazon)
1 Case: Corsair 275Q ATX Mid Tower Case ($99.99 @ Best Buy)
1 Power Supply: SeaSonic PRIME Ultra Platinum 550 W 80+ Platinum Certified Fully Modular ATX Power Supply ($112.98 @ Newegg)

Need to order:
1 Motherboard: ASRock X399 Taichi ATX sTR4 Motherboard ($319.99 @ Newegg)
8 Memory: Crucial 16 GB (1 x 16 GB) DDR4-2666 Memory ($799.92 @ Newegg)
1 Storage: Crucial P1 1 TB M.2-2280 NVME Solid State Drive ($99.99 @ Newegg)

Notes:
The CPU cooler should have 5mm clearance in the case we have chosen.
From a prior build we have a spare Seasonic 4+4 CPU power cable that will be needed. The ASRock motherboard takes a 4+4 and a 4! (See page 7 of PDF: #2 and #9)

Questions:
For NFS work, should the data be on a separate SSD?
Should we buy all 8 memory sticks ($$$) at one time for compatibility?
ECC?
-->$67 × 8 = $536 Crucial 16GB DDR4-3200 UDIMM
-->$111 × 8 = $888 Crucial 16GB DDR4-2666 VLP ECC UDIMM

Attached Thumbnails
Click image for larger version

Name:	cooler.jpg
Views:	230
Size:	44.6 KB
ID:	21486  
Attached Files
File Type: pdf X399 Taichi.pdf (2.09 MB, 2412 views)
Xyzzy is offline   Reply With Quote
Old 2019-12-26, 01:13   #51
VBCurtis
 
VBCurtis's Avatar
 
"Curtis"
Feb 2005
Riverside, CA

2·2,927 Posts
Default

Quote:
Originally Posted by Xyzzy View Post
Questions:
For NFS work, should the data be on a separate SSD?
Should we buy all 8 memory sticks ($$$) at one time for compatibility?
ECC?
-->$67 × 8 = $536 Crucial 16GB DDR4-3200 UDIMM
-->$111 × 8 = $888 Crucial 16GB DDR4-2666 VLP ECC UDIMM
I don't see any reason for a separate SSD for NFS data. I also would choose to spend $300 more on a more-core CPU over $300 extra for ECC memory. I don't run into matrix-fail errors on my non-ECC machine that ever makes me think "gee, if only I had ECC, I wouldn't be repeating this 10-day calculation"... but even if I did repeat once such calculation every 50 jobs, I wouldn't come out ahead on ECC memory versus adding RAM to my previous machine so it could solve big matrices too.
In your case, the CPU is already chosen, so the $300 can just stay in your pocket. That said, if your plan is to help Greg with the 16e queue, your jobs will be 2-5 months in length; while I've solved exactly one matrix of that size so I don't have a basis to judge, I still wouldn't spring for ECC. If you're going to join us with the 15e queue, your jobs are a few weeks each and trivial to repeat in the super-rare case something fails.
VBCurtis is online now   Reply With Quote
Old 2019-12-26, 02:43   #52
retina
Undefined
 
retina's Avatar
 
"The unspeakable one"
Jun 2006
My evil lair

11010100010012 Posts
Default ECC memory

IMO reliability and dependability trumps a modest amount of extra cost. If my system says 63 hours to completion of the current job, then I have piece of mind that it will finish in 63 hours and give an accurate result.

Running something again because it failed due to some transient error is frustrating, annoying and stressful. So the cost of undependable gear is a combination of money for extra electricity, missed deadlines and my sanity. Compared to a small amount of extra cash upfront it is much cheaper to have ECC memory. IMO of course.

ETA: Plus with non-ECC memory, when your matrix fails to solve correctly is it because your have bad sieve data, or you set the parameters wrongly, or should you just try again because it might have been a memory failure. I'd much rather eliminate the last option immediately and know it is something else that I need to fix.

Last fiddled with by retina on 2019-12-26 at 02:48
retina is offline   Reply With Quote
Old 2019-12-26, 18:12   #53
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×112×47 Posts
Default

Quote:
Originally Posted by VBCurtis View Post
I don't see any reason for a separate SSD for NFS data.
Just to share, I'm soon going to be taking delivery of a Sabrent 256GB Rocket NVMe PCIe M.2 2280 Internal SSD.

I'm going to run an experiment, and use most of it for a swap partition. I sometimes want to do Blender rendering jobs which won't fit in my main workstation's RAM, so I have to spin up a "cloud" instance.

I know that many people say that using an SSD for swap is ill-advised because of "wear", but I think the technology has advanced enough that it should work for long enough. Heck, these things are cheap enough that if I have to replace it every year or so, no big deal.

I'm also going to "underprovision" the partitions, to allow the device to have additional blocks to migrate to when needed. (Yes, I also know that this is no longer generally needed, as most modern SSDs already have a relatively large amount of spare blocks already available, but it wouldn't hurt.)

Really looking forward to seeing how it works out.

Has anyone else here tried doing something like this? If so, what was your experience? Any advice?
chalsall is offline   Reply With Quote
Old 2019-12-26, 19:42   #54
PhilF
 
PhilF's Avatar
 
"6800 descendent"
Feb 2005
Colorado

32×83 Posts
Default

I read a report somewhere about an outfit trying to make SSD's fail. They constantly read and write data to them, 24/7/365. A few failed early, a few failed around the time they were supposed to, but the vast majority are still going strong long after their expected lifetime.

There are many different underlying SSD technologies, so your mileage may vary. I don't baby them in any way and have never had a problem.

That said, there's an aspect to them that most aren't aware of. That is long term retention. They are not well suited to store a bunch of info on, then put away in a drawer un-powered for years.
PhilF is offline   Reply With Quote
Old 2019-12-26, 19:53   #55
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

261568 Posts
Default

Quote:
Originally Posted by PhilF View Post
They are not well suited to store a bunch of info on, then put away in a drawer un-powered for years.
Thanks for your above; encouraging.

And I did know about the long-term retention issue. But then, frankly, I've had the same problem with SeaCrap drives. Seriously -- I have had several that I used for "cold storage", and even though they sealed in an antistatic bag (in a safe), they still did the "click-of-death" thing when I tried to bring them online after a couple of years.

Fortunately, I always have redundancy in my backups. And I haven't purchased a SeaCrap drive for several years now.
chalsall is offline   Reply With Quote
Reply

Thread Tools


All times are UTC. The time now is 16:38.


Fri Jul 7 16:38:32 UTC 2023 up 323 days, 14:07, 1 user, load averages: 3.40, 2.78, 2.27

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔