mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2019-09-01, 22:33   #111
hansl
 
hansl's Avatar
 
Apr 2019

CD16 Posts
Default

Speaking of system tuning, I saw this video recently (from the "Actually Hardcore Overclocking" channel) which I found interesting regarding getting the most out of memory clocks on Zen 2.

https://www.youtube.com/watch?v=xSAFs-t0DCg

Basically he recommends running FCLK at its max capable rate, (unbound from UCLK) if the difference would be greater than 166MHz:
FCLK - UCLK > 166MHz.

Because you get better bandwidth across CCXs at the expense of some potential latency to RAM.

So with 3000 MT/s RAM that user's UCLK should be at 1500MHz, and according to the video he should be capable of at least 1766 FCLK, so that would be 266MHz difference therefore worth separating them (theoretically). I wonder how well it works for prime95 though.

I don't have a Zen 2 to play with though, only "Zen+" 2000 series.

Last fiddled with by hansl on 2019-09-01 at 22:40
hansl is offline   Reply With Quote
Old 2019-09-02, 07:11   #112
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

1C016 Posts
Default

Haven't watched that particular video, but I had also seen elsewhere suggesting if you have slow(er) ram, running the max IF speed possible can still lead to gains for gaming. I'm not sure it is only inter-CCX communication benefiting from it. When AMD revealed that each CCD only had half the write bandwidth compared to read, that would affect ram also. Unless you run single channel, you're not likely to ever saturate ram with writes on a single CCD CPU as the IF will choke it long before then. Any increase to IF speed will help with that, assuming bandwidth is more limiting than latency.

I haven't tested it myself yet. For now 1800 on IF seems a safe bet, but people aren't getting much further than that. I forget what I saw on one of mine, but I didn't reach 1900.
mackerel is offline   Reply With Quote
Old 2019-09-09, 21:10   #113
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

95010 Posts
Default

Threadripper 3 speculation on splitting into consumer (sTRX4 HEDT) and workstation (sWRX8 workstation) lines:
https://youtu.be/1Xrys7sOr4A?t=55

tl;dr:
  • Consumer TR3 has 4 channels ECC 3200MT/s, is overclockable, 64 PCIe gen4 lanes
  • Workstation TR3 has 8 channels ECC 3200MT/s, is not overclockable, 96-128 PCIe gen4 lanes
  • The difference between workstation TR3 and Epyc 2 is that the former will have more consumer IO and a higher TDP of up to 280W. 280W TDP is a spicy meatball

Threadripper hardware being for consumers means it's easier for us to obtain so that's nice, but I'm not as optimistic as the guy in the video that these workstation SKUs will be a "poor mans Epyc". Core for core the performance should beat the P line of Epyc 2 because of the TDP (unless the bins are much worse which I doubt). It wouldn't surprise me if the price for a workstation TR3 part is between the core equivalent P and non-P Epyc 2 parts. Still it's an interesting development, a 64 core workstation TR3 part if it exists would be top CPU in terms of throughput.
M344587487 is offline   Reply With Quote
Old 2019-09-18, 18:57   #114
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2×52×19 Posts
Default

A new top-end Epyc part has been announced ( https://www.phoronix.com/scan.php?pa...=AMD-EPYC-7H12 ), it's like the 7742 with 64 cores but comes with a higher TDP of 280W and a higher base clock. The 280W TDP is interesting as it matches what is expected of the Threadripper parts, however it makes me even less optimistic about how price-competitive the TR parts will be. Maybe this move indicates TR3 will top out at 48 cores (supply is constrained, it'd be low volume anyway as it'll be hideously expensive for a workstation part and this way zen3 can have a natural progression to 64 cores next year).
M344587487 is offline   Reply With Quote
Old 2019-10-17, 19:40   #115
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2·52·19 Posts
Default

https://www.tomshardware.com/news/am...3750x-105w-cpu

tl;dr 3750X is a potential Zen2 part, many unknowns including if it'll even be for general consumption, would slot between the 3700X and 3800X with a 105W TDP like the 3800X. I'm only bothering to mention it because one of the possible configurations is two quad core chiplets which would have a tasty 64MB of cache.
M344587487 is offline   Reply With Quote
Old 2019-10-18, 09:03   #116
mackerel
 
mackerel's Avatar
 
Feb 2016
UK

26·7 Posts
Default

As interesting a 2 CCD part would be, feels more intuitive for a 3750X to literally sit between 3700X and 3800X. Not that there is much room to squeeze another part in there, but might be another OEM special.

If they had gone 2 CCD, I think that would be more likely a 3850X.
mackerel is offline   Reply With Quote
Old 2019-10-19, 06:56   #117
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2·52·19 Posts
Default

You're probably right about a 1 CCD OEM special. Still, a 2 CCD part would not clock as high as a 3800X all other things equal (and quad core bins are likely much worse than normal reducing clocks further) to the point where the 3750X name might make sense. It may perform more like a poor mans Epyc than a flagship part.


An interesting wrinkle is that the PS5 and next Xbox will have 8 core zen2 CPUs. These may be in a 2 CCD low clock configuration to soak up the low-binned parts just like the PS4 did for the Jaguar cores before. If the console stockpile demand is already being satisfied they may be looking for another outlet.
M344587487 is offline   Reply With Quote
Old 2019-11-01, 18:44   #118
petrw1
1976 Toyota Corona years forever!
 
petrw1's Avatar
 
"Wayne"
Nov 2006
Saskatchewan, Canada

123158 Posts
Default

Quote:
Originally Posted by nomead View Post
All right, one 3900X owner ran the mprime benchmarks for me, same methodology in plotting as before. The 3900X is thus over two times faster than the 3600 within a certain range of FFT sizes, from 5120K to 7680K. 2.45 ms/iter at the wavefront, 5120K... Certainly the effect of the larger L3 cache can be seen. He has 3000 MHz memory in the system, no idea about latency. No system tuning done, so I assume that fclk is just 1500 MHz.
What's on the Y-Axis?
petrw1 is online now   Reply With Quote
Old 2019-11-01, 19:11   #119
nomead
 
nomead's Avatar
 
"Sam Laur"
Dec 2018
Turku, Finland

1001111012 Posts
Default

Quote:
Originally Posted by petrw1 View Post
What's on the Y-Axis?
From message 103 in this thread (that later graph was just an update of that first benchmark, with 3900X results added) :

As a baseline comparison, the lowest graph curve is the now retired Ryzen 3 2200G, four cores, one worker. Speeds normalized by multiplying FFT length (in K) with throughput (iters/sec), then dividing that value by the slowest such result, which happened to be 8064K FFT on the 2200G processor.
nomead is offline   Reply With Quote
Old 2019-11-08, 07:57   #120
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

2·52·19 Posts
Default

The 3950X, 3960X and 3970X are released November 25th (16, 24, 32 cores respectively for $750, $1400, $2000 respectively on platforms AM4, TR40, TR40 respectively). 3960X and 3970X are within a few hundred dollars of comparably specced (wrt cores/cache) Epyc parts, with 128MB of L3 and a 280W TDP that should probably be liquid cooled or underclocked. The TR40 platform is for HEDT enthusiasts and will likely be expensive and high quality across the board, not backwards compatible with TR4.

https://www.anandtech.com/show/15062...19-fall-update
https://videocardz.com/82654/amd-ann...-ryzen-9-3950x
https://www.techpowerup.com/260974/m...s-motherboards
M344587487 is offline   Reply With Quote
Old 2019-11-08, 17:06   #121
M344587487
 
M344587487's Avatar
 
"Composite as Heck"
Oct 2017

16668 Posts
Default

"Eco mode" is an interesting feature that comes with the AM4 1004 bios update. It makes any zen2 105W TDP part stick to a 65W TDP, and any 65W TDP part stick to a 45W TDP. It may be the best and simplest way to underclock for some of our needs, any traditional overclock doesn't do a great deal for zen2 and the same is probably true for an underclock.


I have a feeling it'll help 105W parts fit the efficiency curve better than it helps 65W parts. Worth investigating at any rate.
M344587487 is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
RX470 and RX460 announced VictordeHolland GPU Computing 0 2016-07-30 13:05
Intel Xeon D announced VictordeHolland Hardware 7 2015-03-11 23:26
Factoring details mturpin Information & Answers 4 2013-02-08 02:43
Euler (6,2,5) details. Death Math 10 2011-08-03 13:49
Larrabee instruction set announced fivemack Hardware 0 2009-03-25 12:09

All times are UTC. The time now is 16:39.


Fri Jul 7 16:39:09 UTC 2023 up 323 days, 14:07, 1 user, load averages: 4.46, 3.08, 2.39

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔