#12
"Marv"
May 2009
near the Tannhäuser Gate
3·269 Posts
Quote:
From Anandtech: "on paper the new card only has a 9% compute throughput advantage. So it’s not on compute throughput where Radeon VII’s real winning charm lies"
Last fiddled with by tServo on 2019-01-12 at 17:04
#13
"Sam Laur"
Dec 2018
Turku, Finland
317 Posts
"Some guy on Twitter" = Editor in Chief for Anandtech...
And the quote refers to FP32 performance. Later on in the same article though, "The Vega 20 GPU does bring new compute features – particularly much higher FP64 compute throughput and new low-precision modes well-suited for neural network inferencing – but these features aren’t something consumers are likely to use."
Last fiddled with by nomead on 2019-01-12 at 17:54 Reason: added quote
#14
"Composite as Heck"
Oct 2017
2·5²·19 Posts
#15
"Sam Laur"
Dec 2018
Turku, Finland
317 Posts
Awww...
That's it then, unfortunately my interest stopped right there.
#16
"/X\(‘-‘)/X\"
Jan 2013
https://pedan.tech/
110001110000₂ Posts
We'll have to see what it turns out to be. Ryan Smith specifically asked about it.
https://www.reddit.com/r/Amd/comment...apped/ee1jr5k/
#17
Feb 2016
UK
2⁶×7 Posts
#18
"/X\(‘-‘)/X\"
Jan 2013
https://pedan.tech/
2⁴·199 Posts
Quote:
#19
"Mihai Preda"
Apr 2015
5AC₁₆ Posts
Quote:
#20
"Composite as Heck"
Oct 2017
2×5²×19 Posts
Am I right in thinking that DP rate is the bottleneck for Vega 64 but that memory bandwidth comes a close second? Is it as simple as saying that for R7 to roughly match 2x Vega 64 throughput at the same clocks, it needed both double the DP rate and double the bandwidth (ignoring the 4 CU difference)? Are there any potential bottlenecks other than those two? Beyond "higher is better", I don't know how the specs translate into performance.
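The question above can be framed with a simple roofline-style bound: a kernel takes at least as long as the slower of its compute time and its memory time. Here is a minimal Python sketch, assuming normalized units in which the baseline card has a DP rate of 1.0 and a bandwidth of 1.0. The function names and the workload figures (0.9 units of memory time for every 1.0 of compute time) are hypothetical illustrations, not measurements of any card.

```python
def bound_time(flops, bytes_moved, dp_rate, bandwidth):
    """Roofline-style lower bound: the slower of compute and memory decides."""
    return max(flops / dp_rate, bytes_moved / bandwidth)


def speedup(flops, bytes_moved, dp_scale, bw_scale):
    """Speedup over a baseline with dp_rate = bandwidth = 1.0 (normalized)."""
    base = bound_time(flops, bytes_moved, 1.0, 1.0)
    new = bound_time(flops, bytes_moved, dp_scale, bw_scale)
    return base / new


# Hypothetical workload: compute-bound, with memory a close second.
flops, bytes_moved = 1.0, 0.9

print(speedup(flops, bytes_moved, 2.0, 1.0))  # only the DP rate doubled
print(speedup(flops, bytes_moved, 1.0, 2.0))  # only the bandwidth doubled
print(speedup(flops, bytes_moved, 2.0, 2.0))  # both doubled
```

Under these assumed numbers, doubling only the DP rate gives roughly 1.11x because memory becomes the limit, doubling only the bandwidth gives no gain, and doubling both gives 2x. The model ignores latency, occupancy and non-DP instructions, so it is an upper bound on what to expect rather than a prediction.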
#21
"Mihai Preda"
Apr 2015
10110101100₂ Posts
Quote:
About memory, it is my impression that the latency did not improve much, but the bandwidth doubled. To take advantage of this, better occupancy would be required (double the number of memory operations in flight), and this is not easily achievable because of other limiting resources: LDS memory and the number of registers (VGPRs), which I guess remain unchanged.
About compute, the parts that aren't DP (e.g. pointer arithmetic and other integer work such as carry and logic) remain unchanged, and this will reduce the observed speedup.
IMO another limiting factor for GCN performance is still the compiler, after so many years: it does a rather poor job of generating highly efficient code (not an easy task, I agree).
OTOH the better cooling will help, allowing the card to be clocked higher without thermal throttling (which is a problem with the Vega 64 blower cooler).
Last fiddled with by preda on 2019-01-17 at 10:51
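The two effects described above can be put into rough numbers. This is a minimal sketch, assuming Little's law (data in flight = bandwidth × latency) for the memory side and Amdahl's law for the part of the work that is not DP. The function names, the 80% DP fraction, and the latency and bandwidth figures are hypothetical placeholders, not measured values for either card.

```python
def bytes_in_flight(bandwidth_bytes_per_s, latency_s):
    """Little's law: data that must be in flight to saturate the bandwidth."""
    return bandwidth_bytes_per_s * latency_s


def amdahl_speedup(dp_fraction, dp_speedup):
    """Overall speedup when only the DP fraction of the run time gets faster."""
    return 1.0 / ((1.0 - dp_fraction) + dp_fraction / dp_speedup)


# Hypothetical figures, for illustration only.
latency_s = 500e-9   # assumed memory latency, taken as unchanged between cards
old_bw = 500e9       # assumed old bandwidth in bytes/s
new_bw = 2 * old_bw  # bandwidth doubled

# Same latency with double the bandwidth needs double the data in flight.
print(bytes_in_flight(new_bw, latency_s) / bytes_in_flight(old_bw, latency_s))

# If 80% of the run time is DP work and DP gets 2x faster,
# the overall gain stays below 2x.
print(amdahl_speedup(0.8, 2.0))
```

With unchanged latency, the amount of data that has to be in flight scales directly with bandwidth, which is why occupancy limits (LDS, VGPRs) matter. The Amdahl term shows how any non-DP share of the run time caps the benefit of a faster DP rate.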
#22
3·11·199 Posts |
Quote:
I am putting off buying a new, more powerful GPU. Do you have any plans to optimize gpuowl for large numbers?