2020-10-15, 16:26   #3
M344587487
 
 
"Composite as Heck"
Oct 2017

3×263 Posts

Got FlopsCL working by lowering THREADS_PER_BLOCK from its default of 1024 to 256. Averaged over several runs, it reports:
Code:
[float  ]1410.088 GFlop/s
[float2 ]1416.99 GFlop/s
[float4 ]1424.106 GFlop/s
[float8 ]1427.09 GFlop/s
[float16]1427.668 GFlop/s
[double  ]89.354 GFlop/s
[double2 ]89.364 GFlop/s
[double4 ]89.39 GFlop/s
[double8 ]89.39 GFlop/s
[double16]89.39 GFlop/s
Does this look right? I think the theoretical peak should be 1433.6 GFlop/s SP and 89.6 GFlop/s DP, based on the formula below and a 1:16 DP ratio, but I could be wrong:
Code:
shader_units * speed_MHz * instructions_per_clock / 1000
= 448 * 1600 * 2 / 1000
= 1433.6 GFlop/s
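For anyone who wants to redo the arithmetic, here is a minimal Python sketch of the formula above, compared against the float16 and double16 averages from my runs. The function name `peak_gflops` and the variable names are made up for illustration, and it assumes the clock is given in MHz and that the factor of 2 is one fused multiply-add per shader per clock.

```python
def peak_gflops(shader_units, clock_mhz, flops_per_clock=2):
    """Theoretical peak in GFlop/s (hypothetical helper, not part of FlopsCL).

    Assumes clock_mhz is in MHz, hence the division by 1000,
    and that each shader retires one FMA (2 flops) per clock.
    """
    return shader_units * clock_mhz * flops_per_clock / 1000.0


# Figures from the post: 448 shaders at 1600 MHz, assumed 1:16 DP ratio.
sp_peak = peak_gflops(448, 1600)
dp_peak = sp_peak / 16

# Measured averages from the FlopsCL output above (float16 / double16 rows).
measured_sp = 1427.668
measured_dp = 89.39

# Fraction of the theoretical peak that FlopsCL actually reached.
sp_efficiency = measured_sp / sp_peak
dp_efficiency = measured_dp / dp_peak

print(f"SP peak {sp_peak:.1f} GFlop/s, measured {measured_sp} ({sp_efficiency:.1%})")
print(f"DP peak {dp_peak:.1f} GFlop/s, measured {measured_dp} ({dp_efficiency:.1%})")
```

If the formula and the 1:16 assumption hold, both measured figures land within about half a percent of peak, which is what makes me think FlopsCL is a fair measure.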
Assuming the above is right and FlopsCL reflects real GFlop/s fairly accurately, can someone with a Radeon VII run FlopsCL to determine once and for all what that card's DP ratio is? There was some question as to whether the ratio is 1:4 as advertised or whether the card got the full 1:2 treatment; I don't think there was ever a definitive answer. FlopsCL is here if you want to try it: http://olab.is.s.u-tokyo.ac.jp/~kami.../projects.html
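If you do run it, a rough way to read the DP ratio off the results is to divide the best SP figure by the best DP figure and snap to the nearest power of two. This is only a sketch: `infer_dp_ratio` is a name I made up, and it assumes both numbers come from the same card at the same clocks.

```python
import math


def infer_dp_ratio(sp_gflops, dp_gflops):
    """Return (raw SP/DP quotient, nearest power-of-two ratio).

    Hypothetical helper: assumes the hardware ratio is a power of two
    (1:2, 1:4, 1:16, ...) and that both figures are from the same run setup.
    """
    raw = sp_gflops / dp_gflops
    nearest = 2 ** round(math.log2(raw))
    return raw, nearest


# Using the float16 / double16 averages from my card as a sanity check.
raw, nearest = infer_dp_ratio(1427.668, 89.39)
print(f"SP/DP = {raw:.2f}, so the DP ratio looks like 1:{nearest}")
```

On my numbers this comes out at 1:16, matching what I assumed, so the same check on Radeon VII output should separate 1:4 from 1:2 cleanly.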