View Single Post
Old 2021-07-30, 21:56   #43
frmky
 
frmky's Avatar
 
Jul 2003
So Cal

2·7·157 Posts
Default

Quote:
Originally Posted by frmky View Post
nVidia Tesla V100 with now old CUDA code that only supports 64-bit vectors
1h 26m
53 minutes after many code changes.
After reimplementing the CUDA SpMV with CUB, the Tesla V100 now takes 36 minutes.
frmky is offline   Reply With Quote