How to Write a Fast Matrix Multiplication from Scratch with Tensor Cores (2024)

(alexarmbr.github.io)

105 points | by skidrow 12 hours ago

10 comments