Loading paper
Recovering single precision accuracy from Tensor Cores while surpassing the FP32 theoretical peak performance | Tomesphere