Loading paper
Hardware-Oriented Approximations of Softmax and RMSNorm for Efficient Transformer Inference | Tomesphere