Loading paper
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs | Tomesphere