Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models