A Geometric Analysis of Sign-Magnitude Asymmetry in a ReLU + RMSNorm Block under Ternary Quantization
Lei Dong

TL;DR
This paper provides a geometric explanation for sign-magnitude asymmetry in ReLU + RMSNorm models under ternary quantization, revealing how sign flips affect output energy and model sensitivity.
Contribution
It introduces a sign-magnitude decomposition analysis explaining the observed asymmetry and quantization error effects in pre-norm Transformer models.
Findings
Sign-flips produce 2.75 times more transverse energy than sign-preserving perturbations as flip rate approaches zero.
ReLU approximately preserves ternary quantization error, making it transparent to sign-magnitude perturbations.
Experimental results on TinyLlama-1.1B confirm theoretical predictions about sign sensitivity and outlier effects.
Abstract
Pre-norm Transformers with RMSNorm tolerate ternary {-1,0,+1} weight quantization with surprisingly small loss (Ma et al., 2024). We give a geometric explanation via sign-magnitude decomposition of weight perturbations. In a two-layer ReLU + RMSNorm model with i.i.d. Gaussian weights, sign-flips produce times more transverse output energy than sign-preserving magnitude perturbations of equal Frobenius norm, as the flip rate (Theorem 3). The mechanism: ReLU creates a hidden-space directional asymmetry between the two perturbation types, which RMSNorm's transverse-projection Fr\'echet derivative selectively exposes. Sign-quantization error is itself a sign-preserving perturbation with angular alignment (Theorem 4); its post-ReLU radial fraction () matches the pre-ReLU value within , so ReLU is approximately…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
