Geometric Limits of Knowledge Distillation: A Minimum-Width Theorem via Superposition Theory