ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation