DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers