Loading paper
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers | Tomesphere