Loading paper
ViT-P: Rethinking Data-efficient Vision Transformers from Locality | Tomesphere