Loading paper
Global-Local Similarity for Efficient Fine-Grained Image Recognition with Vision Transformers | Tomesphere