Aggregating Global Features into Local Vision Transformer