Loading paper
Efficient Visual Transformer by Learnable Token Merging | Tomesphere