Loading paper
Win-Win: Training High-Resolution Vision Transformers from Two Windows | Tomesphere