Looking Locally: Object-Centric Vision Transformers as Foundation Models for Efficient Segmentation