Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation