Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation
Ulindu De Silva, Didula Samaraweera, Sasini Wanigathunga, Kavindu, Kariyawasam, Kanchana Ranasinghe, Muzammal Naseer, Ranga Rodrigo

TL;DR
Seg-TTO introduces a test-time optimization framework for open-vocabulary semantic segmentation that significantly improves performance on domain-specific datasets by aligning model parameters with input images using a novel self-supervised objective.
Contribution
It proposes a novel test-time optimization method, Seg-TTO, that enhances zero-shot open-vocabulary segmentation in specialized domains, outperforming existing approaches.
Findings
Up to 27% mIoU improvement on challenging datasets
Effective integration with state-of-the-art OVSS methods
Establishes new state-of-the-art performance in domain-specific OVSS tasks
Abstract
We present Seg-TTO, a novel framework for zero-shot, open-vocabulary semantic segmentation (OVSS), designed to excel in specialized domain tasks. While current open-vocabulary approaches show impressive performance on standard segmentation benchmarks under zero-shot settings, they fall short of supervised counterparts on highly domain-specific datasets. We focus on segmentation-specific test-time optimization to address this gap. Segmentation requires an understanding of multiple concepts within a single image while retaining the locality and spatial structure of representations. We propose a novel self-supervised objective adhering to these requirements and use it to align the model parameters with input images at test time. In the textual modality, we learn multiple embeddings for each category to capture diverse concepts within an image, while in the visual modality, we calculate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Educational Technology and Assessment
MethodsALIGN · Focus
