OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3

Xu Zhang; Danyang Li; Yingjie Xia; Xiaohang Dong; Hualong Yu; Jianye Wang; Qicheng Li

arXiv:2601.13895·cs.CV·April 27, 2026

OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3

Xu Zhang, Danyang Li, Yingjie Xia, Xiaohang Dong, Hualong Yu, Jianye Wang, Qicheng Li

PDF

1 Repo

TL;DR

OmniOVCD introduces a novel framework leveraging SAM 3's integrated segmentation and identification capabilities for open-vocabulary change detection, achieving state-of-the-art results without relying on multiple models.

Contribution

The paper proposes a standalone OVCD framework using SAM 3's decoupled output heads and a fusion strategy, improving accuracy and stability over existing methods.

Findings

01

Achieves state-of-the-art IoU scores on four benchmarks.

02

Effectively fuses semantic, instance, and presence outputs for accurate land-cover masks.

03

Maintains high category recognition accuracy and instance-level consistency.

Abstract

Change Detection (CD) is a fundamental task in remote sensing. It monitors the evolution of land cover over time. Based on this, Open-Vocabulary Change Detection (OVCD) introduces a new requirement. It aims to reduce the reliance on predefined categories. Existing training-free OVCD methods mostly use CLIP to identify categories. These methods also need extra models like DINO to extract features. However, combining different models often causes problems in matching features and makes the system unstable. Recently, the Segment Anything Model 3 (SAM 3) is introduced. It integrates segmentation and identification capabilities within one promptable model, which offers new possibilities for the OVCD task. In this paper, we propose OmniOVCD, a standalone framework designed for OVCD. By leveraging the decoupled output heads of SAM 3, we propose a Synergistic Fusion to Instance Decoupling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Erxucomeon/OmniOVCD
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.