MemOVCD: Training-Free Open-Vocabulary Change Detection via Cross-Temporal Memory Reasoning and Global-Local Adaptive Rectification

Zuzheng Kuang; Honghao Chang; Boqiang Liang; Haoqian Wang; Lijun He; Fan Li; Haixia Bi

arXiv:2604.26774·cs.CV·April 30, 2026

MemOVCD: Training-Free Open-Vocabulary Change Detection via Cross-Temporal Memory Reasoning and Global-Local Adaptive Rectification

Zuzheng Kuang, Honghao Chang, Boqiang Liang, Haoqian Wang, Lijun He, Fan Li, Haixia Bi

PDF

TL;DR

MemOVCD introduces a training-free framework for open-vocabulary change detection in remote sensing images, leveraging cross-temporal memory reasoning and adaptive rectification to improve semantic change identification.

Contribution

It reformulates change detection as a two-frame tracking problem and employs novel memory propagation and rectification strategies for enhanced performance.

Findings

01

Achieves favorable results on five benchmark datasets.

02

Effectively distinguishes genuine semantic changes from appearance discrepancies.

03

Demonstrates strong generalization in diverse open-vocabulary scenarios.

Abstract

Open-vocabulary change detection aims to identify semantic changes in bi-temporal remote sensing images without predefined categories. Recent methods combine foundation models such as SAM, DINO and CLIP, but typically process each timestamp independently or interact only at the final comparison stage. Such paradigms suffer from insufficient temporal coupling during semantic reasoning, which limits their ability to distinguish genuine semantic changes from non-semantic appearance discrepancies. In addition, patch-dominant inference on high-resolution images often weakens global semantic continuity and produces fragmented change regions. To address these issues, we propose MemOVCD, a training-free open-vocabulary change detection framework based on cross-temporal memory reasoning and global-local adaptive rectification. Specifically, we reformulate bi-temporal change detection as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.