CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational Dataset
Zhiming Wang, Mingze Wang, Sheng Xu, Yanjing Li, Baochang Zhang

TL;DR
This paper introduces CCExpert, a novel model for remote sensing change captioning that leverages difference-aware integration and a large, diverse dataset to significantly improve performance over previous methods.
Contribution
The paper presents a new difference-aware integration module, a large dataset CC-Foundation, and a three-stage training process for enhanced remote sensing change captioning.
Findings
Achieved $S^*_m=81.80$ on LEVIR-CC benchmark, surpassing previous methods.
Constructed a dataset with 200,000 image pairs and 1.2 million captions.
Demonstrated the effectiveness of difference-aware integration in improving captioning accuracy.
Abstract
Remote Sensing Image Change Captioning (RSICC) aims to generate natural language descriptions of surface changes between multi-temporal remote sensing images, detailing the categories, locations, and dynamics of changed objects (e.g., additions or disappearances). Many current methods attempt to leverage the long-sequence understanding and reasoning capabilities of multimodal large language models (MLLMs) for this task. However, without comprehensive data support, these approaches often alter the essential feature transmission pathways of MLLMs, disrupting the intrinsic knowledge within the models and limiting their potential in RSICC. In this paper, we propose a novel model, CCExpert, based on a new, advanced multimodal large model framework. Firstly, we design a difference-aware integration module to capture multi-scale differences between bi-temporal images and incorporate them into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies
