ChangeMinds: Multi-task Framework for Detecting and Describing Changes   in Remote Sensing

Yuduo Wang; Weikang Yu; Michael Kopp; Pedram Ghamisi

arXiv:2410.10047·cs.CV·October 16, 2024·3 cites

ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing

Yuduo Wang, Weikang Yu, Michael Kopp, Pedram Ghamisi

PDF

Open Access 1 Repo

TL;DR

ChangeMinds is a unified multi-task framework that simultaneously performs change detection and change captioning in remote sensing images, leveraging a novel ChangeLSTM module and cross-attention to improve efficiency and accuracy.

Contribution

The paper introduces ChangeMinds, the first end-to-end multi-task model for concurrent change detection and captioning, with a new ChangeLSTM module and cross-attention mechanism.

Findings

01

Outperforms existing methods on LEVIR-MCI and other benchmarks.

02

Effectively captures complex spatiotemporal dynamics for both tasks.

03

Enhances multi-task learning efficiency and accuracy.

Abstract

Recent advancements in Remote Sensing (RS) for Change Detection (CD) and Change Captioning (CC) have seen substantial success by adopting deep learning techniques. Despite these advances, existing methods often handle CD and CC tasks independently, leading to inefficiencies from the absence of synergistic processing. In this paper, we present ChangeMinds, a novel unified multi-task framework that concurrently optimizes CD and CC processes within a single, end-to-end model. We propose the change-aware long short-term memory module (ChangeLSTM) to effectively capture complex spatiotemporal dynamics from extracted bi-temporal deep features, enabling the generation of universal change-aware representations that effectively serve both CC and CD tasks. Furthermore, we introduce a multi-task predictor with a cross-attention mechanism that enhances the interaction between image and text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

y-d-wang/changeminds
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeographic Information Systems Studies