Continual Panoptic Perception: Towards Multi-modal Incremental   Interpretation of Remote Sensing Images

Bo Yuan; Danpei Zhao; Zhuoran Liu; Wentao Li; Tian Li

arXiv:2407.14242·cs.CV·November 22, 2024

Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images

Bo Yuan, Danpei Zhao, Zhuoran Liu, Wentao Li, Tian Li

PDF

1 Repo

TL;DR

This paper introduces a continual learning model for remote sensing images that integrates multi-task perception, cross-modal feature extraction, and knowledge distillation to improve interpretation accuracy and reduce forgetting.

Contribution

It proposes a unified multi-modal continual learning framework with a collaborative encoder and task-interactive knowledge distillation for remote sensing.

Findings

01

Over 13% improvement in panoptic quality with joint optimization

02

Effective mitigation of catastrophic forgetting in multi-task remote sensing

03

Validated on a fine-grained panoptic perception dataset

Abstract

Continual learning (CL) breaks off the one-way training manner and enables a model to adapt to new data, semantics and tasks continuously. However, current CL methods mainly focus on single tasks. Besides, CL models are plagued by catastrophic forgetting and semantic drift since the lack of old data, which often occurs in remote-sensing interpretation due to the intricate fine-grained semantics. In this paper, we propose Continual Panoptic Perception (CPP), a unified continual learning model that leverages multi-task joint learning covering pixel-level classification, instance-level segmentation and image-level perception for universal interpretation in remote sensing images. Concretely, we propose a collaborative cross-modal encoder (CCE) to extract the input image features, which supports pixel classification and caption generation synchronously. To inherit the knowledge from the old…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YBIO/CPP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus · Knowledge Distillation