CKDA: Cross-modality Knowledge Disentanglement and Alignment for Visible-Infrared Lifelong Person Re-identification
Zhenyu Cui, Jiahuan Zhou, Yuxin Peng

TL;DR
This paper introduces CKDA, a novel method for lifelong person re-identification across visible and infrared modalities, which disentangles and aligns cross-modal knowledge to improve continual learning performance.
Contribution
It proposes a cross-modality knowledge disentanglement and alignment framework with dedicated modules to separate and align modality-specific and common knowledge, reducing interference and forgetting.
Findings
Outperforms state-of-the-art methods on four benchmark datasets.
Effectively disentangles modality-specific and common knowledge.
Reduces catastrophic forgetting in lifelong cross-modal person Re-ID.
Abstract
Lifelong person Re-IDentification (LReID) aims to match the same person employing continuously collected individual data from different scenarios. To achieve continuous all-day person matching across day and night, Visible-Infrared Lifelong person Re-IDentification (VI-LReID) focuses on sequential training on data from visible and infrared modalities and pursues average performance over all data. To this end, existing methods typically exploit cross-modal knowledge distillation to alleviate the catastrophic forgetting of old knowledge. However, these methods ignore the mutual interference of modality-specific knowledge acquisition and modality-common knowledge anti-forgetting, where conflicting knowledge leads to collaborative forgetting. To address the above problems, this paper proposes a Cross-modality Knowledge Disentanglement and Alignment method, called CKDA, which explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition
