Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared   Person Re-Identification

Zhipeng Huang; Jiawei Liu; Liang Li; Kecheng Zheng; Zheng-Jun Zha

arXiv:2203.01735·cs.CV·March 17, 2022

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification

Zhipeng Huang, Jiawei Liu, Liang Li, Kecheng Zheng, Zheng-Jun Zha

PDF

1 Video

TL;DR

This paper introduces a novel approach combining modality-adaptive mixup and invariant decomposition to improve RGB-infrared person re-identification by learning modality-invariant features and reducing modality discrepancy.

Contribution

It proposes a new MID framework that uses reinforcement learning for adaptive image mixing and convolution decomposition for invariant feature learning, advancing cross-modality re-identification.

Findings

01

Achieves superior performance on benchmark datasets.

02

Effectively reduces modality discrepancy.

03

Outperforms state-of-the-art methods.

Abstract

RGB-infrared person re-identification is an emerging cross-modality re-identification task, which is very challenging due to significant modality discrepancy between RGB and infrared images. In this work, we propose a novel modality-adaptive mixup and invariant decomposition (MID) approach for RGB-infrared person re-identification towards learning modality-invariant and discriminative representations. MID designs a modality-adaptive mixup scheme to generate suitable mixed modality images between RGB and infrared images for mitigating the inherent modality discrepancy at the pixel-level. It formulates modality mixup procedure as Markov decision process, where an actor-critic agent learns dynamical and local linear interpolation policy between different regions of cross-modality images under a deep reinforcement learning framework. Such policy guarantees modality-invariance in a more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification· underline

Taxonomy

MethodsMixup · Convolution