TL;DR
This paper introduces a novel approach combining modality-adaptive mixup and invariant decomposition to improve RGB-infrared person re-identification by learning modality-invariant features and reducing modality discrepancy.
Contribution
It proposes a new MID framework that uses reinforcement learning for adaptive image mixing and convolution decomposition for invariant feature learning, advancing cross-modality re-identification.
Findings
Achieves superior performance on benchmark datasets.
Effectively reduces modality discrepancy.
Outperforms state-of-the-art methods.
Abstract
RGB-infrared person re-identification is an emerging cross-modality re-identification task, which is very challenging due to significant modality discrepancy between RGB and infrared images. In this work, we propose a novel modality-adaptive mixup and invariant decomposition (MID) approach for RGB-infrared person re-identification towards learning modality-invariant and discriminative representations. MID designs a modality-adaptive mixup scheme to generate suitable mixed modality images between RGB and infrared images for mitigating the inherent modality discrepancy at the pixel-level. It formulates modality mixup procedure as Markov decision process, where an actor-critic agent learns dynamical and local linear interpolation policy between different regions of cross-modality images under a deep reinforcement learning framework. Such policy guarantees modality-invariance in a more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
MethodsMixup · Convolution
