MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification

Yingying Feng; Jie Li; Jie Hu; Yukang Zhang; Lei Tan; Jiayi Ji

arXiv:2510.23301·cs.CV·January 14, 2026

MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification

Yingying Feng, Jie Li, Jie Hu, Yukang Zhang, Lei Tan, Jiayi Ji

PDF

TL;DR

MDReID introduces a novel framework for multi-modal object re-identification that effectively handles both modality-matched and mismatched scenarios by decomposing features into shared and specific components and employing tailored metric learning.

Contribution

The paper proposes MDReID, a new approach that decomposes modality features and uses specialized metric learning to improve robustness in multi-modal ReID tasks.

Findings

01

Achieves significant mAP improvements on three benchmarks.

02

Effectively handles modality-mismatched scenarios.

03

Outperforms existing methods in multi-modal ReID.

Abstract

Real-world object re-identification (ReID) systems often face modality inconsistencies, where query and gallery images come from different sensors (e.g., RGB, NIR, TIR). However, most existing methods assume modality-matched conditions, which limits their robustness and scalability in practical applications. To address this challenge, we propose MDReID, a flexible any-to-any image-level ReID framework designed to operate under both modality-matched and modality-mismatched scenarios. MDReID builds on the insight that modality information can be decomposed into two components: modality-shared features that are predictable and transferable, and modality-specific features that capture unique, modality-dependent characteristics. To effectively leverage this, MDReID introduces two key components: the Modality Decoupling Learning (MDL) and Modality-aware Metric Learning (MML). Specifically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.