BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification

Haoxuan Xu; Guanglin Niu

arXiv:2603.14243·cs.CV·March 17, 2026

BIT: Matching-based Bi-directional Interaction Transformation Network for Visible-Infrared Person Re-Identification

Haoxuan Xu, Guanglin Niu

PDF

Open Access

TL;DR

This paper introduces BIT, a novel matching-based bi-directional interaction network for visible-infrared person re-identification, effectively modeling pairwise modality interactions to improve retrieval accuracy under challenging conditions.

Contribution

The paper proposes the first pairwise matching-driven interaction approach for VI-ReID, utilizing an encoder-decoder architecture for explicit cross-modality feature interaction.

Findings

01

Achieves state-of-the-art performance on multiple VI-ReID benchmarks.

02

Effectively models complex modality correlations under distribution shifts.

03

Demonstrates robustness with fewer infrared samples.

Abstract

Visible-Infrared Person Re-Identification (VI-ReID) is a challenging retrieval task due to the substantial modality gap between visible and infrared images. While existing methods attempt to bridge this gap by learning modality-invariant features within a shared embedding space, they often overlook the complex and implicit correlations between modalities. This limitation becomes more severe under distribution shifts, where infrared samples are often far fewer than visible ones. To address these challenges, we propose a novel network termed Bi-directional Interaction Transformation (BIT). Instead of relying on rigid feature alignment, BIT adopts a matching-based strategy that explicitly models the interaction between visible and infrared image pairs. Specifically, BIT employs an encoder-decoder architecture where the encoder extracts preliminary feature representations, and the decoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Face recognition and analysis