Thinking Before Matching: A Reinforcement Reasoning Paradigm Towards General Person Re-Identification

Quan Zhang; Jingze Wu; Jialong Wang; Xiaohua Xie; Jianhuang Lai; Hongbo Chen

arXiv:2604.19218·cs.CV·April 22, 2026

Thinking Before Matching: A Reinforcement Reasoning Paradigm Towards General Person Re-Identification

Quan Zhang, Jingze Wu, Jialong Wang, Xiaohua Xie, Jianhuang Lai, Hongbo Chen

PDF

TL;DR

This paper introduces ReID-R, a reasoning-driven paradigm for person re-identification that enhances identity understanding and generalization through a two-stage reinforcement reasoning approach, achieving competitive results with less data.

Contribution

ReID-R is a novel reasoning-based framework that explicitly incorporates chain-of-thought reasoning and reinforcement learning to improve person re-identification performance and interpretability.

Findings

01

ReID-R achieves competitive accuracy with only 14.3K data points.

02

The method outperforms some existing approaches on multiple benchmarks.

03

ReID-R provides high-quality interpretability of its reasoning process.

Abstract

Learning identity-discriminative representations with multi-scene generality has become a critical objective in person re-identification (ReID). However, mainstream perception-driven paradigms tend to identify fitting from massive annotated data rather than identity-causal cues understanding, which presents a fragile representation against multiple disruptions. In this work, ReID-R is proposed as a novel reasoning-driven paradigm that achieves explicit identity understanding and reasoning by incorporating chain-of-thought into the ReID pipeline. Specifically, ReID-R consists of a two-stage contribution: (i) Discriminative reasoning warm-up, where a model is trained in a CoT label-free manner to acquire identity-aware feature understanding; and (ii) Efficient reinforcement learning, which proposes a non-trivial sampling to construct scene-generalizable data. On this basis, ReID-R…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.