Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification
Weizhen He, Yiheng Deng, Yunfeng Yan, Feng Zhu, Yizhou, Wang, Lei Bai, Qingsong Xie, Donglian Qi, Wanli Ouyang and, Shixiang Tang

TL;DR
This paper introduces a universal instruction-guided person re-identification framework, proposing a new task, benchmark, and models that unify and improve upon existing ReID methods across diverse scenarios.
Contribution
It proposes the first general instruct-ReID task, a large-scale OmniReID++ benchmark, and novel models IRM and IRM++ with adaptive triplet loss and memory bank-assisted learning.
Findings
IRM and IRM++ outperform existing methods on 10 test sets.
The models achieve state-of-the-art performance in diverse ReID scenarios.
The benchmark facilitates comprehensive evaluation of instruction-guided ReID methods.
Abstract
Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a novel instruct-ReID task that requires the model to retrieve images according to the given image or language instructions. Instruct-ReID is the first exploration of a general ReID setting, where existing 6 ReID tasks can be viewed as special cases by assigning different instructions. To facilitate research in this new instruct-ReID task, we propose a large-scale OmniReID++ benchmark equipped with diverse data and comprehensive evaluation methods e.g., task specific and task-free evaluation settings. In the task-specific evaluation setting, gallery sets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Face recognition and analysis
MethodsTriplet Loss
