Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person   Re-identification

Weizhen He; Yiheng Deng; Yunfeng Yan; Feng Zhu; Yizhou; Wang; Lei Bai; Qingsong Xie; Donglian Qi; Wanli Ouyang and; Shixiang Tang

arXiv:2405.17790·cs.CV·April 30, 2025

Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification

Weizhen He, Yiheng Deng, Yunfeng Yan, Feng Zhu, Yizhou, Wang, Lei Bai, Qingsong Xie, Donglian Qi, Wanli Ouyang and, Shixiang Tang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a universal instruction-guided person re-identification framework, proposing a new task, benchmark, and models that unify and improve upon existing ReID methods across diverse scenarios.

Contribution

It proposes the first general instruct-ReID task, a large-scale OmniReID++ benchmark, and novel models IRM and IRM++ with adaptive triplet loss and memory bank-assisted learning.

Findings

01

IRM and IRM++ outperform existing methods on 10 test sets.

02

The models achieve state-of-the-art performance in diverse ReID scenarios.

03

The benchmark facilitates comprehensive evaluation of instruction-guided ReID methods.

Abstract

Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a novel instruct-ReID task that requires the model to retrieve images according to the given image or language instructions. Instruct-ReID is the first exploration of a general ReID setting, where existing 6 ReID tasks can be viewed as special cases by assigning different instructions. To facilitate research in this new instruct-ReID task, we propose a large-scale OmniReID++ benchmark equipped with diverse data and comprehensive evaluation methods e.g., task specific and task-free evaluation settings. In the task-specific evaluation setting, gallery sets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hwz-zju/instruct-reid
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Face recognition and analysis

MethodsTriplet Loss