Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval
Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang,, Rongrong Ji

TL;DR
The paper introduces Beat, a bi-directional one-to-many embedding alignment method that improves text-based person retrieval by effectively handling modal differences and one-to-many relationships, achieving state-of-the-art results.
Contribution
It proposes a novel bi-directional one-to-many embedding paradigm and a corresponding model that enhances alignment between text and images without additional trainable parameters.
Findings
Achieves state-of-the-art on CUHK-PEDES, ICFG-PEDES, and RSTPReID datasets.
Demonstrates effectiveness on additional datasets like MS-COCO, CUB, and Flowers.
Addresses modal differences and one-to-many relationships in TPR.
Abstract
Text-based person retrieval (TPR) is a challenging task that involves retrieving a specific individual based on a textual description. Despite considerable efforts to bridge the gap between vision and language, the significant differences between these modalities continue to pose a challenge. Previous methods have attempted to align text and image samples in a modal-shared space, but they face uncertainties in optimization directions due to the movable features of both modalities and the failure to account for one-to-many relationships of image-text pairs in TPR datasets. To address this issue, we propose an effective bi-directional one-to-many embedding paradigm that offers a clear optimization direction for each sample, thus mitigating the optimization problem. Additionally, this embedding scheme generates multiple features for each sample without introducing trainable parameters,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Video Surveillance and Tracking Methods · Data-Driven Disease Surveillance
MethodsALIGN
