Semi-supervised Text-based Person Search
Daming Gao, Yang Bai, Min Cao, Hao Dou, Mang Ye, Min Zhang

TL;DR
This paper introduces a semi-supervised approach for text-based person search that leverages image captioning to generate pseudo-texts for unannotated images and employs noise-robust training strategies to improve retrieval accuracy.
Contribution
It pioneers semi-supervised TBPS by combining pseudo-text generation with a noise-robust retrieval framework incorporating PC-Mask and NP-Train strategies.
Findings
Achieves promising performance on multiple TBPS benchmarks.
Effectively handles noisy pseudo-text data during training.
Demonstrates the viability of semi-supervised learning in TBPS.
Abstract
Text-based person search (TBPS) aims to retrieve images of a specific person from a large image gallery based on a natural language description. Existing methods rely on massive annotated image-text data to achieve satisfactory performance in fully-supervised learning. It poses a significant challenge in practice, as acquiring person images from surveillance videos is relatively easy, while obtaining annotated texts is challenging. The paper undertakes a pioneering initiative to explore TBPS under the semi-supervised setting, where only a limited number of person images are annotated with textual descriptions while the majority of images lack annotations. We present a two-stage basic solution based on generation-then-retrieval for semi-supervised TBPS. The generation stage enriches annotated data by applying an image captioning model to generate pseudo-texts for unannotated images.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data-Driven Disease Surveillance · Social and Cultural Studies
