Harnessing Weak Pair Uncertainty for Text-based Person Search
Jintao Sun, Zhedong Zheng, Gangyi Ding

TL;DR
This paper proposes an uncertainty-aware approach for text-based person search that leverages weak positive pairs, improving retrieval accuracy by explicitly modeling pair uncertainty.
Contribution
It introduces a novel uncertainty estimation and regularization framework to better utilize weak positive pairs in text-based person search.
Findings
Achieves +3.06%, +3.55%, +6.94% mAP improvements on three datasets.
Explicitly models pair uncertainty to enhance retrieval performance.
Incorporates group-wise matching loss for better representation learning.
Abstract
In this paper, we study the text-based person search, which is to retrieve the person of interest via natural language description. Prevailing methods usually focus on the strict one-to-one correspondence pair matching between the visual and textual modality, such as contrastive learning. However, such a paradigm unintentionally disregards the weak positive image-text pairs, which are of the same person but the text descriptions are annotated from different views (cameras). To take full use of weak positives, we introduce an uncertainty-aware method to explicitly estimate image-text pair uncertainty, and incorporate the uncertainty into the optimization procedure in a smooth manner. Specifically, our method contains two modules: uncertainty estimation and uncertainty regularization. (1) Uncertainty estimation is to obtain the relative confidence on the given positive pairs; (2) Based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
