DAPL: Integration of Positive and Negative Descriptions in Text-Based Person Search
Yuchuan Deng, Zhanpeng Hu, Zijie Xin, Chuang Deng, Qijun Zhao

TL;DR
This paper introduces DAPL, a novel framework for text-based person search that integrates positive and negative descriptions, employing dual contrastive and matching techniques to improve accuracy and robustness.
Contribution
DAPL is the first to incorporate negative descriptions in TBPS using dual contrastive and token-wise similarity losses for better interpretative accuracy.
Findings
DAPL outperforms existing methods in TBPS accuracy.
The DTS loss improves fine-grained visual-text alignment.
Inclusion of negative descriptions reduces false positives.
Abstract
Text-based person search (TBPS) aims to retrieve specific images of individuals from large datasets using textual descriptions. Existing TBPS methods focus primarily on identifying explicit positive attributes, often neglecting the critical role of negative descriptions. This oversight can lead to false positives, where images that should be excluded based on negative descriptions are incorrectly included, due to partial alignment with the positive criteria. To address this limitation, we propose the Dual Attribute Prompt Learning (DAPL) framework, which incorporates both positive and negative descriptions to improve the interpretative accuracy of vision-language models in TBPS tasks. DAPL combines Dual Image-Attribute Contrastive (DIAC) learning with Sensitive Image-Attribute Matching (SIAM) learning to enhance the detection of previously unseen attributes. Furthermore, to achieve a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
