Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval
Huafeng Li, Shedan Yang, Yafei Zhang, Dapeng Tao, Zhengtao Yu

TL;DR
This paper introduces a novel method for text-pedestrian image retrieval that employs progressive feature mining and external knowledge to enhance discriminative features and reduce modality and textual diversity issues, achieving superior results.
Contribution
The paper proposes a progressive feature mining approach combined with external knowledge-assisted feature purification to improve cross-modal pedestrian retrieval performance.
Findings
Outperforms existing methods on three challenging datasets.
Surpasses large-scale model-based methods on large-scale datasets.
Effectively reduces modality discrepancy and textual diversity impacts.
Abstract
Text-Pedestrian Image Retrieval aims to use the text describing pedestrian appearance to retrieve the corresponding pedestrian image. This task involves not only modality discrepancy, but also the challenge of the textual diversity of pedestrians with the same identity. At present, although existing research progress has been made in text-pedestrian image retrieval, these methods do not comprehensively consider the above-mentioned problems. Considering these, this paper proposes a progressive feature mining and external knowledge-assisted feature purification method. Specifically, we use a progressive mining mode to enable the model to mine discriminative features from neglected information, thereby avoiding the loss of discriminative information and improving the expression ability of features. In addition, to further reduce the negative impact of modal discrepancy and text diversity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
