Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query
Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu, Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun

TL;DR
This paper introduces a novel interactive retrieval framework that actively identifies missing discriminative details in incomplete image descriptions, improving text-based image retrieval accuracy with minimal user effort.
Contribution
It proposes an Ask-and-Confirm framework with a reinforcement-learning policy for active detail enrichment, and a weakly-supervised training strategy without human-annotated dialogs.
Findings
Significant improvement in retrieval performance over baseline methods.
Efficient active detail search with user confirmation reduces user effort.
Framework effective even without human-annotated dialog data.
Abstract
Text-based image retrieval has seen considerable progress in recent years. However, the performance of existing methods suffers in real life since the user is likely to provide an incomplete description of an image, which often leads to results filled with false positives that fit the incomplete description. In this work, we introduce the partial-query problem and extensively analyze its influence on text-based image retrieval. Previous interactive methods tackle the problem by passively receiving users' feedback to supplement the incomplete query iteratively, which is time-consuming and requires heavy user effort. Instead, we propose a novel retrieval framework that conducts the interactive process in an Ask-and-Confirm fashion, where AI actively searches for discriminative details missing in the current query, and users only need to confirm AI's proposal. Specifically, we propose an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Information Retrieval and Search Behavior
