A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback
Jianghong Zhou, Joyce C. Ho, Chen Lin, Eugene Agichtein

TL;DR
This paper introduces DQrank, a deep Q-learning method that leverages sentence-level user feedback in interactive search, significantly improving ranking performance by better capturing user intent and engagement.
Contribution
The work presents a novel DQrank model that incorporates sentence-level feedback using BERT-based ranking and exploration mechanisms, advancing beyond item-level feedback approaches.
Findings
DQrank outperforms previous RL methods by at least 12% in search accuracy.
The model effectively captures long-term user engagement from sentence feedback.
Ablation studies confirm the importance of each component in DQrank.
Abstract
Interactive search can provide a better experience by incorporating interaction feedback from the users. This can significantly improve search accuracy as it helps avoid irrelevant information and captures the users' search intents. Existing state-of-the-art (SOTA) systems use reinforcement learning (RL) models to incorporate the interactions but focus on item-level feedback, ignoring the fine-grained information found in sentence-level feedback. Yet such feedback requires extensive RL action space exploration and large amounts of annotated data. This work addresses these challenges by proposing a new deep Q-learning (DQ) approach, DQrank. DQrank adapts BERT-based models, the SOTA in natural language processing, to select crucial sentences based on users' engagement and rank the items to obtain more satisfactory responses. We also propose two mechanisms to better explore optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Mobile Crowdsensing and Crowdsourcing
MethodsExperience Replay · Q-Learning · Focus
