A Deep Reinforcement Learning Approach for Interactive Search with   Sentence-level Feedback

Jianghong Zhou; Joyce C. Ho; Chen Lin; Eugene Agichtein

arXiv:2310.03043·cs.LG·October 6, 2023

A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback

Jianghong Zhou, Joyce C. Ho, Chen Lin, Eugene Agichtein

PDF

Open Access

TL;DR

This paper introduces DQrank, a deep Q-learning method that leverages sentence-level user feedback in interactive search, significantly improving ranking performance by better capturing user intent and engagement.

Contribution

The work presents a novel DQrank model that incorporates sentence-level feedback using BERT-based ranking and exploration mechanisms, advancing beyond item-level feedback approaches.

Findings

01

DQrank outperforms previous RL methods by at least 12% in search accuracy.

02

The model effectively captures long-term user engagement from sentence feedback.

03

Ablation studies confirm the importance of each component in DQrank.

Abstract

Interactive search can provide a better experience by incorporating interaction feedback from the users. This can significantly improve search accuracy as it helps avoid irrelevant information and captures the users' search intents. Existing state-of-the-art (SOTA) systems use reinforcement learning (RL) models to incorporate the interactions but focus on item-level feedback, ignoring the fine-grained information found in sentence-level feedback. Yet such feedback requires extensive RL action space exploration and large amounts of annotated data. This work addresses these challenges by proposing a new deep Q-learning (DQ) approach, DQrank. DQrank adapts BERT-based models, the SOTA in natural language processing, to select crucial sentences based on users' engagement and rank the items to obtain more satisfactory responses. We also propose two mechanisms to better explore optimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Mobile Crowdsensing and Crowdsourcing

MethodsExperience Replay · Q-Learning · Focus