Learning to Expand: Reinforced Pseudo-relevance Feedback Selection for Information-seeking Conversations

Haojie Pan; Cen Chen; Chengyu Wang; Minghui Qiu; Liu Yang; Feng Ji; Jun Huang

arXiv:2011.12771·cs.CL·November 3, 2022

Open Access

TL;DR

This paper introduces a reinforcement learning approach for selecting pseudo-relevance feedback terms to expand responses in information-seeking conversations, significantly improving response ranking accuracy in both benchmarks and real-world e-commerce applications.

Contribution

The paper proposes an end-to-end reinforcement learning method for PRF term selection that requires no manual annotations and improves response expansion and ranking in dialogue systems.

Findings

01. Outperforms existing PRF selection methods on standard benchmarks

02. Achieves the best results across various evaluation metrics

03. Significantly improves online response ranking in e-commerce deployment

Abstract

Information-seeking conversation systems are increasingly popular in real-world applications, especially for e-commerce companies. To retrieve appropriate responses for users, it is necessary to compute the matching degrees between candidate responses and users' queries with historical dialogue utterances. As the contexts are usually much longer than the responses, the (usually short) responses need to be expanded with richer information. Recent studies on pseudo-relevance feedback (PRF) have demonstrated its effectiveness in query expansion for search engines; hence, we consider expanding responses using PRF information. However, existing PRF approaches are either based on heuristic rules or require heavy manual labeling, neither of which is suitable for our task. To alleviate this problem, we treat the PRF selection for response expansion as a learning task and propose a…
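To make the idea of response expansion concrete, here is a minimal sketch of a heuristic, frequency-based PRF baseline of the kind the abstract contrasts against. It is not the reinforced selection method proposed in the paper. The function names, the stopword list, the overlap-based retrieval step, and the toy corpus are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical stopword list, kept tiny for illustration only.
STOPWORDS = {"the", "a", "is", "on", "to", "of", "and", "in"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [tok for tok in text.lower().split() if tok not in STOPWORDS]


def select_prf_terms(response, corpus, top_docs=2, top_terms=2):
    """Pick expansion terms for a short response (heuristic sketch).

    1. Rank corpus documents by term overlap with the response.
    2. Treat the top-ranked documents as pseudo-relevant feedback.
    3. Return the most frequent feedback terms not already in the response.
    """
    response_terms = set(tokenize(response))
    ranked = sorted(
        corpus,
        key=lambda doc: len(response_terms & set(tokenize(doc))),
        reverse=True,
    )
    feedback_docs = ranked[:top_docs]
    counts = Counter(
        tok
        for doc in feedback_docs
        for tok in tokenize(doc)
        if tok not in response_terms
    )
    return [term for term, _ in counts.most_common(top_terms)]


def expand_response(response, corpus, **kwargs):
    """Append the selected PRF terms to the original response."""
    terms = select_prf_terms(response, corpus, **kwargs)
    return response + " " + " ".join(terms)


if __name__ == "__main__":
    toy_corpus = [
        "refund requests appear on the order page within seven days",
        "check refund status on the order page",
        "shipping takes three days",
    ]
    print(expand_response("refund is available now", toy_corpus))
```

A fixed frequency rule like this is exactly the sort of hand-written heuristic that a learned selector would replace: the learned policy decides which candidate terms to keep based on their effect on downstream response ranking rather than on raw counts.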

Taxonomy

Topics: Topic Modeling · Information Retrieval and Search Behavior · Text and Document Classification Technologies

Methods: Linear Layer · Linear Warmup With Linear Decay · Residual Connection · Layer Normalization · Softmax · Adam · Weight Decay · Attention Is All You Need · Dropout · WordPiece