Learning to Expand: Reinforced Pseudo-relevance Feedback Selection for Information-seeking Conversations
Haojie Pan, Cen Chen, Chengyu Wang, Minghui Qiu, Liu Yang, Feng Ji,, Jun Huang

TL;DR
This paper introduces a reinforcement learning approach for selecting pseudo-relevance feedback terms to expand responses in information-seeking conversations, significantly improving response ranking accuracy in both benchmarks and real-world e-commerce applications.
Contribution
It proposes an end-to-end reinforced learning method for PRF term selection that does not require manual annotations, enhancing response expansion and ranking in dialogue systems.
Findings
Outperforms existing PRF selection methods on standard benchmarks
Achieves the best results across various evaluation metrics
Significantly improves online response ranking in e-commerce deployment
Abstract
Information-seeking conversation systems are increasingly popular in real-world applications, especially for e-commerce companies. To retrieve appropriate responses for users, it is necessary to compute the matching degrees between candidate responses and users' queries with historical dialogue utterances. As the contexts are usually much longer than responses, it is thus necessary to expand the responses (usually short) with richer information. Recent studies on pseudo-relevance feedback (PRF) have demonstrated its effectiveness in query expansion for search engines, hence we consider expanding response using PRF information. However, existing PRF approaches are either based on heuristic rules or require heavy manual labeling, which are not suitable for solving our task. To alleviate this problem, we treat the PRF selection for response expansion as a learning task and propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Text and Document Classification Technologies
MethodsLinear Layer · Linear Warmup With Linear Decay · Residual Connection · Layer Normalization · Softmax · Adam · Weight Decay · Attention Is All You Need · Dropout · WordPiece
