Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu, Chenwei Zhang, Xiaohui Yan, Yi Chang, Philip S. Yu

TL;DR
This paper introduces QREFINE, a deep reinforcement learning-based model that refines ill-formed questions into well-formed ones, improving retrieval accuracy in QA systems by enhancing question quality and understanding.
Contribution
The paper presents a novel unified model combining character and contextual embeddings with deep reinforcement learning for question refinement in retrieval-based QA systems.
Findings
Refined questions are more readable and less error-prone.
Refinement significantly improves answer retrieval accuracy.
QREFINE outperforms baseline models in quality and retrieval performance.
Abstract
In real-world question-answering (QA) systems, ill-formed questions, such as wrong words, ill word order, and noisy expressions, are common and may prevent the QA systems from understanding and answering them accurately. In order to eliminate the effect of ill-formed questions, we approach the question refinement task and propose a unified model, QREFINE, to refine the ill-formed questions to well-formed question. The basic idea is to learn a Seq2Seq model to generate a new question from the original one. To improve the quality and retrieval performance of the generated questions, we make two major improvements: 1) To better encode the semantics of ill-formed questions, we enrich the representation of questions with character embedding and the recent proposed contextual word embedding such as BERT, besides the traditional context-free word embeddings; 2) To make it capable to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsLinear Layer · Sigmoid Activation · Tanh Activation · Weight Decay · Residual Connection · Adam · Layer Normalization · Softmax · Attention Is All You Need · Dropout
