Exploring Question-Specific Rewards for Generating Deep Questions
Yuxi Xie, Liangming Pan, Dongzhe Wang, Min-Yen Kan, Yansong Feng

TL;DR
This paper proposes using reinforcement learning with question-specific rewards to improve the quality of generated questions, focusing on fluency, relevance, and answerability, and analyzes their impact through experiments.
Contribution
It introduces a reinforcement learning framework with tailored rewards for question generation, highlighting the importance of reward relevance to human judgment.
Findings
Relevance reward improves question quality according to human judgment.
Answerability reward can bias the model, reducing question quality.
Optimizing for relevant rewards enhances automatic evaluation metrics.
Abstract
Recent question generation (QG) approaches often utilize the sequence-to-sequence framework (Seq2Seq) to optimize the log-likelihood of ground-truth questions using teacher forcing. However, this training objective is inconsistent with actual question quality, which is often reflected by certain global properties such as whether the question can be answered by the document. As such, we directly optimize for QG-specific objectives via reinforcement learning to improve question quality. We design three different rewards that target to improve the fluency, relevance, and answerability of generated questions. We conduct both automatic and human evaluations in addition to a thorough analysis to explore the effect of each QG-specific reward. We find that optimizing question-specific rewards generally leads to better performance in automatic evaluation metrics. However, only the rewards that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Educational Technology and Assessment · Expert finding and Q&A systems
