Semantic-Based Self-Critical Training For Question Generation
Lo\"ic, Kwate Dassi

TL;DR
This paper introduces a Transformer-based reinforcement learning approach for question generation that incorporates semantic rewards to improve the relevance and quality of generated questions, addressing issues of exposure bias and metric discordance.
Contribution
It proposes a novel semantic-based self-critical training method using a generator-evaluator architecture with semantic rewards, enhancing question generation quality.
Findings
Semantic-based reinforcement learning improves question relevance.
Combines n-gram and semantic evaluation metrics.
Human evaluation confirms semantic consistency.
Abstract
Question generation is a conditioned language generation task that consists in generating a context-aware question given a context and the targeted answer. Train language modelling with a mere likelihood maximization has been widely used while suffering from exposure bias and the discordance between the training and the test metrics. In the way of addressing this issue, The presented work portrays a fully Transformer-based reinforcement learning generator-evaluation architecture for neural question generation. To edge the flexibility of the generation, a semantic-based reward score was externally infused during the training to drive the training of the language model. The global architecture is laid out in a generator-evaluator fashion optimized directly to n-gram and semantic-based metrics. Evaluation metrics for language modelling only based on n-gram overlapping do not consider…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
MethodsREINFORCE
