Inflected Forms Are Redundant in Question Generation Models
Xingwu Sun, Hongyin Tang, chengzhong Xu

TL;DR
This paper improves question generation models by reducing inflected form redundancy through word root replacement and a combined generation-copy-transformation approach, leading to better performance and efficiency.
Contribution
It introduces a novel method to fuse word transformation with question generation, decreasing decoder complexity and noise, and enhancing model accuracy.
Findings
Significant improvements in BLEU, ROUGE-L, and METEOR scores.
Reduced time cost and noise in question generation.
Effective application to RNN-based and UniLM models.
Abstract
Neural models with an encoder-decoder framework provide a feasible solution to Question Generation (QG). However, after analyzing the model vocabulary we find that current models (both RNN-based and pre-training based) have more than 23\% inflected forms. As a result, the encoder will generate separate embeddings for the inflected forms, leading to a waste of training data and parameters. Even worse, in decoding these models are vulnerable to irrelevant noise and they suffer from high computational costs. In this paper, we propose an approach to enhance the performance of QG by fusing word transformation. Firstly, we identify the inflected forms of words from the input of encoder, and replace them with the root words, letting the encoder pay more attention to the repetitive root words. Secondly, we propose to adapt QG as a combination of the following actions in the encode-decoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
