Inflected Forms Are Redundant in Question Generation Models

Xingwu Sun; Hongyin Tang; chengzhong Xu

arXiv:2301.00397·cs.CL·January 3, 2023

Inflected Forms Are Redundant in Question Generation Models

Xingwu Sun, Hongyin Tang, chengzhong Xu

PDF

Open Access

TL;DR

This paper improves question generation models by reducing inflected form redundancy through word root replacement and a combined generation-copy-transformation approach, leading to better performance and efficiency.

Contribution

It introduces a novel method to fuse word transformation with question generation, decreasing decoder complexity and noise, and enhancing model accuracy.

Findings

01

Significant improvements in BLEU, ROUGE-L, and METEOR scores.

02

Reduced time cost and noise in question generation.

03

Effective application to RNN-based and UniLM models.

Abstract

Neural models with an encoder-decoder framework provide a feasible solution to Question Generation (QG). However, after analyzing the model vocabulary we find that current models (both RNN-based and pre-training based) have more than 23\% inflected forms. As a result, the encoder will generate separate embeddings for the inflected forms, leading to a waste of training data and parameters. Even worse, in decoding these models are vulnerable to irrelevant noise and they suffer from high computational costs. In this paper, we propose an approach to enhance the performance of QG by fusing word transformation. Firstly, we identify the inflected forms of words from the input of encoder, and replace them with the root words, letting the encoder pay more attention to the repetitive root words. Secondly, we propose to adapt QG as a combination of the following actions in the encode-decoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques