Do Grammatical Error Correction Models Realize Grammatical Generalization?

Masato Mita; Hitomi Yanaka

arXiv:2106.03031 · cs.CL · June 8, 2021 · 6 cites

Open Access

TL;DR

This paper investigates whether current grammatical error correction (GEC) models can generalize grammatical rules to unseen errors, and finds that a standard Transformer-based GEC model fails to do so even in simplified settings.

Contribution

The study introduces an analysis method using synthetic and real datasets with controlled vocabularies to evaluate grammatical generalization in GEC models.

Findings

01 Transformer-based GEC models fail to generalize grammatical knowledge

02 Models do not correct errors from limited training data

03 Current models lack necessary grammatical generalization ability

Abstract

There has been increased interest in data generation approaches to grammatical error correction (GEC) using pseudo data. However, these approaches suffer from several issues that make them inconvenient for real-world deployment, including a demand for large amounts of training data. On the other hand, some errors based on grammatical rules may not necessarily require a large amount of data if GEC models can realize grammatical generalization. This study explores to what extent GEC models generalize the grammatical knowledge required for correcting errors. We introduce an analysis method using synthetic and real GEC datasets with controlled vocabularies to evaluate whether models can generalize to unseen errors. We found that a current standard Transformer-based GEC model fails to realize grammatical generalization even in simple settings with limited vocabulary and syntax, suggesting that…
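To make the idea of evaluating on unseen errors with a controlled vocabulary more concrete, here is a minimal sketch in Python. It is not the authors' dataset or procedure: the toy vocabulary, the restriction to subject-verb agreement, and the helper names `make_pair` and `build_split` are all assumptions made for illustration. The split keeps held-out verbs in the training data only as error-free sentences, so a model would see the words but never an error involving them.

```python
import itertools

# Hypothetical toy vocabulary (an assumption, not taken from the paper).
SUBJECTS = {"the dog": "sg", "the dogs": "pl", "the cat": "sg", "the cats": "pl"}
# Maps the plural (base) verb form to its third-person singular form.
VERBS = {"run": "runs", "sleep": "sleeps", "jump": "jumps", "eat": "eats"}


def make_pair(subject, verb):
    """Return (erroneous source, corrected target) for one agreement error."""
    number = SUBJECTS[subject]
    correct = VERBS[verb] if number == "sg" else verb
    wrong = verb if number == "sg" else VERBS[verb]
    return f"{subject} {wrong}", f"{subject} {correct}"


def build_split(held_out_verbs):
    """Split pairs so that errors on held-out verbs appear only at test time."""
    train, test = [], []
    for subject, verb in itertools.product(SUBJECTS, VERBS):
        source, target = make_pair(subject, verb)
        if verb in held_out_verbs:
            # The error itself is unseen during training ...
            test.append((source, target))
            # ... but the verb still occurs in an error-free training sentence.
            train.append((target, target))
        else:
            train.append((source, target))
    return train, test


train, test = build_split(held_out_verbs={"eat"})
for source, target in test:
    print(f"{source!r} -> {target!r}")
```

A model that had learned the agreement rule, rather than memorized corrections for individual verbs, would be expected to correct the `test` pairs even though no erroneous use of the held-out verb appears in `train`.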

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification