Grammatical Error Correction as GAN-like Sequence Labeling

Kevin Parnow; Zuchao Li; and Hai Zhao

arXiv:2105.14209·cs.CL·June 1, 2021·1 cites

Grammatical Error Correction as GAN-like Sequence Labeling

Kevin Parnow, Zuchao Li, and Hai Zhao

PDF

Open Access

TL;DR

This paper introduces a GAN-inspired sequence labeling model for grammatical error correction that improves training by better mimicking real error distributions, leading to state-of-the-art results.

Contribution

It proposes a novel GAN-like framework with a discriminator and generator for GEC, addressing training-inference mismatch and enhancing correction accuracy.

Findings

01

Achieves state-of-the-art performance on multiple benchmarks.

02

Sampling from real error distributions improves correction quality.

03

The model effectively reduces training-inference mismatch.

Abstract

In Grammatical Error Correction (GEC), sequence labeling models enjoy fast inference compared to sequence-to-sequence models; however, inference in sequence labeling GEC models is an iterative process, as sentences are passed to the model for multiple rounds of correction, which exposes the model to sentences with progressively fewer errors at each round. Traditional GEC models learn from sentences with fixed error rates. Coupling this with the iterative correction process causes a mismatch between training and inference that affects final performance. In order to address this mismatch, we propose a GAN-like sequence labeling model, which consists of a grammatical error detector as a discriminator and a grammatical error labeler with Gumbel-Softmax sampling as a generator. By sampling from real error distributions, our errors are more genuine compared to traditional synthesized GEC…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification