Approaching Neural Grammatical Error Correction as a Low-Resource   Machine Translation Task

Marcin Junczys-Dowmunt; Roman Grundkiewicz; Shubha Guha; Kenneth; Heafield

arXiv:1804.05940·cs.CL·April 18, 2018

Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task

Marcin Junczys-Dowmunt, Roman Grundkiewicz, Shubha Guha, Kenneth, Heafield

PDF

1 Repo

TL;DR

This paper adapts low-resource neural machine translation techniques to improve neural grammatical error correction, achieving state-of-the-art results that surpass previous neural and non-neural systems on key benchmarks.

Contribution

It introduces a set of model-independent methods for neural GEC, inspired by low-resource MT, leading to significant performance improvements.

Findings

01

Over 10% improvement on CoNLL-2014 benchmark

02

5.9% improvement on JFLEG test set

03

Outperforms non-neural state-of-the-art systems

Abstract

Previously, neural methods in grammatical error correction (GEC) did not reach state-of-the-art results compared to phrase-based statistical machine translation (SMT) baselines. We demonstrate parallels between neural GEC and low-resource neural MT and successfully adapt several methods from low-resource MT to neural GEC. We further establish guidelines for trustable results in neural GEC and propose a set of model-independent methods for neural GEC that can be easily applied in most GEC settings. Proposed methods include adding source-side noise, domain-adaptation techniques, a GEC-specific training-objective, transfer learning with monolingual data, and ensembling of independently trained GEC models and language models. The combined effects of these methods result in better than state-of-the-art neural GEC models that outperform previously best neural GEC systems by more than 10%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

grammatical/neural-naacl2018
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.