Sentence-Level Grammatical Error Identification as Sequence-to-Sequence   Correction

Allen Schmaltz; Yoon Kim; Alexander M. Rush; Stuart M. Shieber

arXiv:1604.04677·cs.CL·April 19, 2016

Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction

Allen Schmaltz, Yoon Kim, Alexander M. Rush, Stuart M. Shieber

PDF

TL;DR

This paper presents an attention-based sequence-to-sequence model for sentence-level grammatical error identification and correction, demonstrating superior performance on the AESW 2016 Shared Task using a combination of character and word-based models.

Contribution

It introduces a novel combination of character-based and word-based encoder-decoder models with CNNs for improved grammatical error detection and correction.

Findings

01

Character-based models outperform word-based models.

02

The combined model achieves the highest accuracy on AESW 2016.

03

Sequence-to-sequence models can effectively identify and correct grammatical errors.

Abstract

We demonstrate that an attention-based encoder-decoder model can be used for sentence-level grammatical error identification for the Automated Evaluation of Scientific Writing (AESW) Shared Task 2016. The attention-based encoder-decoder models can be used for the generation of corrections, in addition to error identification, which is of interest for certain end-user applications. We show that a character-based encoder-decoder model is particularly effective, outperforming other results on the AESW Shared Task on its own, and showing gains over a word-based counterpart. Our final model--a combination of three character-based encoder-decoder models, one word-based encoder-decoder model, and a sentence-level CNN--is the highest performing system on the AESW 2016 binary prediction Shared Task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.