Type-Driven Multi-Turn Corrections for Grammatical Error Correction
Shaopeng Lai, Qingyu Zhou, Jiali Zeng, Zhongli Li, Chao Li, Yunbo Cao,, Jinsong Su

TL;DR
This paper introduces a type-driven multi-turn correction method for grammatical error correction, explicitly modeling the correction process and interdependence between error types, leading to state-of-the-art results.
Contribution
It proposes a novel training approach that constructs multiple targeted training instances per error type, improving model awareness and correction accuracy.
Findings
Significantly improves model training effectiveness.
Achieves state-of-the-art single-model performance on English GEC benchmarks.
Demonstrates the benefit of explicit error type modeling.
Abstract
Grammatical Error Correction (GEC) aims to automatically detect and correct grammatical errors. In this aspect, dominant models are trained by one-iteration learning while performing multiple iterations of corrections during inference. Previous studies mainly focus on the data augmentation approach to combat the exposure bias, which suffers from two drawbacks. First, they simply mix additionally-constructed training instances and original ones to train models, which fails to help models be explicitly aware of the procedure of gradual corrections. Second, they ignore the interdependence between different types of corrections. In this paper, we propose a Type-Driven Multi-Turn Corrections approach for GEC. Using this approach, from each training instance, we additionally construct multiple training instances, each of which involves the correction of a specific type of errors. Then, we use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsAttentive Walk-Aggregating Graph Neural Network
