From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction
Xiuyu Wu, Yunfang Wu

TL;DR
This paper introduces a two-step framework for Chinese Grammatical Error Correction, combining a novel zero-shot spelling correction with POS and semantic features for grammatical correction, achieving state-of-the-art results.
Contribution
It proposes a zero-shot spelling correction method and a POS-enhanced neural model for grammatical correction, outperforming previous methods without synthetic data.
Findings
Achieved 42.11 F0.5 score on CGEC dataset.
Outperformed previous state-of-the-art by 1.30 points.
Produced meaningful POS representations capturing transition rules.
Abstract
Chinese Grammatical Error Correction (CGEC) aims to generate a correct sentence from an erroneous sequence, where different kinds of errors are mixed. This paper divides the CGEC task into two steps, namely spelling error correction and grammatical error correction. Specifically, we propose a novel zero-shot approach for spelling error correction, which is simple but effective, obtaining a high precision to avoid error accumulation of the pipeline structure. To handle grammatical error correction, we design part-of-speech (POS) features and semantic class features to enhance the neural network model, and propose an auxiliary task to predict the POS sequence of the target sentence. Our proposed framework achieves a 42.11 F0.5 score on CGEC dataset without using any synthetic data or data augmentation methods, which outperforms the previous state-of-the-art by a wide margin of 1.30…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
