A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammar Error Correction
Nankai Lin, Nankai Lin, Xiaotian Lin, Ziyu Yang, Shengyi Jiang

TL;DR
This paper introduces three novel evaluation metrics for Chinese Grammatical Error Correction that are designed to be independent of word segmentation and language models, aiming to improve the consistency and comparability of evaluations.
Contribution
The paper proposes three new evaluation metrics for CGEC, addressing the dependency issues of existing metrics on segmentation and language models.
Findings
The proposed metrics are validated for reasonableness and validity.
The metrics include sentence-level accuracy, char-level BLEU, and char-level meaning preservation.
They aim to become new standards for CGEC evaluation.
Abstract
As a fundamental task in natural language processing, Chinese Grammatical Error Correction (CGEC) has gradually received widespread attention and become a research hotspot. However, one obvious deficiency for the existing CGEC evaluation system is that the evaluation values are significantly influenced by the Chinese word segmentation results or different language models. The evaluation values of the same error correction model can vary considerably under different word segmentation systems or different language models. However, it is expected that these metrics should be independent of the word segmentation results and language models, as they may lead to a lack of uniqueness and comparability in the evaluation of different methods. To this end, we propose three novel evaluation metrics for CGEC in two dimensions: reference-based and reference-less. In terms of the reference-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
