A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammar   Error Correction

Nankai Lin; Nankai Lin; Xiaotian Lin; Ziyu Yang; Shengyi Jiang

arXiv:2205.00217·cs.CL·May 3, 2022

A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammar Error Correction

Nankai Lin, Nankai Lin, Xiaotian Lin, Ziyu Yang, Shengyi Jiang

PDF

Open Access

TL;DR

This paper introduces three novel evaluation metrics for Chinese Grammatical Error Correction that are designed to be independent of word segmentation and language models, aiming to improve the consistency and comparability of evaluations.

Contribution

The paper proposes three new evaluation metrics for CGEC, addressing the dependency issues of existing metrics on segmentation and language models.

Findings

01

The proposed metrics are validated for reasonableness and validity.

02

The metrics include sentence-level accuracy, char-level BLEU, and char-level meaning preservation.

03

They aim to become new standards for CGEC evaluation.

Abstract

As a fundamental task in natural language processing, Chinese Grammatical Error Correction (CGEC) has gradually received widespread attention and become a research hotspot. However, one obvious deficiency for the existing CGEC evaluation system is that the evaluation values are significantly influenced by the Chinese word segmentation results or different language models. The evaluation values of the same error correction model can vary considerably under different word segmentation systems or different language models. However, it is expected that these metrics should be independent of the word segmentation results and language models, as they may lead to a lack of uniqueness and comparability in the evaluation of different methods. To this end, we propose three novel evaluation metrics for CGEC in two dimensions: reference-based and reference-less. In terms of the reference-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification