Combining GCN and Transformer for Chinese Grammatical Error Detection

Jinhong Zhang

arXiv:2105.09085·cs.CL·July 13, 2021

Combining GCN and Transformer for Chinese Grammatical Error Detection

Jinhong Zhang

PDF

Open Access

TL;DR

This paper presents a system combining BERT-based models and graph neural networks for detecting four types of Chinese grammatical errors, achieving top performance in the CGED 2020 task.

Contribution

The paper introduces a hybrid approach integrating syntactic, contextual, and lexical information using BERT and graph neural networks for Chinese grammatical error detection.

Findings

01

Achieved highest F1 scores in CGED 2020 task

02

Demonstrated effectiveness of ensemble mechanism

03

Validated the benefit of combining multiple models

Abstract

This paper describes our system at NLPTEA-2020 Task: Chinese Grammatical Error Diagnosis (CGED). The goal of CGED is to diagnose four types of grammatical errors: word selection (S), redundant words (R), missing words (M), and disordered words (W). The automatic CGED system contains two parts including error detection and error correction and our system is designed to solve the error detection problem. Our system is built on three models: 1) a BERT-based model leveraging syntactic information; 2) a BERT-based model leveraging contextual embeddings; 3) a lexicon-based graph neural network leveraging lexical information. We also design an ensemble mechanism to improve the single model's performance. Finally, our system achieves the highest F1 scores at detection level and identification level among all teams participating in the CGED 2020 task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling

MethodsGraph Neural Network