Improving Pre-trained Language Models with Syntactic Dependency   Prediction Task for Chinese Semantic Error Recognition

Bo Sun; Baoxin Wang; Wanxiang Che; Dayong Wu; Zhigang Chen; Ting Liu

arXiv:2204.07464·cs.CL·April 18, 2022·1 cites

Improving Pre-trained Language Models with Syntactic Dependency Prediction Task for Chinese Semantic Error Recognition

Bo Sun, Baoxin Wang, Wanxiang Che, Dayong Wu, Zhigang Chen, Ting Liu

PDF

Open Access

TL;DR

This paper introduces a novel approach for Chinese Semantic Error Recognition by incorporating syntactic dependency prediction tasks into pre-trained models, and provides a new dataset for this purpose, showing improved performance over existing models.

Contribution

The paper proposes syntax-related pre-training tasks for Chinese semantic error detection and creates the first high-quality dataset for this task.

Findings

01

Pre-training with syntactic dependency tasks improves error detection accuracy.

02

The proposed method outperforms universal and syntax-infused models on CoCLSA.

03

A new dataset for Chinese semantic error recognition is introduced.

Abstract

Existing Chinese text error detection mainly focuses on spelling and simple grammatical errors. These errors have been studied extensively and are relatively simple for humans. On the contrary, Chinese semantic errors are understudied and more complex that humans cannot easily recognize. The task of this paper is Chinese Semantic Error Recognition (CSER), a binary classification task to determine whether a sentence contains semantic errors. The current research has no effective method to solve this task. In this paper, we inherit the model structure of BERT and design several syntax-related pre-training tasks so that the model can learn syntactic knowledge. Our pre-training tasks consider both the directionality of the dependency structure and the diversity of the dependency relationship. Due to the lack of a published dataset for CSER, we build a high-quality dataset for CSER for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Residual Connection · Linear Warmup With Linear Decay · Dense Connections · Attention Dropout · Layer Normalization · Softmax