Disentangled Phonetic Representation for Chinese Spelling Correction

Zihong Liang; Xiaojun Quan; Qifan Wang

arXiv:2305.14783·cs.CL·May 25, 2023·1 cites

Disentangled Phonetic Representation for Chinese Spelling Correction

Zihong Liang, Xiaojun Quan, Qifan Wang

PDF

Open Access 1 Repo 7 Models

TL;DR

This paper introduces a novel approach for Chinese Spelling Correction that disentangles phonetic and textual features, using a pinyin-to-character objective and self-distillation to improve correction accuracy.

Contribution

It proposes a disentangled representation method with a pinyin-to-character objective and self-distillation, enhancing phonetic information utilization in CSC.

Findings

01

Outperforms existing methods on three benchmarks.

02

Effective disentanglement of phonetic and textual features.

03

Improved correction accuracy with phonetic information.

Abstract

Chinese Spelling Correction (CSC) aims to detect and correct erroneous characters in Chinese texts. Although efforts have been made to introduce phonetic information (Hanyu Pinyin) in this task, they typically merge phonetic representations with character representations, which tends to weaken the representation effect of normal texts. In this work, we propose to disentangle the two types of features to allow for direct interaction between textual and phonetic information. To learn useful phonetic representations, we introduce a pinyin-to-character objective to ask the model to predict the correct characters based solely on phonetic information, where a separation mask is imposed to disable attention from phonetic input to text. To avoid overfitting the phonetics, we further design a self-distillation module to ensure that semantic information plays a major role in the prediction.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liangzh63/dorm-csc
noneOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis