SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check
Xingyi Cheng, Weidi Xu, Kunlong Chen, Shaohua Jiang, Feng Wang, Taifeng Wang, Wei Chu, Yuan Qi

TL;DR
This paper introduces SpellGCN, a graph convolutional network that integrates phonological and visual similarities among Chinese characters into language models, significantly improving Chinese Spelling Check accuracy.
Contribution
It proposes a novel end-to-end trainable model that incorporates character similarity knowledge via a specialized graph convolutional network for Chinese Spelling Check.
Findings
Achieves superior performance on three datasets
Outperforms previous models by a large margin
Demonstrates effectiveness of phonological and visual similarity integration
Abstract
Chinese Spelling Check (CSC) is a task to detect and correct spelling errors in Chinese natural language. Existing methods have made attempts to incorporate the similarity knowledge between Chinese characters. However, they take the similarity knowledge as either an external input resource or just heuristic rules. This paper proposes to incorporate phonological and visual similarity knowledge into language models for CSC via a specialized graph convolutional network (SpellGCN). The model builds a graph over the characters, and SpellGCN is learned to map this graph into a set of inter-dependent character classifiers. These classifiers are applied to the representations extracted by another network, such as BERT, enabling the whole network to be end-to-end trainable. Experiments (The dataset and all code for this paper are available at https://github.com/ACL2020SpellGCN/SpellGCN) are…
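The idea in the abstract, a graph over characters that is mapped into inter-dependent character classifiers and then applied to encoder representations, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses one generic graph-convolution step with the common symmetric normalization, a single merged similarity graph, an identity matrix in place of a learned weight, and a hand-written vector in place of a BERT output. All names and values below are hypothetical.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2, a common GCN normalization (assumed here)."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def gcn_classifiers(adj, char_emb, weight):
    """One graph-convolution step: each character's classifier vector becomes
    a mix of its own embedding and the embeddings of its similar characters."""
    return matmul(matmul(normalize_adjacency(adj), char_emb), weight)

def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict(hidden, classifiers):
    """Score one token representation against every character classifier."""
    logits = [sum(h * c for h, c in zip(hidden, clf)) for clf in classifiers]
    return softmax(logits)

# Toy setup (hypothetical): 4 characters with 3-dimensional embeddings.
# Characters 0 and 1 are marked as similar, and so are characters 2 and 3.
adj = [[0, 1, 0, 0],
       [1, 0, 0, 0],
       [0, 0, 0, 1],
       [0, 0, 1, 0]]
rng = random.Random(0)
dim = 3
char_emb = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(4)]
# Identity matrix standing in for a learned weight matrix.
weight = [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]

classifiers = gcn_classifiers(adj, char_emb, weight)
hidden = [0.2, -0.5, 0.9]  # stand-in for an encoder output such as BERT's
probs = predict(hidden, classifiers)
print(probs)
```

In this toy graph, characters 0 and 1 share the same neighborhood, so they end up with identical classifier vectors and therefore identical probabilities. A real similarity graph is far less symmetric, but the same mechanism pulls the classifiers of confusable characters toward each other.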
Code & Models
- 🤗 Macropodus/macbert4mdcspell_v1 · model · 40k dl · ♡ 2
- 🤗 Macropodus/macbert4csc_v2 · model · 8 dl · ♡ 2
- 🤗 Macropodus/macbert4csc_v1 · model · 5 dl · ♡ 1
- 🤗 Macropodus/bert4csc_v1 · model · 4 dl · ♡ 1
- 🤗 Macropodus/relm_v1 · model · 42 dl · ♡ 1
- 🤗 Macropodus/macbert4mdcspell_v2 · model · 283 dl · ♡ 6
- 🤗 Macropodus/macbert4mdcspell_v3 · model · 310 dl · ♡ 1
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
Methods: Linear Layer · Weight Decay · Residual Connection · Adam · Layer Normalization · Softmax · Attention Is All You Need · Dropout · Multi-Head Attention
