Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking
Heng-Da Xu, Zhongli Li, Qingyu Zhou, Chao Li, Zizhen Wang, Yunbo Cao,, Heyan Huang, Xian-Ling Mao

TL;DR
This paper introduces ReaLiSe, a multimodal Chinese spell checker that effectively combines semantic, phonetic, and graphic information to improve error detection and correction in user-generated text.
Contribution
The paper presents a novel multimodal approach for Chinese spell checking that directly leverages semantic, phonetic, and graphic information of characters, outperforming previous heuristic-based methods.
Findings
ReaLiSe outperforms strong baselines on SIGHAN benchmarks.
Multimodal information significantly improves spell checking accuracy.
Selective modality mixing enhances correction performance.
Abstract
Chinese Spell Checking (CSC) aims to detect and correct erroneous characters for user-generated text in the Chinese language. Most of the Chinese spelling errors are misused semantically, phonetically or graphically similar characters. Previous attempts noticed this phenomenon and try to use the similarity for this task. However, these methods use either heuristics or handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called ReaLiSe, by directly leveraging the multimodal information of the Chinese characters. The ReaLiSe model tackles the CSC task by (1) capturing the semantic, phonetic and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗iioSnail/ReaLiSe-for-cscmodel· 11 dl· ♡ 711 dl♡ 7
- 🤗Macropodus/macbert4mdcspell_v1model· 40k dl· ♡ 240k dl♡ 2
- 🤗Macropodus/macbert4csc_v2model· 8 dl· ♡ 28 dl♡ 2
- 🤗Macropodus/macbert4csc_v1model· 5 dl· ♡ 15 dl♡ 1
- 🤗Macropodus/bert4csc_v1model· 4 dl· ♡ 14 dl♡ 1
- 🤗Macropodus/relm_v1model· 42 dl· ♡ 142 dl♡ 1
- 🤗Macropodus/macbert4mdcspell_v2model· 283 dl· ♡ 6283 dl♡ 6
- 🤗Macropodus/macbert4mdcspell_v3model· 310 dl· ♡ 1310 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
