Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

Heng-Da Xu; Zhongli Li; Qingyu Zhou; Chao Li; Zizhen Wang; Yunbo Cao; Heyan Huang; Xian-Ling Mao

arXiv:2105.12306·cs.CL·May 27, 2021·1 cites

TL;DR

This paper introduces ReaLiSe, a multimodal Chinese spell checker that effectively combines semantic, phonetic, and graphic information to improve error detection and correction in user-generated text.

Contribution

The paper presents a novel multimodal approach for Chinese spell checking that directly leverages semantic, phonetic, and graphic information of characters, outperforming previous heuristic-based methods.

Findings

01 ReaLiSe outperforms strong baselines on SIGHAN benchmarks.

02 Multimodal information significantly improves spell checking accuracy.

03 Selective modality mixing enhances correction performance.

Abstract

Chinese Spell Checking (CSC) aims to detect and correct erroneous characters in user-generated Chinese text. Most Chinese spelling errors are misuses of semantically, phonetically, or graphically similar characters. Previous attempts noticed this phenomenon and tried to exploit the similarity for this task. However, these methods rely on either heuristics or handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called ReaLiSe, which directly leverages the multimodal information of Chinese characters. The ReaLiSe model tackles the CSC task by (1) capturing the semantic, phonetic, and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a…
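The two steps named in the abstract can be illustrated with a small sketch. The abstract does not say how the "selective mixing" is computed, so everything below is an assumption made for illustration: each character is given one vector per modality, a scalar sigmoid gate per modality is computed from the concatenated vectors, and the mixed representation is the gate-weighted sum. The names `selective_mix` and `gate_weights` are hypothetical and are not taken from the paper or the DaDaMrX/ReaLiSe repository.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def sigmoid(x: float) -> float:
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def selective_mix(
    semantic: Vector,
    phonetic: Vector,
    graphic: Vector,
    gate_weights: Sequence[Vector],
) -> Tuple[List[float], List[float]]:
    """Mix three modality vectors for ONE character (illustrative assumption).

    gate_weights holds one weight vector per modality; each has the length
    of the concatenated [semantic; phonetic; graphic] vector.
    Returns (mixed_vector, gates).
    """
    modalities = [semantic, phonetic, graphic]
    # Concatenate all modalities so every gate sees the full character context.
    context = [value for vec in modalities for value in vec]
    # One scalar gate per modality, each in (0, 1).
    gates = [sigmoid(dot(w, context)) for w in gate_weights]
    dim = len(semantic)
    # Gate-weighted sum of the modality vectors, element by element.
    mixed = [
        sum(g * vec[i] for g, vec in zip(gates, modalities))
        for i in range(dim)
    ]
    return mixed, gates


if __name__ == "__main__":
    # Toy 2-dimensional vectors; real models would use learned embeddings.
    sem, pho, gra = [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]
    # All-zero gate weights give every modality the same gate value.
    zero_gates = [[0.0] * 6 for _ in range(3)]
    print(selective_mix(sem, pho, gra, zero_gates))
```

In a trained model the gate weights would be learned, so a character whose error is phonetic could receive a larger gate on the phonetic vector than on the graphic one. A mixed vector like this would then feed a classifier over the vocabulary to predict the corrected character.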

Code & Models

Repositories

DaDaMrX/ReaLiSe (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification