Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Pengfei Hu, Jiefeng Ma, Zhenrong Zhang, Jun Du, Jianshu Zhang

TL;DR
This paper introduces CDF, a novel approach for handwritten Chinese character error correction that improves generalization to unseen misspelled characters by explicitly modeling radical counts and using a fetcher for correction.
Contribution
The paper proposes the Count, Decode and Fetch (CDF) method, which enhances error correction by explicitly modeling radical counts and employing a fetcher, improving generalization to unseen errors.
Findings
Significantly improves error correction performance.
Enhances generalization to unseen misspelled characters.
Integrates with existing encoder-decoder models effectively.
Abstract
Recently, handwritten Chinese character error correction has been greatly improved by employing encoder-decoder methods to decompose a Chinese character into an ideographic description sequence (IDS). However, existing methods implicitly capture and encode linguistic information inherent in IDS sequences, leading to a tendency to generate IDS sequences that match seen characters. This poses a challenge when dealing with an unseen misspelled character, as the decoder may generate an IDS sequence that matches a seen character instead. Therefore, we introduce Count, Decode and Fetch (CDF), a novel approach that exhibits better generalization towards unseen misspelled characters. CDF is mainly composed of three parts: the counter, the decoder, and the fetcher. In the first stage, the counter predicts the number of each radical class without the symbol-level position annotations. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsAttentive Walk-Aggregating Graph Neural Network
