Stroke-Based Autoencoders: Self-Supervised Learners for Efficient   Zero-Shot Chinese Character Recognition

Zongze Chen; Wenxia Yang; Xin Li

arXiv:2207.08191·cs.CV·July 19, 2022

Stroke-Based Autoencoders: Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition

Zongze Chen, Wenxia Yang, Xin Li

PDF

Open Access

TL;DR

This paper introduces a stroke-based autoencoder that models Chinese character morphology using self-supervised learning, enabling effective zero-shot recognition and improved character representations by capturing detailed stroke information.

Contribution

The paper presents a novel stroke-based autoencoder architecture trained to reconstruct stroke sequences, enhancing zero-shot recognition and morphological understanding of Chinese characters.

Findings

01

Outperforms existing methods in zero-shot recognition accuracy.

02

Enriches Chinese word embeddings with morphological features.

03

Effective in recognizing unseen characters based on stroke information.

Abstract

Chinese characters carry a wealth of morphological and semantic information; therefore, the semantic enhancement of the morphology of Chinese characters has drawn significant attention. The previous methods were intended to directly extract information from a whole Chinese character image, which usually cannot capture both global and local information simultaneously. In this paper, we develop a stroke-based autoencoder(SAE), to model the sophisticated morphology of Chinese characters with the self-supervised method. Following its canonical writing order, we first represent a Chinese character as a series of stroke images with a fixed writing order, and then our SAE model is trained to reconstruct this stroke image sequence. This pre-trained SAE model can predict the stroke image series for unseen characters, as long as their strokes or radicals appeared in the training set. We have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques