TL;DR
This paper introduces an unsupervised learning framework called STSN that effectively separates structure and texture features in oracle character images, enabling improved recognition across different data domains without requiring labeled scanned data.
Contribution
The novel STSN model performs joint disentanglement, domain adaptation, and recognition for oracle characters, addressing the challenge of scarce labeled scanned data.
Findings
STSN outperforms existing adaptation methods on Oracle-241 dataset.
It improves recognition accuracy of scanned oracle characters, even with noise and damage.
The model effectively disentangles structure and texture features for better domain transfer.
Abstract
Oracle bone script is the earliest-known Chinese writing system of the Shang dynasty and is precious to archeology and philology. However, real-world scanned oracle data are rare and few experts are available for annotation which make the automatic recognition of scanned oracle characters become a challenging task. Therefore, we aim to explore unsupervised domain adaptation to transfer knowledge from handprinted oracle data, which are easy to acquire, to scanned domain. We propose a structure-texture separation network (STSN), which is an end-to-end learning framework for joint disentanglement, transformation, adaptation and recognition. First, STSN disentangles features into structure (glyph) and texture (noise) components by generative models, and then aligns handprinted and scanned data in structure feature space such that the negative influence caused by serious noises can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
