Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu,, Lianwen Jin, Xiang Bai, Yuliang Liu

TL;DR
This paper presents Puzzle Pieces Picker (P3), a novel Transformer-based method for reconstructing and deciphering ancient Chinese characters from Oracle Bone Inscriptions through radical analysis, supported by a new detailed dataset.
Contribution
Introduces P3, a Transformer-based approach for radical reconstruction of ancient Chinese characters, along with a comprehensive dataset for paleographic analysis.
Findings
Demonstrated the effectiveness of P3 in reconstructing ancient characters
Developed the ACCP dataset with detailed radical annotations
Showed promising results in deciphering complex ancient scripts
Abstract
Oracle Bone Inscriptions is one of the oldest existing forms of writing in the world. However, due to the great antiquity of the era, a large number of Oracle Bone Inscriptions (OBI) remain undeciphered, making it one of the global challenges in the field of paleography today. This paper introduces a novel approach, namely Puzzle Pieces Picker (P), to decipher these enigmatic characters through radical reconstruction. We deconstruct OBI into foundational strokes and radicals, then employ a Transformer model to reconstruct them into their modern (conterpart)\textcolor{blue}{counterparts}, offering a groundbreaking solution to ancient script analysis. To further this endeavor, a new Ancient Chinese Character Puzzles (ACCP) dataset was developed, comprising an extensive collection of character images from seven key historical stages, annotated with detailed radical sequences. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTranslation Studies and Practices · Ideological and Political Education · Computational and Text Analysis Methods
MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention
