Finding structure in logographic writing with library learning
Guangyuan Jiang, Matthias Hofer, Jiayuan Mao, Lionel Wong, Joshua B., Tenenbaum, Roger P. Levy

TL;DR
This paper introduces a computational framework using library learning and program synthesis to uncover and analyze the structural principles of Chinese logographic writing, highlighting evolution towards efficiency.
Contribution
It presents a novel library learning-based method to discover linguistic structures in Chinese characters and explores their evolution under efficiency pressures.
Findings
Discovered known linguistic structures in Chinese writing
Revealed evolution towards simplification for efficiency
Demonstrated the utility of library learning in linguistic analysis
Abstract
One hallmark of human language is its combinatoriality -- reusing a relatively small inventory of building blocks to create a far larger inventory of increasingly complex structures. In this paper, we explore the idea that combinatoriality in language reflects a human inductive bias toward representational efficiency in symbol systems. We develop a computational framework for discovering structure in a writing system. Built on top of state-of-the-art library learning and program synthesis techniques, our computational framework discovers known linguistic structures in the Chinese writing system and reveals how the system evolves towards simplification under pressures for representational efficiency. We demonstrate how a library learning approach, utilizing learned abstractions and compression, may help reveal the fundamental computational principles that underlie the creation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLiteracy, Media, and Education · Education and Technology Integration · Library Science and Information Literacy
MethodsLib
