Textual Analysis for Studying Chinese Historical Documents and Literary Novels
Chao-Lin Liu, Guan-Tao Jin, Hongsu Wang, Qing-Feng Liu, Wen-Huei Cheng, Wei-Yun Chiu, Richard Tzong-Han Tsai, Yu-Chun Wang

TL;DR
This paper demonstrates how computational textual analysis can uncover insights in Chinese historical and literary texts, addressing challenges in disambiguation and exploring thematic questions in classical novels.
Contribution
It introduces methods for analyzing Chinese historical and literary documents, including disambiguation of names and thematic exploration of classical novels.
Findings
Identified the most powerful monster in Journey to the West
Analyzed character roles in Dream of the Red Chamber
Showcased potential of computer-assisted Chinese literature analysis
Abstract
We analyzed historical and literary documents in Chinese to gain insights into research issues, and in this paper we give an overview of our studies, which utilized four different sources of text materials. We investigated the history of concepts and transliterated words in China with the Database for the Study of Modern China Thought and Literature, which contains historical documents about China between 1830 and 1930. We also attempted to disambiguate names that were shared by multiple government officers who served between 618 and 1912 and were recorded in Chinese local gazetteers. To showcase the potential and challenges of computer-assisted analysis of Chinese literature, we explored some interesting yet non-trivial questions about two of the Four Great Classical Novels of China: (1) Which monsters attempted to consume the Buddhist monk Xuanzang in the Journey to the West (JTTW), which was…
Taxonomy
Topics: Digital Humanities and Scholarship · Computational and Text Analysis Methods · Advanced Text Analysis Techniques
