Internal Knowledge Without External Expression: Probing the Generalization Boundary of a Classical Chinese Language Model
Jiuting Chen, Yuan Lian, Hao Wu, Tianqi Huang, Hiroshi Sasaki, Makoto Kouno, Jongil Choi

TL;DR
This study trains a Classical Chinese language model to explore its ability to internally recognize unknown inputs versus its external expression of uncertainty, revealing a dissociation between internal knowledge and external uncertainty markers.
Contribution
It demonstrates that language models encode factual knowledge internally but do not naturally express uncertainty externally without explicit training signals.
Findings
Model shows a significant perplexity increase for fabricated events, indicating internal factual encoding.
Externally, the model rarely uses epistemic markers to express uncertainty, reflecting training data conventions.
Uncertainty expression is driven by training data, not inherent metacognitive ability.
Abstract
We train a 318M-parameter Transformer language model from scratch on a curated corpus of 1.56 billion tokens of pure Classical Chinese, with zero English characters or Arabic numerals. Through systematic out-of-distribution (OOD) testing, we investigate whether the model can distinguish known from unknown inputs, and crucially, whether it can express this distinction in its generated text. We find a clear dissociation between internal and external uncertainty. Internally, the model exhibits a perplexity jump ratio of 2.39x between real and fabricated historical events (p = 8.9e-11, n = 92 per group), with semi-fabricated events (real figures + fictional events) showing the highest perplexity (4.24x, p = 1.1e-16), demonstrating genuine factual encoding beyond syntactic pattern matching. Externally, however, the model never learns to express uncertainty: classical Chinese epistemic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
