Phase transition on a context-sensitive random language model with short range interactions
Yuma Toji, Jun Takahashi, Vwani Roychowdhury, Hideyuki Miyahara

TL;DR
This study demonstrates that phase transitions in language models can occur due to intrinsic linguistic properties, even with only short-range interactions, challenging the notion that such transitions require long-range dependencies.
Contribution
The paper constructs a context-sensitive language model with short-range interactions and shows it exhibits phase transitions, highlighting their linguistic origin.
Findings
Phase transition occurs in a short-range interaction language model.
The model belongs to the class of context-sensitive grammars.
Finite-temperature phase transitions are driven by language's intrinsic properties.
Abstract
Since the random language model was proposed by E. DeGiuli [Phys. Rev. Lett. 122, 128301], language models have been investigated intensively from the viewpoint of statistical mechanics. Recently, the existence of a Berezinskii--Kosterlitz--Thouless transition was numerically demonstrated in models with long-range interactions between symbols. In statistical mechanics, it has long been known that long-range interactions can induce phase transitions. Therefore, it has remained unclear whether phase transitions observed in language models originate from genuinely linguistic properties that are absent in conventional spin models. In this study, we construct a random language model with short-range interactions and numerically investigate its statistical properties. Our model belongs to the class of context-sensitive grammars in the Chomsky hierarchy and allows explicit reference to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
