Language Design as Information Renormalization
Angel J. Gallego, Roman Orus

TL;DR
This paper links linguistic operations to physics concepts, modeling language as an information renormalization process using tensor networks, and explores quantum computing approaches for language modeling.
Contribution
It introduces a novel interpretation of the linguistic merge operation as information renormalization and formalizes it with tensor network models.
Findings
Language correlations decay polynomially, matching empirical observations.
Tensor network models efficiently represent sentence probabilities.
Quantum states can be used to simulate language models on quantum computers.
Abstract
Here we consider some well-known facts in syntax from a physics perspective, allowing us to establish equivalences between both fields with many consequences. Mainly, we observe that the operation MERGE, put forward by N. Chomsky in 1995, can be interpreted as a physical information coarse-graining. Thus, MERGE in linguistics entails information renormalization in physics, according to different time scales. We make this point mathematically formal in terms of language models. In this setting, MERGE amounts to a probability tensor implementing a coarse-graining, akin to a probabilistic context-free grammar. The probability vectors of meaningful sentences are given by stochastic tensor networks (TN) built from diagonal tensors and which are mostly loop-free, such as Tree Tensor Networks and Matrix Product States, thus being computationally very efficient to manipulate. We show that this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
