Language Design as Information Renormalization

Angel J. Gallego; Roman Orus

arXiv:1708.01525·cs.CL·February 8, 2022

Language Design as Information Renormalization

Angel J. Gallego, Roman Orus

PDF

TL;DR

This paper links linguistic operations to physics concepts, modeling language as an information renormalization process using tensor networks, and explores quantum computing approaches for language modeling.

Contribution

It introduces a novel interpretation of the linguistic merge operation as information renormalization and formalizes it with tensor network models.

Findings

01

Language correlations decay polynomially, matching empirical observations.

02

Tensor network models efficiently represent sentence probabilities.

03

Quantum states can be used to simulate language models on quantum computers.

Abstract

Here we consider some well-known facts in syntax from a physics perspective, allowing us to establish equivalences between both fields with many consequences. Mainly, we observe that the operation MERGE, put forward by N. Chomsky in 1995, can be interpreted as a physical information coarse-graining. Thus, MERGE in linguistics entails information renormalization in physics, according to different time scales. We make this point mathematically formal in terms of language models. In this setting, MERGE amounts to a probability tensor implementing a coarse-graining, akin to a probabilistic context-free grammar. The probability vectors of meaningful sentences are given by stochastic tensor networks (TN) built from diagonal tensors and which are mostly loop-free, such as Tree Tensor Networks and Matrix Product States, thus being computationally very efficient to manipulate. We show that this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.