Syntactic Structures and Code Parameters

Kevin Shu; Matilde Marcolli

arXiv:1610.00311·cs.CL·October 4, 2016

Syntactic Structures and Code Parameters

Kevin Shu, Matilde Marcolli

PDF

Open Access

TL;DR

This paper explores the application of error-correcting codes to syntactic structures of languages, revealing their distribution in code parameter space and how language change models can influence code properties.

Contribution

It introduces a novel approach of assigning error-correcting codes to syntactic data and analyzes their distribution, including codes surpassing classical bounds, and studies language change dynamics through spin glass models.

Findings

01

Many codes lie above classical bounds like Gilbert-Varshamov and Plotkin.

02

Language change dynamics can improve code parameters under certain entailment relations.

03

Analysis of large language sets reveals insights into spin glass dynamics.

Abstract

We assign binary and ternary error-correcting codes to the data of syntactic structures of world languages and we study the distribution of code points in the space of code parameters. We show that, while most codes populate the lower region approximating a superposition of Thomae functions, there is a substantial presence of codes above the Gilbert-Varshamov bound and even above the asymptotic bound and the Plotkin bound. We investigate the dynamics induced on the space of code parameters by spin glass models of language change, and show that, in the presence of entailment relations between syntactic parameters the dynamics can sometimes improve the code. For large sets of languages and syntactic data, one can gain information on the spin glass dynamics from the induced dynamics in the space of code parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFractal and DNA sequence analysis · Error Correcting Code Techniques · Algorithms and Data Compression