Ultrametric Distance in Syntax
Mark D. Roberts

TL;DR
This paper applies ultrametric analysis to syntactic phrase trees, providing a new way to measure hierarchical complexity and redefine syntactic relations like c-command and government.
Contribution
It introduces an ultrametric framework for syntactic trees, resolving branching height ambiguity and offering new definitions for syntactic relations.
Findings
Ultrametric triangles are not equilateral in syntax.
A minimum ultrametric distance between lexical categories is calculated.
Ultrametric definitions of c-command and government are proposed.
Abstract
Phrase structure trees have a hierarchical structure. In many subjects, most notably in Taxonomy such tree structures have been studied using ultrametrics. Here syntactical hierarchical phrase trees are subject to a similar analysis, which is much siompler as the branching structure is more readily discernible and switched. The occurence of hierarchical structure elsewhere in linguistics is mentioned. The phrase tree can be represented by a matrix and the elements of the matrix can be represented by triangles. The height at which branching occurs is not prescribed in previous syntatic models, but it is by using the ultrametric matrix. The ambiguity of which branching height to choose is resolved by postulating that branching occurs at the lowest height available. An ultrametric produces a measure of the complexity of sentences: presumably the complexity of sentence increases as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Advanced Text Analysis Techniques · Data Visualization and Analytics
