Investigating Cross-Linguistic Adjective Ordering Tendencies with a   Latent-Variable Model

Jun Yen Leung; Guy Emerson; Ryan Cotterell

arXiv:2010.04755·cs.CL·October 13, 2020

Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model

Jun Yen Leung, Guy Emerson, Ryan Cotterell

PDF

TL;DR

This paper introduces a corpus-driven latent-variable model that accurately predicts adjective orderings across 24 languages, providing evidence for universal hierarchical tendencies in adjective sequencing.

Contribution

It presents the first purely corpus-based model for multilingual adjective ordering, demonstrating cross-linguistic universal tendencies with high accuracy.

Findings

01

Model accurately orders adjectives in 24 languages

02

Provides evidence for universal hierarchical adjective ordering

03

Works across different training and testing languages

Abstract

Across languages, multiple consecutive adjectives modifying a noun (e.g. "the big red dog") follow certain unmarked ordering rules. While explanatory accounts have been put forward, much of the work done in this area has relied primarily on the intuitive judgment of native speakers, rather than on corpus data. We present the first purely corpus-driven model of multi-lingual adjective ordering in the form of a latent-variable model that can accurately order adjectives across 24 different languages, even when the training and testing languages are different. We utilize this novel statistical model to provide strong converging evidence for the existence of universal, cross-linguistic, hierarchical adjective ordering tendencies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.