Word class flexibility: A deep contextualized approach
Bai Li, Guillaume Thomas, Yang Xu, Frank Rudzicz

TL;DR
This paper introduces a new methodology using deep contextualized embeddings to quantify and analyze word class flexibility across 37 languages, revealing shared tendencies and directional semantic shifts.
Contribution
It presents a novel approach leveraging contextualized embeddings to measure word class flexibility systematically across multiple languages.
Findings
Contextualized embeddings align with human judgments of class variation.
Shared tendencies in class flexibility are observed across languages.
Greater semantic variation occurs when flexible words are used in their dominant class.
Abstract
Word class flexibility refers to the phenomenon whereby a single word form is used across different grammatical categories. Extensive work in linguistic typology has sought to characterize word class flexibility across languages, but quantifying this phenomenon accurately and at scale has been fraught with difficulties. We propose a principled methodology to explore regularity in word class flexibility. Our method builds on recent work in contextualized word embeddings to quantify semantic shift between word classes (e.g., noun-to-verb, verb-to-noun), and we apply this method to 37 languages. We find that contextualized embeddings not only capture human judgment of class variation within words in English, but also uncover shared tendencies in class flexibility across languages. Specifically, we find greater semantic variation when flexible lemmas are used in their dominant word class,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Linguistic Variation and Morphology · Topic Modeling
