Word class flexibility: A deep contextualized approach

Bai Li; Guillaume Thomas; Yang Xu; Frank Rudzicz

arXiv:2009.09241·cs.CL·September 22, 2020

Word class flexibility: A deep contextualized approach

Bai Li, Guillaume Thomas, Yang Xu, Frank Rudzicz

PDF

Open Access 2 Repos

TL;DR

This paper introduces a new methodology using deep contextualized embeddings to quantify and analyze word class flexibility across 37 languages, revealing shared tendencies and directional semantic shifts.

Contribution

It presents a novel approach leveraging contextualized embeddings to measure word class flexibility systematically across multiple languages.

Findings

01

Contextualized embeddings align with human judgments of class variation.

02

Shared tendencies in class flexibility are observed across languages.

03

Greater semantic variation occurs when flexible words are used in their dominant class.

Abstract

Word class flexibility refers to the phenomenon whereby a single word form is used across different grammatical categories. Extensive work in linguistic typology has sought to characterize word class flexibility across languages, but quantifying this phenomenon accurately and at scale has been fraught with difficulties. We propose a principled methodology to explore regularity in word class flexibility. Our method builds on recent work in contextualized word embeddings to quantify semantic shift between word classes (e.g., noun-to-verb, verb-to-noun), and we apply this method to 37 languages. We find that contextualized embeddings not only capture human judgment of class variation within words in English, but also uncover shared tendencies in class flexibility across languages. Specifically, we find greater semantic variation when flexible lemmas are used in their dominant word class,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Linguistic Variation and Morphology · Topic Modeling