Lost in Context? On the Sense-wise Variance of Contextualized Word   Embeddings

Yile Wang; Yue Zhang

arXiv:2208.09669·cs.CL·August 23, 2022·1 cites

Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings

Yile Wang, Yue Zhang

PDF

Open Access

TL;DR

This paper investigates the variance of contextualized word embeddings across different contexts, revealing factors influencing this variance and proposing methods to mitigate position bias in sense disambiguation.

Contribution

It quantifies sense-wise variance in embeddings, analyzes influencing factors, and introduces a simple approach to reduce position bias in word sense disambiguation.

Findings

01

Embeddings can be highly consistent across contexts.

02

Part-of-speech and sentence length influence variance.

03

Position bias affects similarity of first words in contexts.

Abstract

Contextualized word embeddings in language models have given much advance to NLP. Intuitively, sentential information is integrated into the representation of words, which can help model polysemy. However, context sensitivity also leads to the variance of representations, which may break the semantic consistency for synonyms. We quantify how much the contextualized embeddings of each word sense vary across contexts in typical pre-trained models. Results show that contextualized embeddings can be highly consistent across contexts. In addition, part-of-speech, number of word senses, and sentence length have an influence on the variance of sense representations. Interestingly, we find that word representations are position-biased, where the first words in different contexts tend to be more similar. We analyze such a phenomenon and also propose a simple way to alleviate such bias in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems