Patterns of Lexical Ambiguity in Contextualised Language Models
Janosch Haber, Massimo Poesio

TL;DR
This paper examines how well contextualised language models, especially BERT Large, capture lexical ambiguity distinctions like polysemy and homonymy, using a new human-annotated dataset of word sense similarities.
Contribution
Introduces an extended dataset with human judgments on word sense similarity and co-predication acceptability, providing a new benchmark for evaluating contextualised embeddings.
Findings
BERT Large shows strong correlation with human similarity ratings.
Models differentiate homonyms and some polysemy types but struggle with others.
The dataset captures complex lexical ambiguity patterns.
Abstract
One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts. In this paper we investigate the extent to which the contextualised embeddings of word forms that display multiplicity of sense reflect traditional distinctions of polysemy and homonymy. To this end, we introduce an extended, human-annotated dataset of graded word sense similarity and co-predication acceptability, and evaluate how well the similarity of embeddings predicts similarity in meaning. Both types of human judgements indicate that the similarity of polysemic interpretations falls in a continuum between identity of meaning and homonymy. However, we also observe significant differences within the similarity ratings of polysemes, forming consistent patterns for different types of polysemic sense alternation. Our dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Test · Linear Layer · Attention Dropout · Weight Decay · Linear Warmup With Linear Decay · Residual Connection · Softmax
