Modelling Lexical Ambiguity with Density Matrices

Francois Meyer; Martha Lewis

arXiv:2010.05670·cs.CL·October 13, 2020

Modelling Lexical Ambiguity with Density Matrices

Francois Meyer, Martha Lewis

PDF

TL;DR

This paper introduces neural models that learn density matrices to better represent lexical ambiguity, especially homonymy, within compositional distributional semantics, outperforming existing vector-based models.

Contribution

It presents three novel neural models for learning density matrices from text corpora, enhancing the modeling of lexical ambiguity in compositional semantics.

Findings

01

Best model outperforms existing vector-based models

02

Density matrices effectively discriminate between word senses

03

Neural models improve sense disambiguation in compositional tasks

Abstract

Words can have multiple senses. Compositional distributional models of meaning have been argued to deal well with finer shades of meaning variation known as polysemy, but are not so well equipped to handle word senses that are etymologically unrelated, or homonymy. Moving from vectors to density matrices allows us to encode a probability distribution over different senses of a word, and can also be accommodated within a compositional distributional model of meaning. In this paper we present three new neural models for learning density matrices from a corpus, and test their ability to discriminate between word senses on a range of compositional datasets. When paired with a particular composition method, our best model outperforms existing vector-based compositional models as well as strong sentence encoders.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.