Multi-sense embeddings through a word sense disambiguation process

Terry Ruas; William Grosky; Akiko Aizawa

arXiv:2101.08700·cs.CL·December 20, 2022

Multi-sense embeddings through a word sense disambiguation process

Terry Ruas, William Grosky, Akiko Aizawa

PDF

1 Repo

TL;DR

This paper introduces a novel unsupervised method called MSSA for disambiguating words by sense, enhancing semantic representations with multi-sense embeddings, and demonstrates its effectiveness across multiple benchmarks.

Contribution

The paper presents an unsupervised disambiguation technique, a multi-sense embedding model adaptable to existing algorithms, and a reusable methodology for refining semantic representations.

Findings

01

Achieved state-of-the-art results on six word similarity benchmarks.

02

Outperformed several complex existing systems in semantic tasks.

03

Provided a flexible approach applicable to various embedding algorithms.

Abstract

Natural Language Understanding has seen an increasing number of publications in the last few years, especially after robust word embeddings models became prominent, when they proved themselves able to capture and represent semantic relationships from massive amounts of data. Nevertheless, traditional models often fall short in intrinsic issues of linguistics, such as polysemy and homonymy. Any expert system that makes use of natural language in its core, can be affected by a weak semantic representation of text, resulting in inaccurate outcomes based on poor decisions. To mitigate such issues, we propose a novel approach called Most Suitable Sense Annotation (MSSA), that disambiguates and annotates each word by its specific sense, considering the semantic effects of its context. Our approach brings three main contributions to the semantic representation scenario: (i) an unsupervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

truas/MSSA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.