sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings

Andrew Trask; Phil Michalak; John Liu

arXiv:1511.06388 · cs.CL · November 23, 2015 · 139 cites

TL;DR

sense2vec introduces a fast, supervised method for word sense disambiguation in neural word embeddings. It aims to improve accuracy and efficiency for downstream NLP tasks, handles nuanced senses, and applies across multiple languages.

Contribution

It presents a novel supervised approach for modeling multiple word senses in embeddings, addressing efficiency and application challenges of prior methods.

Findings

01

Disambiguates contrastive and nuanced senses effectively.

02

Achieves over 8% error reduction in dependency parsing.

03

Demonstrates broad applicability across languages.

Abstract

Neural word representations have proven useful in Natural Language Processing (NLP) tasks due to their ability to efficiently model complex semantic and syntactic word relationships. However, most techniques model only one representation per word, despite the fact that a single word can have multiple meanings or "senses". Some techniques model words by using multiple vectors that are clustered based on context. However, recent neural approaches rarely focus on the application to a consuming NLP algorithm. Furthermore, the training process of recent word-sense models is expensive relative to single-sense embedding processes. This paper presents a novel approach which addresses these concerns by modeling multiple embeddings for each word based on supervised disambiguation, which provides a fast and accurate way for a consuming NLP model to select a sense-disambiguated embedding. We…
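The core idea in the abstract, one embedding per (word, sense label) pair selected through supervised labels, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `word|TAG` key format, the helper names `sense_key`, `build_sense_table`, and `lookup`, the hand-written part-of-speech tags, and the random vectors that stand in for trained embeddings. It is not the authors' implementation.

```python
import random

def sense_key(word, tag):
    # Hypothetical key format: lowercased surface form joined to its label.
    return f"{word.lower()}|{tag}"

def build_sense_table(tagged_corpus, dim=4, seed=0):
    # Allocate one vector per distinct (word, label) pair.
    # The vectors are random placeholders, NOT trained embeddings;
    # a real pipeline would train them on the relabeled corpus.
    rng = random.Random(seed)
    table = {}
    for sentence in tagged_corpus:
        for word, tag in sentence:
            key = sense_key(word, tag)
            if key not in table:
                table[key] = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    return table

def lookup(table, word, tag):
    # A consuming model that already has a label for the token
    # selects the matching sense vector with a single dictionary lookup.
    return table.get(sense_key(word, tag))

# Labels are hand-written here; in practice they would come from a
# supervised tagger such as a part-of-speech or named-entity model.
tagged_corpus = [
    [("I", "PRON"), ("duck", "VERB"), ("quickly", "ADV")],
    [("The", "DET"), ("duck", "NOUN"), ("swims", "VERB")],
]

table = build_sense_table(tagged_corpus)
verb_vec = lookup(table, "duck", "VERB")
noun_vec = lookup(table, "duck", "NOUN")
```

Because the sense is chosen by a label the consuming model already has, selection is a plain lookup with no clustering step at inference time, which is the efficiency argument the abstract makes.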

Code & Models

Repositories

explosion/sense2vec

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis