Sense Embedding Learning for Word Sense Induction

Linfeng Song; Zhiguo Wang; Haitao Mi; Daniel Gildea

arXiv:1606.05409·cs.CL·June 23, 2016·2 cites

Sense Embedding Learning for Word Sense Induction

Linfeng Song, Zhiguo Wang, Haitao Mi, Daniel Gildea

PDF

Open Access

TL;DR

This paper introduces a sense embedding learning approach for word sense induction, using a joint training model to induce sense centroids and representing instances with contextual vectors, outperforming previous methods.

Contribution

It proposes a novel sense embedding learning method that jointly trains a model for the entire vocabulary under a multi-task framework for WSI.

Findings

01

Outperforms all participants on SemEval-2010 WSI dataset

02

Distributed sense vectors improve performance over count-based models

03

Joint training of sense centroids enhances induction accuracy

Abstract

Conventional word sense induction (WSI) methods usually represent each instance with discrete linguistic features or cooccurrence features, and train a model for each polysemous word individually. In this work, we propose to learn sense embeddings for the WSI task. In the training stage, our method induces several sense centroids (embedding) for each polysemous word. In the testing stage, our method represents each instance as a contextual vector, and induces its sense by finding the nearest sense centroid in the embedding space. The advantages of our method are (1) distributed sense vectors are taken as the knowledge representations which are trained discriminatively, and usually have better performance than traditional count-based distributional models, and (2) a general model for the whole vocabulary is jointly trained to induce sense centroids under the mutlitask learning framework.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems