Learning Word Embeddings for Hyponymy with Entailment-Based   Distributional Semantics

James Henderson

arXiv:1710.02437·cs.CL·October 9, 2017·2 cites

Learning Word Embeddings for Hyponymy with Entailment-Based Distributional Semantics

James Henderson

PDF

Open Access

TL;DR

This paper introduces a novel distributional semantic model that learns word embeddings for hyponymy by modeling entailment in vector space, leading to improved prediction of lexical hyponymy relations.

Contribution

It proposes a new entailment-based framework for learning word embeddings specifically tailored for hyponymy detection, outperforming previous models.

Findings

01

Embeddings outperform previous hyponymy prediction models.

02

Posterior vectors yield better results than evidence-based vectors.

03

Model performs well in both unsupervised and semi-supervised settings.

Abstract

Lexical entailment, such as hyponymy, is a fundamental issue in the semantics of natural language. This paper proposes distributional semantic models which efficiently learn word embeddings for entailment, using a recently-proposed framework for modelling entailment in a vector-space. These models postulate a latent vector for a pseudo-phrase containing two neighbouring word vectors. We investigate both modelling words as the evidence they contribute about this phrase vector, or as the posterior distribution of a one-word phrase vector, and find that the posterior vectors perform better. The resulting word embeddings outperform the best previous results on predicting hyponymy between words, in unsupervised and semi-supervised experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis