Tiered Clustering to Improve Lexical Entailment

John Wieting

arXiv:1412.0751·cs.CL·December 3, 2014

Open Access

TL;DR

This paper studies whether clustering words into senses, and representing each word with multiple context vectors instead of one, improves lexical entailment recognition. It reports some improvement over the single-vector versions of two existing vector space approaches.

Contribution

It introduces a method of clustering words into senses to extend and improve two existing lexical entailment algorithms.

Findings

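01 Clustering words into senses offers some improvement in entailment recognition.

02 The gain comes from representing each word with multiple context vectors, one per sense, rather than a single vector.

03 The multi-sense extensions are applied to both an asymmetric similarity measure and a supervised classifier.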

Abstract

Many tasks in Natural Language Processing involve recognizing lexical entailment. Two quite different approaches to this problem have been proposed recently. The first is an asymmetric similarity measure designed to give high scores when the contexts of the narrower term in the entailment are a subset of those of the broader term. The second is a supervised approach where a classifier is learned to predict entailment given a concatenated latent vector representation of the word. Both of these approaches are vector space models that use a single context vector as a representation of the word. In this work, I study the effects of clustering words into senses and using these multiple context vectors to infer entailment using extensions of these two algorithms. I find that this approach offers some improvement to these entailment algorithms.
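The first approach in the abstract, together with the multi-sense idea, can be illustrated with a small sketch. This is not the paper's implementation. The scoring function is a generic inclusion-style measure chosen for illustration, taking the maximum over sense pairs is an assumed aggregation rule, and the names `inclusion_score` and `multi_sense_score` and the toy context counts are all hypothetical.

```python
from typing import Dict, List

# A context vector maps a context feature (e.g. a co-occurring word) to a weight.
Vector = Dict[str, float]


def inclusion_score(narrow: Vector, broad: Vector) -> float:
    """Asymmetric score: the share of the narrower term's context weight
    that falls on contexts also observed with the broader term.
    Illustrative stand-in, not the measure used in the paper."""
    total = sum(narrow.values())
    if total == 0:
        return 0.0
    shared = sum(w for ctx, w in narrow.items() if broad.get(ctx, 0.0) > 0)
    return shared / total


def multi_sense_score(narrow_senses: List[Vector], broad_senses: List[Vector]) -> float:
    """Score every pair of sense vectors and keep the best one.
    Taking the max is an assumption made for this sketch."""
    return max(
        (inclusion_score(n, b) for n in narrow_senses for b in broad_senses),
        default=0.0,
    )


# Toy data (invented): "bass" has a fish sense and a music sense.
bass_fish: Vector = {"swim": 2.0, "catch": 1.0}
bass_music: Vector = {"play": 2.0, "guitar": 1.0}
bass_single: Vector = {**bass_fish, **bass_music}  # one vector mixing both senses
fish: Vector = {"swim": 3.0, "catch": 2.0, "eat": 1.0}

print(inclusion_score(bass_single, fish))               # single-vector score
print(multi_sense_score([bass_fish, bass_music], [fish]))  # multi-sense score
```

With these toy counts the single mixed vector scores 0.5, because half of its weight sits on music contexts that "fish" never sees, while the multi-sense version scores 1.0 through the fish sense. That is the intuition behind splitting a word into senses before testing entailment. The measure is also asymmetric: swapping the arguments generally gives a different score.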

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies