Tiered Clustering to Improve Lexical Entailment
John Wieting

TL;DR
This paper explores how clustering words into senses and using multiple context vectors can enhance lexical entailment recognition in NLP, improving existing vector space models.
Contribution
It introduces a method of clustering words into senses to extend and improve two existing lexical entailment algorithms.
Findings
Clustering into senses improves entailment detection accuracy.
Using multiple context vectors enhances model performance.
Extensions outperform single-vector approaches in experiments.
Abstract
Many tasks in Natural Language Processing involve recognizing lexical entailment. Two different approaches to this problem have been proposed recently that are quite different from each other. The first is an asymmetric similarity measure designed to give high scores when the contexts of the narrower term in the entailment are a subset of those of the broader term. The second is a supervised approach where a classifier is learned to predict entailment given a concatenated latent vector representation of the word. Both of these approaches are vector space models that use a single context vector as a representation of the word. In this work, I study the effects of clustering words into senses and using these multiple context vectors to infer entailment using extensions of these two algorithms. I find that this approach offers some improvement to these entailment algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
