When Hearst Is not Enough: Improving Hypernymy Detection from Corpus   with Distributional Models

Changlong Yu; Jialong Han; Peifeng Wang; Yangqiu Song; Hongming Zhang,; Wilfred Ng; Shuming Shi

arXiv:2010.04941·cs.CL·October 13, 2020·1 cites

When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models

Changlong Yu, Jialong Han, Peifeng Wang, Yangqiu Song, Hongming Zhang,, Wilfred Ng, Shuming Shi

PDF

Open Access 1 Repo

TL;DR

This paper improves hypernymy detection by combining pattern-based and distributional models, addressing their individual limitations and demonstrating enhanced performance and interpretability on benchmark datasets.

Contribution

It introduces a complementary framework that integrates pattern-based and distributional methods for hypernymy detection, especially in sparsity cases.

Findings

01

The framework achieves competitive improvements on benchmark datasets.

02

Distributional methods effectively complement pattern-based approaches in sparsity cases.

03

The combined approach offers better interpretability of hypernymy detection results.

Abstract

We address hypernymy detection, i.e., whether an is-a relationship exists between words (x, y), with the help of large textual corpora. Most conventional approaches to this task have been categorized to be either pattern-based or distributional. Recent studies suggest that pattern-based ones are superior, if large-scale Hearst pairs are extracted and fed, with the sparsity of unseen (x, y) pairs relieved. However, they become invalid in some specific sparsity cases, where x or y is not involved in any pattern. For the first time, this paper quantifies the non-negligible existence of those specific cases. We also demonstrate that distributional methods are ideal to make up for pattern-based ones in such cases. We devise a complementary framework, under which a pattern-based and a distributional model collaborate seamlessly in cases which they each prefer. On several benchmark datasets,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HKUST-KnowComp/ComHyper
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification