INRIASAC: Simple Hypernym Extraction Methods

Gregory Grefenstette (TAO)

arXiv:1502.01271·cs.CL·January 7, 2016

INRIASAC: Simple Hypernym Extraction Methods

Gregory Grefenstette (TAO)

PDF

TL;DR

This paper introduces simple, heuristic-based methods for hypernym extraction from large text corpora, achieving top performance in the SemEval 2015 taxonomy structuring task.

Contribution

It presents straightforward techniques using Wikipedia data and heuristics that outperform more complex methods in hypernym extraction.

Findings

01

Ranked first in SemEval 2015 taxonomy task

02

Effective use of Wikipedia and simple heuristics

03

Achieved strong results with minimal complexity

Abstract

Given a set of terms from a given domain, how can we structure them into a taxonomy without manual intervention? This is the task 17 of SemEval 2015. Here we present our simple taxonomy structuring techniques which, despite their simplicity, ranked first in this 2015 benchmark. We use large quantities of text (English Wikipedia) and simple heuristics such as term overlap and document and sentence co-occurrence to produce hypernym lists. We describe these techniques and pre-sent an initial evaluation of results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.