Generating Categories for Sets of Entities

Shuo Zhang; Krisztian Balog; Jamie Callan

arXiv:2008.08428·cs.IR·August 20, 2020

Generating Categories for Sets of Entities

Shuo Zhang, Krisztian Balog, Jamie Callan

PDF

TL;DR

This paper introduces a neural network-based method to automatically generate and rank category suggestions for entity sets, aiding knowledge base expansion and organization.

Contribution

It presents a novel approach combining neural abstractive summarization with hierarchical ranking to improve category generation for knowledge bases.

Findings

01

Effective candidate category generation demonstrated on Wikipedia data

02

Improved ranking accuracy using structure, content, and hierarchy features

03

Enhanced support for knowledge editors in expanding category systems

Abstract

Category systems are central components of knowledge bases, as they provide a hierarchical grouping of semantically related concepts and entities. They are a unique and valuable resource that is utilized in a broad range of information access tasks. To aid knowledge editors in the manual process of expanding a category system, this paper presents a method of generating categories for sets of entities. First, we employ neural abstractive summarization models to generate candidate categories. Next, the location within the hierarchy is identified for each candidate. Finally, structure-, content-, and hierarchy-based features are used to rank candidates to identify by the most promising ones (measured in terms of specificity, hierarchy, and importance). We develop a test collection based on Wikipedia categories and demonstrate the effectiveness of the proposed approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.