EntEval: A Holistic Evaluation Benchmark for Entity Representations

Mingda Chen; Zewei Chu; Yang Chen; Karl Stratos; Kevin Gimpel

arXiv:1909.00137·cs.CL·November 12, 2019

EntEval: A Holistic Evaluation Benchmark for Entity Representations

Mingda Chen, Zewei Chu, Yang Chen, Karl Stratos, Kevin Gimpel

PDF

Open Access 2 Repos

TL;DR

EntEval introduces a comprehensive benchmark for evaluating entity representations across various tasks and proposes training methods leveraging Wikipedia hyperlinks to enhance these representations.

Contribution

The paper presents a new holistic evaluation benchmark for entity representations and develops training techniques using Wikipedia hyperlinks to improve entity modeling.

Findings

01

Improved performance on multiple EntEval tasks using new training objectives.

02

Demonstrated effectiveness of hyperlink-based training techniques.

03

Established a standardized benchmark for entity representation quality.

Abstract

Rich entity representations are useful for a wide class of problems involving entities. Despite their importance, there is no standardized benchmark that evaluates the overall quality of entity representations. In this work, we propose EntEval: a test suite of diverse tasks that require nontrivial understanding of entities including entity typing, entity similarity, entity relation prediction, and entity disambiguation. In addition, we develop training techniques for learning better entity representations by using natural hyperlink annotations in Wikipedia. We identify effective objectives for incorporating the contextual information in hyperlinks into state-of-the-art pretrained language models and show that they improve strong baselines on multiple EntEval tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management