Corpus-level Fine-grained Entity Typing

Yadollah Yaghoobzadeh; Heike Adel; Hinrich Sch\"utze

arXiv:1708.02275·cs.CL·June 11, 2018

Corpus-level Fine-grained Entity Typing

Yadollah Yaghoobzadeh, Heike Adel, Hinrich Sch\"utze

PDF

TL;DR

This paper introduces FIGMENT, an embedding-based approach for corpus-level entity typing that combines global and context models with multi-level representations and noise mitigation techniques, improving knowledge base completion.

Contribution

It proposes a novel embedding-based framework with multi-level representations and noise reduction algorithms for more accurate entity typing from large corpora.

Findings

01

Effective in large entity typing dataset from Freebase

02

Multi-level representations outperform single-level models

03

Noise mitigation improves model performance

Abstract

This paper addresses the problem of corpus-level entity typing, i.e., inferring from a large corpus that an entity is a member of a class such as "food" or "artist". The application of entity typing we are interested in is knowledge base completion, specifically, to learn which classes an entity is a member of. We propose FIGMENT to tackle this problem. FIGMENT is embedding- based and combines (i) a global model that scores based on aggregated contextual information of an entity and (ii) a context model that first scores the individual occurrences of an entity and then aggregates the scores. Each of the two proposed models has some specific properties. For the global model, learning high quality entity representations is crucial because it is the only source used for the predictions. Therefore, we introduce representations using name and contexts of entities on the three levels of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.