Instantiation

Abhijeet Gupta; Gemma Boleda; Sebastian Pado

arXiv:1808.01662·cs.CL·October 15, 2021

TL;DR

This paper investigates the linguistic and computational modeling of instantiation relations between entities and categories, introducing a new dataset and analyzing distributional properties to improve detection methods.

Contribution

The paper introduces a new dataset for instantiation detection and analyzes the distributional properties of entities and categories to improve modeling approaches.

Findings

1. Entities form regions in distributional space.

2. Category embeddings are often outside entity regions.

3. Using entity-based category representations improves instantiation detection.
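
One way to picture the third finding is to represent a category by the centroid of the embeddings of its known instances, rather than by the embedding of the category word itself, and then score a new entity against that centroid. The sketch below illustrates the idea only. The centroid construction, the cosine scoring, the function names, and the toy 3-dimensional vectors are all assumptions made for this example; this summary does not describe the paper's actual models or data.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


# Toy entity embeddings, invented for illustration (not from the paper).
entity_vectors = {
    "Marie Curie": [0.9, 0.1, 0.0],
    "Albert Einstein": [0.8, 0.2, 0.1],
    "Isaac Newton": [0.7, 0.1, 0.2],
    "Mumbai": [0.1, 0.9, 0.2],
    "Paris": [0.0, 0.8, 0.3],
    "Berlin": [0.1, 0.7, 0.2],
}

# Hypothetical training pairs: entities already known to instantiate a category.
known_instances = {
    "scientist": ["Albert Einstein", "Isaac Newton"],
    "city": ["Paris", "Berlin"],
}

# Entity-based category representation: the centroid of known instances.
category_reps = {
    category: centroid([entity_vectors[name] for name in names])
    for category, names in known_instances.items()
}


def predict_category(entity):
    """Return the category whose centroid is most similar to the entity."""
    vector = entity_vectors[entity]
    return max(category_reps, key=lambda c: cosine(vector, category_reps[c]))


if __name__ == "__main__":
    for held_out in ("Marie Curie", "Mumbai"):
        print(held_out, "->", predict_category(held_out))
```

Because each category vector is built from entity vectors, it lies inside the region occupied by its instances, which is the intuition behind findings 1 and 2. With real embeddings the regions would overlap far more than in this toy data.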

Abstract

In computational linguistics, a large body of work exists on distributed modeling of lexical relations, focussing largely on lexical relations such as hypernymy (scientist -- person) that hold between two categories, as expressed by common nouns. In contrast, computational linguistics has paid little attention to entities denoted by proper nouns (Marie Curie, Mumbai, ...). These have been investigated in detail by the Knowledge Representation and Semantic Web communities, but generally not with regard to their linguistic properties. Our paper closes this gap by investigating and modeling the lexical relation of instantiation, which holds between an entity-denoting and a category-denoting expression (Marie Curie -- scientist or Mumbai -- city). We present a new, principled dataset for the task of instantiation detection as well as experiments and analyses on this dataset. We obtain the…
