Entity Identification as Multitasking

Karl Stratos

arXiv:1612.02706·cs.CL·July 24, 2017

Entity Identification as Multitasking

Karl Stratos

PDF

1 Repo

TL;DR

This paper introduces a multitasking neural architecture for entity identification that separates boundary detection and type prediction, achieving linear scaling and producing type-disambiguating embeddings, thus improving efficiency and interpretability.

Contribution

It proposes a novel neural model that jointly optimizes boundary detection and type prediction as separate tasks, addressing quadratic complexity and lack of segment-level representation.

Findings

01

Performs competitively with BiLSTM-CRFs

02

Scales linearly with number of types

03

Induces type-disambiguating mention embeddings

Abstract

Standard approaches in entity identification hard-code boundary detection and type prediction into labels (e.g., John/B-PER Smith/I-PER) and then perform Viterbi. This has two disadvantages: 1. the runtime complexity grows quadratically in the number of types, and 2. there is no natural segment-level representation. In this paper, we propose a novel neural architecture that addresses these disadvantages. We frame the problem as multitasking, separating boundary detection and type prediction but optimizing them jointly. Despite its simplicity, this architecture performs competitively with fully structured models such as BiLSTM-CRFs while scaling linearly in the number of types. Furthermore, by construction, the model induces type-disambiguating embeddings of predicted mentions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

karlstratos/mention2vec
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.