Efficient and Interpretable Neural Models for Entity Tracking

Shubham Toshniwal

arXiv:2208.14252·cs.CL·August 31, 2022

Efficient and Interpretable Neural Models for Entity Tracking

Shubham Toshniwal

PDF

Open Access

TL;DR

This paper proposes efficient, interpretable neural models for entity tracking in long texts, integrating them into language models to improve understanding and application in NLP tasks like question-answering and summarization.

Contribution

It introduces scalable, interpretable entity tracking models using fixed-dimensional vectors from pretrained models and advocates for their integration into language models.

Findings

01

Models effectively track entities in long documents.

02

Integration into language models enhances NLP task performance.

03

Efficient representations reduce computational costs.

Abstract

What would it take for a natural language model to understand a novel, such as The Lord of the Rings? Among other things, such a model must be able to: (a) identify and record new characters (entities) and their attributes as they are introduced in the text, and (b) identify subsequent references to the characters previously introduced and update their attributes. This problem of entity tracking is essential for language understanding, and thus, useful for a wide array of downstream applications in NLP such as question-answering, summarization. In this thesis, we focus on two key problems in relation to facilitating the use of entity tracking models: (i) scaling entity tracking models to long documents, such as a novel, and (ii) integrating entity tracking into language models. Applying language technologies to long documents has garnered interest recently, but computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques