Coreference Resolution without Span Representations

Yuval Kirstain; Ori Ram; Omer Levy

arXiv:2101.00434·cs.CL·June 1, 2021

Coreference Resolution without Span Representations

Yuval Kirstain, Ori Ram, Omer Levy

PDF

1 Repo 1 Models

TL;DR

This paper presents a lightweight, efficient end-to-end coreference resolution model that eliminates the need for span representations, maintaining competitive performance while reducing memory usage.

Contribution

The authors introduce a novel coreference model that removes span representations and heuristics, simplifying the architecture and improving efficiency.

Findings

01

Performs competitively with standard models

02

Reduces memory footprint significantly

03

Enables processing of longer documents

Abstract

The introduction of pretrained language models has reduced many complex task-specific NLP models to simple lightweight layers. An exception to this trend is coreference resolution, where a sophisticated task-specific model is appended to a pretrained transformer encoder. While highly effective, the model has a very large memory footprint -- primarily due to dynamically-constructed span and span-pair representations -- which hinders the processing of complete documents and the ability to train on multiple instances in a single batch. We introduce a lightweight end-to-end coreference model that removes the dependency on span representations, handcrafted features, and heuristics. Our model performs competitively with the current standard model, while being simpler and more efficient.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuvalkirstain/s2e-coref
pytorchOfficial

Models

🤗
biu-nlp/f-coref
model· 279k dl· ♡ 19
279k dl♡ 19

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.