CopyNext: Explicit Span Copying and Alignment in Sequence to Sequence   Models

Abhinav Singh; Patrick Xia; Guanghui Qin; Mahsa Yarmohammadi; Benjamin; Van Durme

arXiv:2010.15266·cs.CL·October 30, 2020

CopyNext: Explicit Span Copying and Alignment in Sequence to Sequence Models

Abhinav Singh, Patrick Xia, Guanghui Qin, Mahsa Yarmohammadi, Benjamin, Van Durme

PDF

TL;DR

CopyNext introduces an explicit span copying mechanism in seq2seq models, enabling precise token-level and span-level copying with explicit alignments, improving tasks like information extraction and achieving high accuracy and speed.

Contribution

It presents a novel explicit span copying and alignment method in seq2seq models, enhancing interpretability and efficiency for applications like NER.

Findings

01

Achieved near state-of-the-art accuracy on Nested NER.

02

Significantly increased decoding speed.

03

Provided explicit token and span alignments.

Abstract

Copy mechanisms are employed in sequence to sequence models (seq2seq) to generate reproductions of words from the input to the output. These frameworks, operating at the lexical type level, fail to provide an explicit alignment that records where each token was copied from. Further, they require contiguous token sequences from the input (spans) to be copied individually. We present a model with an explicit token-level copy operation and extend it to copying entire spans. Our model provides hard alignments between spans in the input and output, allowing for nontraditional applications of seq2seq, like information extraction. We demonstrate the approach on Nested Named Entity Recognition, achieving near state-of-the-art accuracy with an order of magnitude increase in decoding speed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.