Highly Parallel Autoregressive Entity Linking with Discriminative Correction
Nicola De Cao, Wilker Aziz, Ivan Titov

TL;DR
This paper introduces a highly parallel and efficient autoregressive entity linking model that significantly reduces computational costs and improves accuracy by combining generative and discriminative training components.
Contribution
The authors propose a novel parallel autoregressive entity linking method with a shallow decoder and a discriminative correction term, enhancing speed and accuracy over previous generative approaches.
Findings
Model is over 70 times faster than previous methods.
Achieves state-of-the-art accuracy on AIDA-CoNLL dataset.
Outperforms existing approaches in both speed and precision.
Abstract
Generative approaches have been recently shown to be effective for both Entity Disambiguation and Entity Linking (i.e., joint mention detection and disambiguation). However, the previously proposed autoregressive formulation for EL suffers from i) high computational cost due to a complex (deep) decoder, ii) non-parallelizable decoding that scales with the source sequence length, and iii) the need for training on a large amount of data. In this work, we propose a very efficient approach that parallelizes autoregressive linking across all potential mentions and relies on a shallow and efficient decoder. Moreover, we augment the generative objective with an extra discriminative component, i.e., a correction term which lets us directly optimize the generator's ranking. When taken together, these techniques tackle all the above issues: our model is >70 times faster and more accurate than the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
