Re2G: Retrieve, Rerank, Generate
Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita, Rajaram Naik, Pengshan Cai, Alfio Gliozzo

TL;DR
Re2G combines neural retrieval, reranking, and generation in a BART-based model, significantly improving performance on knowledge-intensive tasks through end-to-end training and ensemble retrieval methods.
Contribution
The paper introduces Re2G, a novel system that integrates neural retrieval, reranking, and generation with a new training approach, advancing state-of-the-art results.
Findings
Achieved 9-34% improvements over previous SOTA on KILT tasks.
Demonstrated effectiveness across slot filling, QA, fact-checking, and dialog.
Enabled ensemble of BM25 and neural retrieval for better results.
Abstract
As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. These models incorporate neural initial retrieval from a corpus of passages. We build on this line of research, proposing Re2G, which combines both neural initial retrieval and reranking into a BART-based sequence-to-sequence generation. Our reranking approach also permits merging retrieval results from sources with incomparable scores, enabling an ensemble of BM25 and neural initial retrieval. To train our system end-to-end, we introduce a novel variation of knowledge distillation to train the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Attention Dropout · Adafactor · Adam · Residual Connection · {Dispute@FaQ-s}How to file a dispute with Expedia?
