Zero-shot Slot Filling with DPR and RAG
Michael Glass, Gaetano Rossiello, Alfio Gliozzo

TL;DR
This paper presents enhancements to retrieval-augmented language models for zero-shot slot filling, achieving top performance on key benchmarks by improving retrieval and generation components.
Contribution
The authors introduce strategies to improve RAG-based models for slot filling, resulting in state-of-the-art performance on KILT benchmark datasets.
Findings
KGI0 system achieved top-1 on T-REx and zsRE datasets.
Enhanced retrieval and generation components improved slot filling accuracy.
Model outperforms previous approaches on real-world benchmarks.
Abstract
The ability to automatically extract Knowledge Graphs (KG) from a given collection of documents is a long-standing problem in Artificial Intelligence. One way to assess this capability is through the task of slot filling. Given an entity query in form of [Entity, Slot, ?], a system is asked to `fill' the slot by generating or extracting the missing value from a relevant passage or passages. This capability is crucial to create systems for automatic knowledge base population, which is becoming in ever-increasing demand, especially in enterprise applications. Recently, there has been a promising direction in evaluating language models in the same way we would evaluate knowledge bases, and the task of slot filling is the most suitable to this intent. The recent advancements in the field try to solve this task in an end-to-end fashion using retrieval-based language models. Models like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Adam · Residual Connection · Dense Connections · Linear Warmup With Linear Decay · Weight Decay
