Internet-augmented language models through few-shot prompting for open-domain question answering
Angeliki Lazaridou, Elena Gribovskaya, Wojciech Stokowiec, Nikolai, Grigorev

TL;DR
This paper demonstrates that large language models augmented with web-based retrieval and few-shot prompting significantly improve open-domain question answering, offering a strong, fine-tuning-free baseline that leverages external knowledge sources.
Contribution
It introduces a method to condition language models on web-retrieved information via few-shot prompts without additional training, enhancing open-domain QA performance.
Findings
Web-augmented LMs outperform closed-book models of similar size.
Using multiple retrieved evidences and reranking improves accuracy.
Increasing inference compute benefits smaller models.
Abstract
In this work, we aim to capitalize on the unique few-shot capabilities of large-scale language models (LSLMs) to overcome some of their challenges with respect to grounding to factual and up-to-date information. Motivated by semi-parametric language models (LMs), which ground their decisions in external retrieved evidence, we use few-shot prompting to learn to condition LMs on information returned from the web using Google Search, a broad and constantly updated knowledge source. Our approach does not involve fine-tuning or learning additional parameters, thus making it applicable to any LM, offering therefore a strong baseline. Indeed, we find that LMs conditioned on the web surpass performance of closed-book models of similar, or even larger, model sizes in open-domain question answering. Finally, we find that increasing the inference-time compute of models, achieved via using multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
