Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy, Liang, Christopher Potts, Matei Zaharia

TL;DR
The paper introduces Demonstrate-Search-Predict (DSP), a novel framework that enhances knowledge-intensive NLP tasks by enabling sophisticated interactions between language models and retrieval models, leading to state-of-the-art results.
Contribution
DSP is a flexible framework that allows complex, programmatic pipelines between LMs and RMs, improving over simple retrieve-then-read methods for various NLP tasks.
Findings
Achieved new state-of-the-art in-context learning results.
Significant relative gains over GPT-3.5 and standard pipelines.
Effective in open-domain, multi-hop, and conversational tasks.
Abstract
Retrieval-augmented in-context learning has emerged as a powerful approach for addressing knowledge-intensive tasks using frozen language models (LM) and retrieval models (RM). Existing work has combined these in simple "retrieve-then-read" pipelines in which the RM retrieves passages that are inserted into the LM prompt. To begin to fully realize the potential of frozen LMs and RMs, we propose Demonstrate-Search-Predict (DSP), a framework that relies on passing natural language texts in sophisticated pipelines between an LM and an RM. DSP can express high-level programs that bootstrap pipeline-aware demonstrations, search for relevant passages, and generate grounded predictions, systematically breaking down problems into small transformations that the LM and RM can handle more reliably. We have written novel DSP programs for answering questions in open-domain, multi-hop, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
