Few-Shot Bot: Prompt-Based Learning for Dialogue Systems
Andrea Madotto, Zhaojiang Lin, Genta Indra Winata, Pascale Fung

TL;DR
This paper demonstrates that prompt-based few-shot learning with large language models can effectively perform a wide range of dialogue tasks without fine-tuning, enabling efficient and adaptable conversational AI systems.
Contribution
It introduces a novel prompt-based few-shot learning approach for dialogue tasks, including a new classifier and an end-to-end chatbot called Few-Shot Bot that requires no training.
Findings
GPT-J-6B achieves competitive performance without training.
A prompt-based classifier effectively selects appropriate prompts.
Few-Shot Bot combines skills and knowledge bases for human-like responses.
Abstract
Learning to converse using only a few examples is a great challenge in conversational AI. The current best conversational models, which are either good chit-chatters (e.g., BlenderBot) or goal-oriented systems (e.g., MinTL), are language models (LMs) fine-tuned on large conversational datasets. Training these models is expensive, both in terms of computational resources and time, and it is hard to keep them up to date with new conversational skills. A simple yet unexplored solution is prompt-based few-shot learning (Brown et al. 2020) which does not require gradient-based fine-tuning but instead uses a few examples in the LM context as the only source of learning. In this paper, we explore prompt-based few-shot learning in dialogue tasks. We benchmark LMs of different sizes in nine response generation tasks, which include four knowledge-grounded tasks, a task-oriented generations task,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
