GODEL: Large-Scale Pre-Training for Goal-Directed Dialog
Baolin Peng, Michel Galley, Pengcheng He, Chris Brockett, Lars Liden,, Elnaz Nouri, Zhou Yu, Bill Dolan, Jianfeng Gao

TL;DR
GODEL is a large pre-trained dialog model that excels in various tasks by incorporating grounded pre-training, improving response usefulness and adaptability over previous models.
Contribution
The paper introduces GODEL, a novel grounded pre-training approach for dialog models, enhancing performance across multiple dialog tasks and introducing a new extrinsic evaluation methodology.
Findings
GODEL outperforms state-of-the-art dialog models in few-shot settings.
Grounded pre-training improves response usefulness and task adaptability.
Extrinsic evaluation correlates better with human judgments.
Abstract
We introduce GODEL (Grounded Open Dialogue Language Model), a large pre-trained language model for dialog. In contrast with earlier models such as DialoGPT, GODEL leverages a new phase of grounded pre-training designed to better support adapting GODEL to a wide range of downstream dialog tasks that require information external to the current conversation (e.g., a database or document) to produce good responses. Experiments against an array of benchmarks that encompass task-oriented dialog, conversational QA, and grounded open-domain dialog show that GODEL outperforms state-of-the-art pre-trained dialog models in few-shot fine-tuning setups, in terms of both human and automatic evaluation. A novel feature of our evaluation methodology is the introduction of a notion of utility that assesses the usefulness of responses (extrinsic evaluation) in addition to their communicative features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
