A Unified Pre-training Framework for Conversational AI
Siqi Bao, Bingjin Chen, Huang He, Xin Tian, Han Zhou, Fan Wang, Hua, Wu, Haifeng Wang, Wenquan Wu, Yingzhan Lin

TL;DR
This paper presents PLATO-2, a unified pre-training framework for diverse conversational AI tasks, demonstrating state-of-the-art performance across open-domain, knowledge-grounded, and task-oriented dialogues through a two-stage curriculum learning approach.
Contribution
The paper introduces PLATO-2, a novel two-stage curriculum learning framework that effectively unifies training for multiple dialogue system types, outperforming existing methods.
Findings
PLATO-2 achieved 1st place in DSTC9 tasks for open-domain, knowledge-grounded, and task-oriented dialogues.
The two-stage training improves response diversity and coherence.
PLATO-2 demonstrates versatility across different dialogue applications.
Abstract
In this work, we explore the application of PLATO-2 on various dialogue systems, including open-domain conversation, knowledge grounded dialogue, and task-oriented conversation. PLATO-2 is initially designed as an open-domain chatbot, trained via two-stage curriculum learning. In the first stage, a coarse-grained response generation model is learned to fit the simplified one-to-one mapping relationship. This model is applied to the task-oriented conversation, given that the semantic mappings tend to be deterministic in task completion. In the second stage, another fine-grained generation model and an evaluation model are further learned for diverse response generation and coherence estimation, respectively. With superior capability on capturing one-to-many mapping, such models are suitable for the open-domain conversation and knowledge grounded dialogue. For the comprehensive evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsPLATO-2
