PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning

Siqi Bao; Huang He; Fan Wang; Hua Wu; Haifeng Wang; Wenquan Wu; Zhen; Guo; Zhibin Liu; Xinchao Xu

arXiv:2006.16779·cs.CL·May 31, 2021·27 cites

PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning

Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Zhen, Guo, Zhibin Liu, Xinchao Xu

PDF

Open Access 3 Repos

TL;DR

PLATO-2 introduces a curriculum learning approach with two training stages to develop a high-quality open-domain chatbot capable of generating diverse and contextually appropriate responses, achieving state-of-the-art results.

Contribution

It presents a novel two-stage training process for open-domain chatbots, combining coarse and fine-grained models with latent variables and evaluation mechanisms.

Findings

01

Achieves state-of-the-art performance on Chinese and English datasets.

02

Demonstrates effectiveness of curriculum learning in dialogue systems.

03

Produces diverse and contextually relevant responses.

Abstract

To build a high-quality open-domain chatbot, we introduce the effective training process of PLATO-2 via curriculum learning. There are two stages involved in the learning process. In the first stage, a coarse-grained generation model is trained to learn response generation under the simplified framework of one-to-one mapping. In the second stage, a fine-grained generative model augmented with latent variables and an evaluation model are further trained to generate diverse responses and to select the best response, respectively. PLATO-2 was trained on both Chinese and English data, whose effectiveness and superiority are verified through comprehensive evaluations, achieving new state-of-the-art results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsPLATO-2