EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with   Large-Scale Pre-Training

Yuxian Gu; Jiaxin Wen; Hao Sun; Yi Song; Pei Ke; Chujie Zheng; Zheng; Zhang; Jianzhu Yao; Lei Liu; Xiaoyan Zhu; Minlie Huang

arXiv:2203.09313·cs.CL·October 24, 2023

EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training

Yuxian Gu, Jiaxin Wen, Hao Sun, Yi Song, Pei Ke, Chujie Zheng, Zheng, Zhang, Jianzhu Yao, Lei Liu, Xiaoyan Zhu, Minlie Huang

PDF

2 Repos 4 Models

TL;DR

EVA2.0 is a large-scale Chinese dialogue model that leverages extensive experiments to optimize data quality, architecture, training, and decoding, significantly advancing human-like chatbot capabilities.

Contribution

The paper introduces EVA2.0, a 2.8-billion-parameter pre-trained Chinese dialogue system, with comprehensive analysis of key factors affecting performance and public release of models and code.

Findings

01

EVA2.0 outperforms existing open-source Chinese dialogue models.

02

Extensive experiments reveal critical factors influencing dialogue system performance.

03

Human and automatic evaluations confirm EVA2.0's superior conversational abilities.

Abstract

Large-scale pre-training has shown remarkable performance in building open-domain dialogue systems. However, previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model, ignoring the discussion of some key factors towards a powerful human-like chatbot, especially in Chinese scenarios. In this paper, we conduct extensive experiments to investigate these under-explored factors, including data quality control, model architecture designs, training approaches, and decoding strategies. We propose EVA2.0, a large-scale pre-trained open-domain Chinese dialogue model with 2.8 billion parameters, and will make our models and codes publicly available. Automatic and human evaluations show that EVA2.0 significantly outperforms other open-source counterparts. We also discuss the limitations of this work by presenting some failure cases and pose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.