Faithful Persona-based Conversational Dataset Generation with Large Language Models
Pegah Jandaghi, XiangHai Sheng, Xinyi Bai, Jay Pujara, Hakim Sidahmed

TL;DR
This paper introduces a Generator-Critic framework that uses Large Language Models to expand a seed dataset into a large, high-quality persona-based conversational dataset, with conversation quality improving over successive iterations.
Contribution
The paper proposes a new Generator-Critic architecture for dataset generation using LLMs, enhancing conversation quality and expanding persona-based datasets for NLP models.
Findings
Synthetic-Persona-Chat contains 20k conversations.
The losing rate of Synthetic-Persona-Chat against Persona-Chat in a Turing test decreased from 17.2% to 8.8% over three iterations.
The framework improves conversation quality over iterations.
Abstract
High-quality conversational datasets are essential for developing AI models that can communicate with users. One way to foster deeper interactions between a chatbot and its user is through personas, aspects of the user's character that provide insights into their personality, motivations, and behaviors. Training Natural Language Processing (NLP) models on a diverse and comprehensive persona-based dataset can lead to conversational models that create a deeper connection with the user, and maintain their engagement. In this paper, we leverage the power of Large Language Models (LLMs) to create a large, high-quality conversational dataset from a seed dataset. We propose a Generator-Critic architecture framework to expand the initial dataset, while improving the quality of its conversations. The Generator is an LLM prompted to output conversations. The Critic consists of a mixture of expert…
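The Generator-Critic loop described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the names `toy_generator`, `toy_critic`, `generator_critic_round`, and `expand_dataset` are hypothetical, the Generator and Critic are trivial stand-ins rather than LLM calls, and feeding the selected conversations back to the Generator as examples is only one plausible way for quality to improve across iterations.

```python
import random
from typing import Callable, List

Conversation = List[str]  # one conversation = a list of utterances


def toy_generator(persona: str, examples: List[Conversation],
                  rng: random.Random) -> Conversation:
    """Hypothetical stand-in for an LLM prompted with a persona and examples."""
    mentions = rng.randint(0, 2)  # how often the persona surfaces in the chat
    return ["Hi, how are you?"] + [f"As I said, {persona}." for _ in range(mentions)]


def toy_critic(persona: str, conversation: Conversation) -> float:
    """Hypothetical stand-in for the Critic: counts turns that mention the persona."""
    return float(sum(persona in turn for turn in conversation))


def generator_critic_round(personas: List[str], examples: List[Conversation],
                           generate: Callable, score: Callable,
                           candidates_per_persona: int,
                           rng: random.Random) -> List[Conversation]:
    """Generate several candidates per persona and keep the best-scoring one."""
    selected = []
    for persona in personas:
        candidates = [generate(persona, examples, rng)
                      for _ in range(candidates_per_persona)]
        selected.append(max(candidates, key=lambda c: score(persona, c)))
    return selected


def expand_dataset(personas: List[str], seed: List[Conversation],
                   iterations: int = 3, candidates_per_persona: int = 4,
                   rng_seed: int = 0) -> List[Conversation]:
    """Run several rounds; selected conversations become the next round's examples."""
    rng = random.Random(rng_seed)
    examples = list(seed)
    dataset: List[Conversation] = []
    for _ in range(iterations):  # iteration count here is an arbitrary default
        kept = generator_critic_round(personas, examples, toy_generator,
                                      toy_critic, candidates_per_persona, rng)
        dataset.extend(kept)
        examples = kept  # assumed feedback step, not confirmed by the abstract
    return dataset
```

Calling `expand_dataset(["I like hiking", "I own two cats"], seed=[["Hello!"]])` returns one selected conversation per persona per round. In a real pipeline the two stand-ins would be replaced by LLM prompts, and the scoring criteria would follow whatever the Critic is actually defined to check.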
Taxonomy
Topics: Persona Design and Applications · Innovative Human-Technology Interaction · AI in Service Interactions
