AutoConv: Automatically Generating Information-seeking Conversations   with Large Language Models

Siheng Li; Cheng Yang; Yichun Yin; Xinyu Zhu; Zesen Cheng; Lifeng; Shang; Xin Jiang; Qun Liu; Yujiu Yang

arXiv:2308.06507·cs.CL·August 15, 2023

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models

Siheng Li, Cheng Yang, Yichun Yin, Xinyu Zhu, Zesen Cheng, Lifeng, Shang, Xin Jiang, Qun Liu, Yujiu Yang

PDF

TL;DR

AutoConv leverages large language models to generate high-quality synthetic information-seeking conversations, reducing reliance on scarce human-annotated data and improving performance on key datasets.

Contribution

AutoConv introduces a novel method for synthetic conversation generation using LLMs, addressing data scarcity in information-seeking dialogue research.

Findings

01

AutoConv outperforms strong baselines on two datasets.

02

It significantly reduces dependence on human annotation.

03

Experimental results demonstrate substantial quality improvements.

Abstract

Information-seeking conversation, which aims to help users gather information through conversation, has achieved great progress in recent years. However, the research is still stymied by the scarcity of training data. To alleviate this problem, we propose AutoConv for synthetic conversation generation, which takes advantage of the few-shot learning ability and generation capacity of large language models (LLM). Specifically, we formulate the conversation generation problem as a language modeling task, then finetune an LLM with a few human conversations to capture the characteristics of the information-seeking process and use it for generating synthetic conversations with high quality. Experimental results on two frequently-used datasets verify that AutoConv has substantial improvements over strong baselines and alleviates the dependence on human annotation. In addition, we also provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.