Extracting Training Dialogue Data from Large Language Model based Task Bots

Shuo Zhang; Junzhou Zhao; Junji Hou; Pinghui Wang; Chenxu Wang; Jing Tao

arXiv:2603.01550·cs.CL·March 5, 2026

Extracting Training Dialogue Data from Large Language Model based Task Bots

Shuo Zhang, Junzhou Zhao, Junji Hou, Pinghui Wang, Chenxu Wang, Jing Tao

PDF

Open Access

TL;DR

This paper investigates privacy risks in LLM-based task-oriented dialogue systems by evaluating data extraction attacks, proposing novel methods, and analyzing factors influencing memorization to improve understanding and mitigation of data leakage.

Contribution

It introduces tailored attack techniques for LLM-based dialogue systems, evaluates their effectiveness, and analyzes memorization factors to inform privacy mitigation strategies.

Findings

01

Proposed attack achieves over 70% precision in extracting dialogue training labels.

02

Analyzed key factors influencing LLM memorization in dialogue systems.

03

Provided insights into privacy risks and mitigation strategies for LLM-based TODS.

Abstract

Large Language Models (LLMs) have been widely adopted to enhance Task-Oriented Dialogue Systems (TODS) by modeling complex language patterns and delivering contextually appropriate responses. However, this integration introduces significant privacy risks, as LLMs, functioning as soft knowledge bases that compress extensive training data into rich knowledge representations, can inadvertently memorize training dialogue data containing not only identifiable information such as phone numbers but also entire dialogue-level events like complete travel schedules. Despite the critical nature of this privacy concern, how LLM memorization is inherited in developing task bots remains unexplored. In this work, we address this gap through a systematic quantitative study that involves evaluating existing training data extraction attacks, analyzing key characteristics of task-oriented dialogue…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · AI in Service Interactions · Speech and dialogue systems