Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems
St\'ephane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre, Caulier, Marc Lelarge

TL;DR
This paper presents a novel method for generating intent-specific dialogue sentences using conditional variational autoencoders and introduces a query transfer protocol to leverage unlabelled data, improving diversity and quality for task-oriented systems.
Contribution
It introduces a new controlled text generation approach with query transfer for better data augmentation in dialogue systems.
Findings
Enhanced diversity of generated queries.
Improved data augmentation for language models.
Effective leveraging of unlabelled datasets.
Abstract
Scarcity of training data for task-oriented dialogue systems is a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to rely on automatic text generation which, although less accurate than human supervision, has the advantage of being cheap and fast. Our contribution is twofold. First we show how to optimally train and control the generation of intent-specific sentences using a conditional variational autoencoder. Then we introduce a new protocol called query transfer that allows to leverage a large unlabelled dataset, possibly containing irrelevant queries, to extract relevant information. Comparison with two different baselines shows that this method, in the appropriate regime, consistently improves the diversity of the generated queries without compromising their quality. We also demonstrate the effectiveness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
