A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation
Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling, Wang, Yifei Zhang, Hinrich Sch\"utze

TL;DR
This paper introduces AMD$^2$G, a novel data augmentation framework for low-resource multi-domain dialogue generation that leverages de-domained corpora and a two-stage training process to improve performance across diverse domains.
Contribution
The paper proposes a unified data augmentation framework with a de-domaining technique and a two-stage training approach for low-resource multi-domain dialogue systems.
Findings
AMD$^2$G outperforms direct and collective training methods.
De-domaining effectively captures domain-agnostic features.
Framework demonstrates superior results on Chinese multi-domain datasets.
Abstract
Current state-of-the-art dialogue systems heavily rely on extensive training datasets. However, challenges arise in domains where domain-specific training datasets are insufficient or entirely absent. To tackle this challenge, we propose a novel data \textbf{A}ugmentation framework for \textbf{M}ulti-\textbf{D}omain \textbf{D}ialogue \textbf{G}eneration, referred to as \textbf{AMDG}. The AMDG framework consists of a data augmentation process and a two-stage training approach: domain-agnostic training and domain adaptation training. We posit that domain corpora are a blend of domain-agnostic and domain-specific features, with certain representation patterns shared among diverse domains. Domain-agnostic training aims to enable models to learn these common expressive patterns. To construct domain-agnostic dialogue corpora, we employ a \textit{\textbf{de-domaining}} data processing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Context-Aware Activity Recognition Systems · Robotics and Automated Systems
