A Unified Data Augmentation Framework for Low-Resource Multi-Domain   Dialogue Generation

Yongkang Liu; Ercong Nie; Shi Feng; Zheng Hua; Zifeng Ding; Daling; Wang; Yifei Zhang; Hinrich Sch\"utze

arXiv:2406.09881·cs.CL·July 1, 2024

A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation

Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling, Wang, Yifei Zhang, Hinrich Sch\"utze

PDF

Open Access 1 Repo

TL;DR

This paper introduces AMD$^2$G, a novel data augmentation framework for low-resource multi-domain dialogue generation that leverages de-domained corpora and a two-stage training process to improve performance across diverse domains.

Contribution

The paper proposes a unified data augmentation framework with a de-domaining technique and a two-stage training approach for low-resource multi-domain dialogue systems.

Findings

01

AMD$^2$G outperforms direct and collective training methods.

02

De-domaining effectively captures domain-agnostic features.

03

Framework demonstrates superior results on Chinese multi-domain datasets.

Abstract

Current state-of-the-art dialogue systems heavily rely on extensive training datasets. However, challenges arise in domains where domain-specific training datasets are insufficient or entirely absent. To tackle this challenge, we propose a novel data \textbf{A}ugmentation framework for \textbf{M}ulti-\textbf{D}omain \textbf{D}ialogue \textbf{G}eneration, referred to as \textbf{AMD $^{2}$ G}. The AMD $^{2}$ G framework consists of a data augmentation process and a two-stage training approach: domain-agnostic training and domain adaptation training. We posit that domain corpora are a blend of domain-agnostic and domain-specific features, with certain representation patterns shared among diverse domains. Domain-agnostic training aims to enable models to learn these common expressive patterns. To construct domain-agnostic dialogue corpora, we employ a \textit{\textbf{de-domaining}} data processing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

misonsky/Amdg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Context-Aware Activity Recognition Systems · Robotics and Automated Systems