Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken   Dialogue Systems to Low-Resource User Groups

Zhiyang Qi; Michimasa Inaba

arXiv:2408.10516·cs.CL·August 21, 2024

Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups

Zhiyang Qi, Michimasa Inaba

PDF

Open Access

TL;DR

This paper introduces a data augmentation framework that uses large language models to generate personalized dialogue data, improving spoken dialogue systems' performance for low-resource user groups like minors.

Contribution

The study presents a novel approach combining LLMs and PLMs to augment dialogue data, enabling SDSs to better adapt to users with limited data, especially minors.

Findings

01

Enhanced SDS performance with augmented data

02

Improved interaction quality for low-resource user groups

03

Validated effectiveness through extensive experiments

Abstract

This study addresses the interaction challenges encountered by spoken dialogue systems (SDSs) when engaging with users who exhibit distinct conversational behaviors, particularly minors, in scenarios where data are scarce. We propose a novel data augmentation framework to enhance SDS performance for user groups with limited resources. Our approach leverages a large language model (LLM) to extract speaker styles and a pre-trained language model (PLM) to simulate dialogue act history. This method generates enriched and personalized dialogue data, facilitating improved interactions with unique user demographics. Extensive experiments validate the efficacy of our methodology, highlighting its potential to foster the development of more adaptive and inclusive dialogue systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Intelligent Tutoring Systems and Adaptive Learning · Context-Aware Activity Recognition Systems