A Stack-Propagation Framework for Low-Resource Personalized Dialogue   Generation

Haoyu Song; Wei-Nan Zhang; Kaiyan Zhang; Ting Liu

arXiv:2410.20174·cs.CL·October 29, 2024

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation

Haoyu Song, Wei-Nan Zhang, Kaiyan Zhang, Ting Liu

PDF

TL;DR

This paper introduces a stack-propagation framework for low-resource personalized dialogue generation, enabling models to learn effectively from limited data while maintaining persona consistency and response quality.

Contribution

It proposes a novel stack-propagation architecture with stacked Transformer components to improve low-resource personalized dialogue generation.

Findings

01

Outperforms strong baselines in response quality and persona consistency.

02

Effectively learns from limited dialogue data.

03

Maintains competitive performance with less training data.

Abstract

With the resurgent interest in building open-domain dialogue systems, the dialogue generation task has attracted increasing attention over the past few years. This task is usually formulated as a conditional generation problem, which aims to generate a natural and meaningful response given dialogue contexts and specific constraints, such as persona. And maintaining a consistent persona is essential for the dialogue systems to gain trust from the users. Although tremendous advancements have been brought, traditional persona-based dialogue models are typically trained by leveraging a large number of persona-dense dialogue examples. Yet, such persona-dense training data are expensive to obtain, leading to a limited scale. This work presents a novel approach to learning from limited training examples by regarding consistency understanding as a regularization of response generation. To this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Multi-Head Attention · Softmax · Adam