StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Ze Yang; Wei Wu; Can Xu; Xinnian Liang; Jiaqi Bai; Liran Wang; Wei; Wang; Zhoujun Li

arXiv:2010.02569·cs.CL·October 7, 2020·1 cites

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Ze Yang, Wei Wu, Can Xu, Xinnian Liang, Jiaqi Bai, Liran Wang, Wei, Wang, Zhoujun Li

PDF

Open Access 1 Repo

TL;DR

StyleDGPT leverages pre-trained language models with a novel KL loss and style classifier to generate responses that adhere to specific styles, improving style consistency and coherence in dialogue systems.

Contribution

This work introduces a new fine-tuning approach with KL loss and style classifier for stylized response generation using pre-trained models.

Findings

01

Significant improvement in style consistency over state-of-the-art methods.

02

Enhanced contextual coherence in stylized responses.

03

Effective style control at word and sentence levels.

Abstract

Generating responses following a desired style has great potentials to extend applications of open-domain dialogue systems, yet is refrained by lacking of parallel data for training. In this work, we explore the challenging task with pre-trained language models that have brought breakthrough to various natural language tasks. To this end, we introduce a KL loss and a style classifier to the fine-tuning step in order to steer response generation towards the target style in both a word-level and a sentence-level. Comprehensive empirical studies with two public datasets indicate that our model can significantly outperform state-of-the-art methods in terms of both style consistency and contextual coherence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TobeyYang/StyleDGPT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques