SideControl: Controlled Open-domain Dialogue Generation via Additive   Side Networks

Wanyu Du; Yangfeng Ji

arXiv:2109.01958·cs.CL·September 27, 2022

SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks

Wanyu Du, Yangfeng Ji

PDF

Open Access 1 Repo

TL;DR

SideControl introduces a novel framework for controlling open-domain dialogue generation using additive side networks, achieving better controllability, quality, and sample-efficiency with limited training data.

Contribution

The paper proposes SideControl, a new method that incorporates control signals via a control attributes loss, improving over existing gradient and weighted-decoding approaches.

Findings

01

Outperforms baseline methods in controllability and quality

02

Requires fewer training samples for effective control

03

Demonstrates superior sample-efficiency on benchmark datasets

Abstract

Transformer-based pre-trained language models boost the performance of open-domain dialogue systems. Prior works leverage Transformer-based pre-trained language models to generate texts with desired attributes in two general approaches: (1) gradient-based methods: updating all latent representations of pre-trained models with gradients from attribute models; (2) weighted-decoding methods: re-ranking beam candidates from pre-trained models with attribute functions. However, gradient-based methods lead to high computation cost and can easily get overfitted on small training sets, while weighted-decoding methods are inherently constrained by the low-variance high-bias pre-trained model. In this work, we propose a novel approach to control the generation of Transformer-based pre-trained language models: the SideControl framework, which leverages a novel control attributes loss to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wyu-du/controlled-dialogue-generation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems