A Simple and Efficient Multi-Task Learning Approach for Conditioned   Dialogue Generation

Yan Zeng; Jian-Yun Nie

arXiv:2010.11140·cs.CL·April 27, 2021·1 cites

A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue Generation

Yan Zeng, Jian-Yun Nie

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-task learning method that effectively uses related labeled non-dialogue text data to improve conditioned dialogue generation, addressing data scarcity issues.

Contribution

It presents a novel multi-task learning framework that jointly trains on dialogue and non-dialogue text data to enhance dialogue generation models.

Findings

01

Outperforms state-of-the-art models in conditioned dialogue generation

02

Leverages labeled non-dialogue text data effectively

03

Achieves significant performance improvements over previous methods

Abstract

Conditioned dialogue generation suffers from the scarcity of labeled responses. In this work, we exploit labeled non-dialogue text data related to the condition, which are much easier to collect. We propose a multi-task learning approach to leverage both labeled dialogue and text data. The 3 tasks jointly optimize the same pre-trained Transformer -- conditioned dialogue generation task on the labeled dialogue data, conditioned language encoding task and conditioned language generation task on the labeled text data. Experimental results show that our approach outperforms the state-of-the-art models by leveraging the labeled texts, and it also obtains larger improvement in performance comparing to the previous methods to leverage text data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zengyan-97/MultiT-C-Dialog
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Label Smoothing · Transformer · Adam · Layer Normalization · Dense Connections · Multi-Head Attention