Deep Reinforcement Learning for Multi-Domain Dialogue Systems

Heriberto Cuay\'ahuitl; Seunghak Yu; Ashley Williamson; Jacob Carse

arXiv:1611.08675·cs.AI·May 10, 2021·31 cites

Deep Reinforcement Learning for Multi-Domain Dialogue Systems

Heriberto Cuay\'ahuitl, Seunghak Yu, Ashley Williamson, Jacob Carse

PDF

Open Access 1 Repo

TL;DR

This paper introduces NDQN, a scalable deep reinforcement learning method for multi-domain dialogue systems, demonstrating improved performance over traditional DQN in simulation experiments for restaurant and hotel information-seeking tasks.

Contribution

The paper proposes NDQN, a novel multi-domain reinforcement learning approach that enhances scalability and effectiveness in dialogue policy learning.

Findings

01

NDQN outperforms DQN in scalability and dialogue management.

02

Experimental results show improved dialogue success rates.

03

NDQN is promising for real-world multi-domain dialogue systems.

Abstract

Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems. We propose a method for multi-domain dialogue policy learning---termed NDQN, and apply it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations report that our proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cuayahuitl/SimpleDS
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Social Robot Interaction and HRI · Context-Aware Activity Recognition Systems

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network