Strategic Dialogue Management via Deep Reinforcement Learning
Heriberto Cuay\'ahuitl, Simon Keizer, Oliver Lemon

TL;DR
This paper demonstrates that Deep Reinforcement Learning can effectively train strategic dialogue agents capable of negotiation, outperforming traditional methods in a complex game setting.
Contribution
It introduces a novel application of DRL with high-dimensional state spaces for strategic dialogue management, showing significant performance improvements over existing approaches.
Findings
DRL-based policies achieved a 53% win rate against automated players.
DRL outperformed supervised learning and rule-based baselines.
The approach demonstrates the potential of DRL for training negotiation-capable dialogue agents.
Abstract
Artificially intelligent agents equipped with strategic skills that can negotiate during their interactions with other natural or artificial agents are still underdeveloped. This paper describes a successful application of Deep Reinforcement Learning (DRL) for training intelligent agents with strategic conversational skills, in a situated dialogue setting. Previous studies have modelled the behaviour of strategic agents using supervised learning and traditional reinforcement learning techniques, the latter using tabular representations or learning with linear function approximation. In this study, we apply DRL with a high-dimensional state space to the strategic board game of Settlers of Catan---where players can offer resources in exchange for others and they can also reply to offers made by other players. Our experimental results report that the DRL-based learnt policies significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Reinforcement Learning in Robotics · Speech and dialogue systems
