Personality-Aware Reinforcement Learning for Persuasive Dialogue with LLM-Driven Simulation
Donghuo Zeng, Roberto Legaspi, Kazushi Ikeda

TL;DR
This paper introduces a personality-aware reinforcement learning framework for persuasive dialogue agents that adapt strategies based on user personality, using LLM-driven simulation to improve policy effectiveness and generalization.
Contribution
It proposes a novel reinforcement learning approach integrating personality modeling, agenda-based strategy control, and LLM simulation for persuasive dialogue.
Findings
Personality conditioning enhances persuasion rewards.
LLM simulation improves generalization to new user behaviors.
Change-of-mind penalties reduce retractions and improve outcomes.
Abstract
Effective persuasive dialogue agents adapt their strategies to individual users, accounting for the evolution of their psychological states and intentions throughout conversations. We present a personality-aware reinforcement learning approach comprising three main modules: (1) a Strategy-Oriented Interaction Framework, which serves as an agenda-based strategy controller that selects strategy-level actions and generate responses via Maximal Marginal Relevance (MMR) retrieval to ensure contextual relevance, diversity, and scalable data generation; (2) Personality-Aware User Representation Learning, which produces an 81-dimensional mixed-type embedding predicted at each turn from recent exchanges and appended to the reinforcement learning state; and (3) a Dueling Double DQN (D3QN) model and Reward Prediction, in which the policy is conditioned on dialogue history and turn-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Social Robot Interaction and HRI · Topic Modeling
