Personality-Aware Reinforcement Learning for Persuasive Dialogue with LLM-Driven Simulation

Donghuo Zeng; Roberto Legaspi; Kazushi Ikeda

arXiv:2601.06877·cs.HC·January 13, 2026

Personality-Aware Reinforcement Learning for Persuasive Dialogue with LLM-Driven Simulation

Donghuo Zeng, Roberto Legaspi, Kazushi Ikeda

PDF

Open Access

TL;DR

This paper introduces a personality-aware reinforcement learning framework for persuasive dialogue agents that adapt strategies based on user personality, using LLM-driven simulation to improve policy effectiveness and generalization.

Contribution

It proposes a novel reinforcement learning approach integrating personality modeling, agenda-based strategy control, and LLM simulation for persuasive dialogue.

Findings

01

Personality conditioning enhances persuasion rewards.

02

LLM simulation improves generalization to new user behaviors.

03

Change-of-mind penalties reduce retractions and improve outcomes.

Abstract

Effective persuasive dialogue agents adapt their strategies to individual users, accounting for the evolution of their psychological states and intentions throughout conversations. We present a personality-aware reinforcement learning approach comprising three main modules: (1) a Strategy-Oriented Interaction Framework, which serves as an agenda-based strategy controller that selects strategy-level actions and generate responses via Maximal Marginal Relevance (MMR) retrieval to ensure contextual relevance, diversity, and scalable data generation; (2) Personality-Aware User Representation Learning, which produces an 81-dimensional mixed-type embedding predicted at each turn from recent exchanges and appended to the reinforcement learning state; and (3) a Dueling Double DQN (D3QN) model and Reward Prediction, in which the policy is conditioned on dialogue history and turn-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Social Robot Interaction and HRI · Topic Modeling