How Do LLMs Persuade? Linear Probes Can Uncover Persuasion Dynamics in Multi-Turn Conversations
Brandon Jaipersaud, David Krueger, Ekdeep Singh Lubana

TL;DR
This paper demonstrates that linear probes can effectively analyze persuasion dynamics in multi-turn conversations involving large language models, offering insights into persuasion success, strategies, and user personalities with efficiency and accuracy.
Contribution
The study introduces the use of linear probes to analyze persuasion in LLMs, revealing their ability to identify persuasion points and strategies, outperforming prompting in some cases.
Findings
Probes can identify persuasion success points in conversations.
Probes outperform prompting in uncovering persuasion strategies.
Probes are faster and as effective as prompting-based methods.
Abstract
Large Language Models (LLMs) have started to demonstrate the ability to persuade humans, yet our understanding of how this dynamic transpires is limited. Recent work has used linear probes, lightweight tools for analyzing model representations, to study various LLM skills such as the ability to model user sentiment and political perspective. Motivated by this, we apply probes to study persuasion dynamics in natural, multi-turn conversations. We leverage insights from cognitive science to train probes on distinct aspects of persuasion: persuasion success, persuadee personality, and persuasion strategy. Despite their simplicity, we show that they capture various aspects of persuasion at both the sample and dataset levels. For instance, probes can identify the point in a conversation where the persuadee was persuaded or where persuasive success generally occurs across the entire dataset.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
