SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations

Danush Khanna; Pratinav Seth; Sidhaarth Sredharan Murali; Aditya Kumar Guru; Siddharth Shukla; Tanuj Tyagi; Sandeep Chaurasia; Kripabandhu Ghosh

arXiv:2505.20679·cs.CL·May 28, 2025

SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations

Danush Khanna, Pratinav Seth, Sidhaarth Sredharan Murali, Aditya Kumar Guru, Siddharth Shukla, Tanuj Tyagi, Sandeep Chaurasia, Kripabandhu Ghosh

PDF

Open Access 1 Video

TL;DR

This paper introduces SELF-PERCEPT, a two-stage prompting framework that enhances large language models' ability to detect subtle mental manipulation in complex multi-person conversations, supported by a new dataset.

Contribution

The paper presents a novel two-stage prompting method inspired by Self-Perception Theory and a new dataset for detecting manipulation in multi-turn, multi-person dialogues.

Findings

01

SELF-PERCEPT improves detection accuracy over baseline models.

02

State-of-the-art LLMs struggle with manipulation detection.

03

The dataset captures realistic manipulation scenarios.

Abstract

Mental manipulation is a subtle yet pervasive form of abuse in interpersonal communication, making its detection critical for safeguarding potential victims. However, due to manipulation's nuanced and context-specific nature, identifying manipulative language in complex, multi-turn, and multi-person conversations remains a significant challenge for large language models (LLMs). To address this gap, we introduce the MultiManip dataset, comprising 220 multi-turn, multi-person dialogues balanced between manipulative and non-manipulative interactions, all drawn from reality shows that mimic real-world scenarios. For manipulative interactions, it includes 11 distinct manipulations depicting real-life scenarios. We conduct extensive evaluations of state-of-the-art LLMs, such as GPT-4o and Llama-3.1-8B, employing various prompting strategies. Despite their capabilities, these models often…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SELF-PERCEPT: Introspection Improves Large Language Models’ Detection of Multi-Person Mental Manipulation in Conversations· underline

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling