Boosting Large Language Models for Mental Manipulation Detection via Data Augmentation and Distillation
Yuansheng Gao, Peng Gao, Han Bao, Bin Li, Jixiang Luo, Zonghui Wang, Wenzhi Chen

TL;DR
This paper introduces MentalMAD, a comprehensive framework combining data augmentation, teacher supervision, and distillation to improve large language models in detecting covert mental manipulation on social media.
Contribution
It presents a novel multi-component approach and a new dataset for enhancing LLMs in mental manipulation detection, addressing annotation and data scarcity challenges.
Findings
MentalMAD improves detection accuracy by 14%.
Macro-F1 score increases by 27.3%.
Weighted F1 score improves by 15.1%.
Abstract
Mental manipulation on social media poses a covert yet serious threat to individuals' psychological well-being and the integrity of online interactions. Detecting such behavior is challenging due to the difficult-to-annotate training data, its highly covert and multi-turn nature, and the lack of real-world datasets. To address these challenges, we propose MentalMAD, a framework that enhances large language models for mental manipulation detection. Our approach consists of three key components: EvoSA, an annotation-free data augmentation method that combines evolutionary operations with speech-act-aware prompting; teacher-model-generated complementary-task supervision; and Complementary-Convergent Distillation, a phase-wise strategy for transferring manipulation-specific knowledge to student models. We then constructed the ReaMent dataset, comprising 5,000 real-world-sourced dialogues.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Mental Health via Writing · Mobile Crowdsensing and Crowdsourcing
MethodsKnowledge Distillation
