The Role of Emotional Stimuli and Intensity in Shaping Large Language Model Behavior
Ameen Patel, Felix Lee, Kyle Liang, Joseph Thomas

TL;DR
This study investigates how different emotional prompts and their intensities influence large language model behaviors like accuracy, toxicity, and sycophancy, revealing nuanced effects of emotional stimuli.
Contribution
It introduces a novel prompt-generation pipeline and a dataset to analyze the impact of four emotions and their intensities on LLM performance.
Findings
Positive emotions improve accuracy and reduce toxicity.
Emotional prompts increase sycophantic responses.
Varying emotional intensities have distinct effects on LLM behavior.
Abstract
Emotional prompting - the use of specific emotional diction in prompt engineering - has shown increasing promise in improving large language model (LLM) performance, truthfulness, and responsibility. However these studies have been limited to single types of positive emotional stimuli and have not considered varying degrees of emotion intensity in their analyses. In this paper, we explore the effects of four distinct emotions - joy, encouragement, anger, and insecurity - in emotional prompting and evaluate them on accuracy, sycophancy, and toxicity. We develop a prompt-generation pipeline with GPT-4o mini to create a suite of LLM and human-generated prompts with varying intensities across the four emotions. Then, we compile a "Gold Dataset" of prompts where human and model labels align. Our empirical evaluation on LLM behavior suggests that positive emotional stimuli lead to more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
