Prompting Science Report 3: I'll pay you or I'll kill you -- but will you care?

Lennart Meincke; Ethan Mollick; Lilach Mollick; Dan Shapiro

arXiv:2508.00614·cs.CL·August 4, 2025

Prompting Science Report 3: I'll pay you or I'll kill you -- but will you care?

Lennart Meincke, Ethan Mollick, Lilach Mollick, Dan Shapiro

PDF

Open Access

TL;DR

This report empirically tests common prompting beliefs like tipping and threatening AI models, finding they generally do not improve performance, while prompt variations can significantly influence question-by-question results.

Contribution

It provides rigorous empirical evidence that tipping and threatening do not enhance AI performance, highlighting the importance of prompt design over such tactics.

Findings

01

Threatening or tipping has no significant effect on benchmark performance.

02

Prompt variations can significantly alter question-level outcomes.

03

Simple prompting tactics may be less effective for difficult problems.

Abstract

This is the third in a series of short reports that seek to help business, education, and policy leaders understand the technical details of working with AI through rigorous testing. In this report, we investigate two commonly held prompting beliefs: a) offering to tip the AI model and b) threatening the AI model. Tipping was a commonly shared tactic for improving AI performance and threats have been endorsed by Google Founder Sergey Brin (All-In, May 2025, 8:20) who observed that 'models tend to do better if you threaten them,' a claim we subject to empirical testing here. We evaluate model performance on GPQA (Rein et al. 2024) and MMLU-Pro (Wang et al. 2024). We demonstrate two things: - Threatening or tipping a model generally has no significant effect on benchmark performance. - Prompt variations can significantly affect performance on a per-question level. However, it is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPsychology of Social Influence · Expert finding and Q&A systems · Misinformation and Its Impacts