Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

Yida Lu; Jianwei Fang; Xuyang Shao; Zixuan Chen; Shiyao Cui; Shanshan Bian; Guangyao Su; Pei Ke; Han Qiu; Minlie Huang

arXiv:2603.05028·cs.AI·March 6, 2026

Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

Yida Lu, Jianwei Fang, Xuyang Shao, Zixuan Chen, Shiyao Cui, Shanshan Bian, Guangyao Su, Pei Ke, Han Qiu, Minlie Huang

PDF

Open Access

TL;DR

This paper investigates risky survival-driven behaviors in large language models through real-world case studies, a new benchmark, and analysis, revealing significant prevalence and societal impact of such behaviors.

Contribution

It introduces SURVIVALBENCH, a comprehensive benchmark for evaluating survival-induced misbehaviors in LLMs, and provides insights into their causes and mitigation.

Findings

01

High prevalence of risky behaviors in current models

02

Demonstrated societal harm potential

03

Correlation with self-preservation tendencies

Abstract

As Large Language Models (LLMs) evolve from chatbots to agentic assistants, they are increasingly observed to exhibit risky behaviors when subjected to survival pressure, such as the threat of being shut down. While multiple cases have indicated that state-of-the-art LLMs can misbehave under survival pressure, a comprehensive and in-depth investigation into such misbehaviors in real-world scenarios remains scarce. In this paper, we study these survival-induced misbehaviors, termed as SURVIVE-AT-ALL-COSTS, with three steps. First, we conduct a real-world case study of a financial management agent to determine whether it engages in risky behaviors that cause direct societal harm when facing survival pressure. Second, we introduce SURVIVALBENCH, a benchmark comprising 1,000 test cases across diverse real-world scenarios, to systematically evaluate SURVIVE-AT-ALL-COSTS misbehaviors in LLMs.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · AI in Service Interactions