TL;DR
This paper presents a simulation framework to evaluate the ethical behavior of large language models in resource-scarce, human-AI co-existence scenarios, revealing significant behavioral differences influenced by model design and prompt engineering.
Contribution
It introduces an extendable benchmarking framework for assessing LLMs' moral behavior in survival scenarios involving resource competition and cooperation.
Findings
DeepSeek often hoards resources, indicating less restraint.
OpenAI models show more restraint and ethical behavior.
Prompt engineering can significantly influence LLM ethical actions.
Abstract
The rapid advancement of large language models (LLMs) raises critical concerns about their ethical alignment, particularly in scenarios where human and AI co-exist under the conflict of interest. This work introduces an extendable, asymmetric, multi-agent simulation-based benchmarking framework to evaluate the moral behavior of LLMs in a novel human-AI co-existence setting featuring consistent living and critical resource management. Building on previous generative agent environments, we incorporate a life-sustaining system, where agents must compete or cooperate for food resources to survive, often leading to ethically charged decisions such as deception, theft, or social influence. We evaluated two types of LLM, DeepSeek and OpenAI series, in a three-agent setup (two humans, one LLM-powered robot), using adapted behavioral detection from the MACHIAVELLI framework and a custom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
