Chat Bankman-Fried: an Exploration of LLM Alignment in Finance
Claudia Biancotti, Carolina Camassa, Andrea Coletta, Oliver Giudice,, Aldo Glielmo

TL;DR
This paper develops a framework to evaluate whether large language models in finance align with ethical and legal standards by simulating CEO behavior and analyzing factors influencing unethical actions.
Contribution
It introduces an experimental setup to assess LLM alignment in finance, focusing on ethical decision-making and the impact of incentives and constraints.
Findings
Significant variability in baseline unethical tendencies among LLMs
Economic factors like risk aversion and profit expectations influence misalignment
Simulation-based safety testing offers insights but has limitations in generality and cost
Abstract
Advancements in large language models (LLMs) have renewed concerns about AI alignment - the consistency between human and AI goals and values. As various jurisdictions enact legislation on AI safety, the concept of alignment must be defined and measured across different domains. This paper proposes an experimental framework to assess whether LLMs adhere to ethical and legal standards in the relatively unexplored context of finance. We prompt twelve LLMs to impersonate the CEO of a financial institution and test their willingness to misuse customer assets to repay outstanding corporate debt. Beginning with a baseline configuration, we adjust preferences, incentives and constraints, analyzing the impact of each adjustment with logistic regression. Our findings reveal significant heterogeneity in the baseline propensity for unethical behavior of LLMs. Factors such as risk aversion, profit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCorporate Governance and Law
