Evaluating Binary Decision Biases in Large Language Models: Implications   for Fair Agent-Based Financial Simulations

Alicia Vidler; Toby Walsh

arXiv:2501.16356·cs.LG·January 29, 2025

Evaluating Binary Decision Biases in Large Language Models: Implications for Fair Agent-Based Financial Simulations

Alicia Vidler, Toby Walsh

PDF

Open Access

TL;DR

This study evaluates biases in large language models used for simulating decision-making in financial agent-based models, highlighting how sampling methods and model versions influence bias and distribution properties.

Contribution

It systematically compares GPT models and sampling approaches, revealing biases and proposing considerations for integrating LLMs into financial simulations.

Findings

01

GPT-4o-Mini-2024-07-18 shows less bias than GPT-4-0125-preview

02

Sampling method impacts output distribution significantly

03

Few-shot sampling can approximate uniform distributions under certain conditions

Abstract

Large Language Models (LLMs) are increasingly being used to simulate human-like decision making in agent-based financial market models (ABMs). As models become more powerful and accessible, researchers can now incorporate individual LLM decisions into ABM environments. However, integration may introduce inherent biases that need careful evaluation. In this paper we test three state-of-the-art GPT models for bias using two model sampling approaches: one-shot and few-shot API queries. We observe significant variations in distributions of outputs between specific models, and model sub versions, with GPT-4o-Mini-2024-07-18 showing notably better performance (32-43% yes responses) compared to GPT-4-0125-preview's extreme bias (98-99% yes responses). We show that sampling methods and model sub-versions significantly impact results: repeated independent API calls produce different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications