# The realism of behavioral theory-based vs. non-theory-based AI agents during a simulated infant formula shortage

**Authors:** Linda Desens, Brandon Walling, Rhys O’Neill, Vanessa Howard, Mary Giammarino, Denise Scannell, Anya Kemble, Taylor Wilkerson, Nyalok Nhial, Sara Beth Elson, Maureen Leahy, Scott Rosen

PMC · DOI: 10.3389/frai.2026.1719703 · Frontiers in Artificial Intelligence · 2026-02-09

## TL;DR

This study shows that AI agents based on behavioral theories are perceived as more realistic than those without theories during a simulated infant formula shortage.

## Contribution

The study introduces a repeatable method for assessing behavioral fidelity in AI agents using behavioral theory.

## Key findings

- Theory-based agents received significantly higher realism ratings from human raters.
- The study provides a method to evaluate and enhance AI agent realism in crisis simulations.

## Abstract

AI-driven digital twins and autonomous AI agents are increasingly used to simulate human behavior during crises. Incorporating behavioral science frameworks may improve agent realism, but this practice is still in its infancy. This research evaluates the realism of behavioral theory-based agents in a controlled experimental design.

Using a simulated infant formula shortage in South Dallas County, we compare two conditions: one with theory-based agents, and another without. Participants (human raters) assessed the perceived realism of agent decisions across both conditions.

Results showed significantly higher realism ratings for the theory-based agents, supporting our hypothesis.

This study constitutes an early effort to assess behavioral theory in simulation frameworks and establish a repeatable method for assessing behavioral fidelity. It provides policymakers and researchers with a theory-informed approach for enhancing AI agent realism, with the goal of increasing trust in digital twin models used for decision support in high-stakes environments.

## Full-text entities

- **Diseases:** infectious disease (MESH:D003141), Food insecurity (MESH:D005517), weight gain (MESH:D015430), SCT (OMIM:300082), DT (MESH:D004200), anxiety (MESH:D001007), infant (MESH:D063766), Crisis (MESH:D001752), DECISION_CYCLE (MESH:D020195), COVID-19 (MESH:D000086382), deaths (MESH:D003643), BDI (MESH:D014202)
- **Chemicals:** DT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12926465/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12926465/full.md

## References

70 references — full list in the complete paper: https://tomesphere.com/paper/PMC12926465/full.md

---
Source: https://tomesphere.com/paper/PMC12926465