Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour
Adeela Bashir, Zhao Song, Ndidi Bianca Ogbo, Nataliya Balabanova, Martin Smit, Chin-wing Leung, Paolo Bova, Manuel Chica Serrano, Dhanushka Dissanayake, Manh Hong Duong, Elias Fernandez Domingos, Nikita Huber-Kralj, Marcus Krellner, Andrew Powell, Stefan Sarkadi

TL;DR
This paper models the dynamic evolution of user trust and AI developer behavior using game theory, highlighting how monitoring costs and sanctions influence the long-term safety and adoption of AI systems.
Contribution
It introduces a dynamic, repeated-interaction model of trust and safety in AI governance, extending beyond static one-shot trust models.
Findings
Safe, widely adopted AI systems emerge when penalties outweigh safety costs.
Monitoring at low cost and meaningful sanctions are crucial for maintaining safe AI development.
Regulation alone or blind trust cannot prevent unsafe or low-adoption outcomes.
Abstract
AI safety is an increasingly urgent concern as the capabilities and adoption of AI systems grow. Existing evolutionary models of AI governance have primarily examined incentives for safe development and effective regulation, typically representing users' trust as a one-shot adoption choice rather than as a dynamic, evolving process shaped by repeated interactions. We instead model trust as reduced monitoring in a repeated, asymmetric interaction between users and AI developers, where checking AI behaviour is costly. Using evolutionary game theory, we study how user trust strategies and developer choices between safe (compliant) and unsafe (non-compliant) AI co-evolve under different levels of monitoring cost and institutional regimes. We complement the infinite-population replicator analysis with stochastic finite-population dynamics and reinforcement learning (Q-learning) simulations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · AI in Service Interactions · Adversarial Robustness in Machine Learning
