Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

Adeela Bashir; Zhao Song; Ndidi Bianca Ogbo; Nataliya Balabanova; Martin Smit; Chin-wing Leung; Paolo Bova; Manuel Chica Serrano; Dhanushka Dissanayake; Manh Hong Duong; Elias Fernandez Domingos; Nikita Huber-Kralj; Marcus Krellner; Andrew Powell; Stefan Sarkadi; Fernando P. Santos; Zia Ush Shamszaman; Chaimaa Tarzi; Paolo Turrini; Grace Ibukunoluwa Ufeoshi; Victor A. Vargas-Perez; Alessandro Di Stefano; Simon T. Powers; The Anh Han

arXiv:2603.24742·cs.AI·March 27, 2026

Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

Adeela Bashir, Zhao Song, Ndidi Bianca Ogbo, Nataliya Balabanova, Martin Smit, Chin-wing Leung, Paolo Bova, Manuel Chica Serrano, Dhanushka Dissanayake, Manh Hong Duong, Elias Fernandez Domingos, Nikita Huber-Kralj, Marcus Krellner, Andrew Powell, Stefan Sarkadi

PDF

Open Access

TL;DR

This paper models the dynamic evolution of user trust and AI developer behavior using game theory, highlighting how monitoring costs and sanctions influence the long-term safety and adoption of AI systems.

Contribution

It introduces a dynamic, repeated-interaction model of trust and safety in AI governance, extending beyond static one-shot trust models.

Findings

01

Safe, widely adopted AI systems emerge when penalties outweigh safety costs.

02

Monitoring at low cost and meaningful sanctions are crucial for maintaining safe AI development.

03

Regulation alone or blind trust cannot prevent unsafe or low-adoption outcomes.

Abstract

AI safety is an increasingly urgent concern as the capabilities and adoption of AI systems grow. Existing evolutionary models of AI governance have primarily examined incentives for safe development and effective regulation, typically representing users' trust as a one-shot adoption choice rather than as a dynamic, evolving process shaped by repeated interactions. We instead model trust as reduced monitoring in a repeated, asymmetric interaction between users and AI developers, where checking AI behaviour is costly. Using evolutionary game theory, we study how user trust strategies and developer choices between safe (compliant) and unsafe (non-compliant) AI co-evolve under different levels of monitoring cost and institutional regimes. We complement the infinite-population replicator analysis with stochastic finite-population dynamics and reinforcement learning (Q-learning) simulations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · AI in Service Interactions · Adversarial Robustness in Machine Learning