Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals
ZhaoBin Li, Mark Steyvers

TL;DR
Humans can learn to recalibrate their trust in AI confidence signals through repeated interactions, improving their ability to judge AI correctness despite systematic miscalibrations, with some limitations in reverse confidence scenarios.
Contribution
This study demonstrates that humans adaptively recalibrate trust in AI confidence signals via experience and introduces a computational model explaining the underlying learning dynamics.
Findings
Participants improved accuracy and calibration over trials
Humans adapt trust by updating baseline and sensitivity
Difficulty persists in reverse confidence conditions
Abstract
Productive human-AI collaboration requires appropriate reliance, yet contemporary AI systems are often miscalibrated, exhibiting systematic overconfidence or underconfidence. We investigate whether humans can learn to mentally recalibrate AI confidence signals through repeated experience. In a behavioral experiment (N = 200), participants predicted the AI's correctness across four AI calibration conditions: standard, overconfidence, underconfidence, and a counterintuitive "reverse confidence" mapping. Results demonstrate robust learning across all conditions, with participants significantly improving their accuracy, discrimination, and calibration alignment over 50 trials. We present a computational model utilizing a linear-in-log-odds (LLO) transformation and a Rescorla-Wagner learning rule to explain these dynamics. The model reveals that humans adapt by updating their baseline trust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
