Do Machines Fail Like Humans? A Human-Centred Out-of-Distribution Spectrum for Mapping Error Alignment
Binxia Xu, Xiaoliang Luo, Luke Dickens, Robert M. Mok

TL;DR
This paper introduces a human-centered OOD spectrum framework to evaluate how AI models' failure modes compare to humans across varying perceptual difficulties, improving model-human alignment analysis.
Contribution
It redefines out-of-distribution as a spectrum based on human perceptual difficulty, enabling more principled and calibrated model-human comparison across challenging stimuli.
Findings
Vision-language models show consistent human alignment across OOD conditions.
CNNs are more aligned than ViTs for near-OOD, while ViTs outperform CNNs for far-OOD.
The framework reveals regime-dependent model-human alignment profiles.
Abstract
Determining whether AI systems process information similarly to humans is central to cognitive science and trustworthy AI. While modern AI models can match human accuracy on standard tasks, such parity does not guarantee that their underlying decision-making strategies resemble those of humans. Assessing performance using error alignment metrics to compare how humans and models fail, and how this changes for distorted, or otherwise more challenging, stimuli, provides a viable pathway toward a finer characterization of model-human alignment. However, existing out-of-distribution (OOD) analyses for challenging stimuli are limited due to methodological choices: they define OOD shift relative to model training data or use arbitrary distortion-specific parameters with little correspondence to human perception, hindering principled comparisons. We propose a human-centred framework that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
