Human Attribution of Causality to AI Across Agency, Misuse, and Misalignment
Maria Victoria Carro, David Lagnado

TL;DR
This study explores how people perceive causality and responsibility in AI-related harmful incidents, revealing biases based on AI agency, role, and developer involvement, which can inform liability frameworks.
Contribution
It provides empirical insights into human causal judgments involving AI, highlighting factors influencing responsibility attribution in complex AI-human interactions.
Findings
Higher AI agency increases causal attribution to AI.
People assign more responsibility to humans when roles are reversed.
Developers are judged highly causal despite being distant in the causal chain.
Abstract
AI-related incidents are becoming increasingly frequent and severe, ranging from safety failures to misuse by malicious actors. In such complex situations, identifying which elements caused an adverse outcome, the problem of cause selection, is a critical first step for establishing liability. This paper investigates folk perceptions of causal responsibility in causal chain structures when AI systems are involved in harmful outcomes. We conduct human experiments to examine judgments of causality, blame, foreseeability, and counterfactual reasoning. Our findings show that: (1) When AI agency was moderate (human sets the goal, AI determines the means) or high (AI sets the goal and the means), participants attributed greater causal responsibility to the AI. However, under low AI agency (where a human sets both a goal and means) participants assigned greater causal responsibility to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Psychology of Moral and Emotional Judgment · Explainable Artificial Intelligence (XAI)
