Analyzing Intentional Behavior in Autonomous Agents under Uncertainty
Filip Cano C\'ordoba, Samuel Judson, Timos Antonopoulos, Katrine, Bj{\o}rner, Nicholas Shoemaker, Scott J. Shapiro, Ruzica Piskac, Bettina, K\"onighofer

TL;DR
This paper introduces a quantitative approach to assess intentional behavior in autonomous agents operating under uncertainty, using probabilistic model checking and counterfactual reasoning to distinguish between intentional and accidental outcomes.
Contribution
It presents a novel method combining probabilistic model checking and counterfactual reasoning to evaluate intentionality in autonomous agents within uncertain environments.
Findings
Method effectively distinguishes between intentional and accidental events.
Application to traffic collisions demonstrates practical utility.
Provides a formal framework for accountability in autonomous systems.
Abstract
Principled accountability for autonomous decision-making in uncertain environments requires distinguishing intentional outcomes from negligent designs from actual accidents. We propose analyzing the behavior of autonomous agents through a quantitative measure of the evidence of intentional behavior. We model an uncertain environment as a Markov Decision Process (MDP). For a given scenario, we rely on probabilistic model checking to compute the ability of the agent to influence reaching a certain event. We call this the scope of agency. We say that there is evidence of intentional behavior if the scope of agency is high and the decisions of the agent are close to being optimal for reaching the event. Our method applies counterfactual reasoning to automatically generate relevant scenarios that can be analyzed to increase the confidence of our assessment. In a case study, we show how our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning
