Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems
Filip Cano

TL;DR
This paper presents new methods and frameworks to improve safety, fairness, transparency, and accountability in autonomous AI systems, with practical implementations and formal assessments to promote trustworthy AI deployment.
Contribution
It introduces resilient safety shields, a novel fairness enforcement approach, and a formal framework for assessing intentionality and responsibility in probabilistic autonomous agents.
Findings
Safety shields prevent collisions in autonomous driving simulations.
Fairness shields effectively enforce group fairness with minimal intervention.
Metrics for agency and intention enable retrospective responsibility analysis.
Abstract
Ensuring responsible use of artificial intelligence (AI) has become imperative as autonomous systems increasingly influence critical societal domains. However, the concept of trustworthy AI remains broad and multi-faceted. This thesis advances knowledge in the safety, fairness, transparency, and accountability of AI systems. In safety, we extend classical deterministic shielding techniques to become resilient against delayed observations, enabling practical deployment in real-world conditions. We also implement both deterministic and probabilistic safety shields into simulated autonomous vehicles to prevent collisions with road users, validating the use of these techniques in realistic driving simulators. We introduce fairness shields, a novel post-processing approach to enforce group fairness in sequential decision-making settings over finite and periodic time horizons. By optimizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
