Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems

Filip Cano

arXiv:2506.10192·cs.AI·October 28, 2025

Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems

Filip Cano

PDF

TL;DR

This paper presents new methods and frameworks to improve safety, fairness, transparency, and accountability in autonomous AI systems, with practical implementations and formal assessments to promote trustworthy AI deployment.

Contribution

It introduces resilient safety shields, a novel fairness enforcement approach, and a formal framework for assessing intentionality and responsibility in probabilistic autonomous agents.

Findings

01

Safety shields prevent collisions in autonomous driving simulations.

02

Fairness shields effectively enforce group fairness with minimal intervention.

03

Metrics for agency and intention enable retrospective responsibility analysis.

Abstract

Ensuring responsible use of artificial intelligence (AI) has become imperative as autonomous systems increasingly influence critical societal domains. However, the concept of trustworthy AI remains broad and multi-faceted. This thesis advances knowledge in the safety, fairness, transparency, and accountability of AI systems. In safety, we extend classical deterministic shielding techniques to become resilient against delayed observations, enabling practical deployment in real-world conditions. We also implement both deterministic and probabilistic safety shields into simulated autonomous vehicles to prevent collisions with road users, validating the use of these techniques in realistic driving simulators. We introduce fairness shields, a novel post-processing approach to enforce group fairness in sequential decision-making settings over finite and periodic time horizons. By optimizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.