Auditing Robot Learning for Safety and Compliance during Deployment
Homanga Bharadhwaj

TL;DR
This paper emphasizes the urgent need to develop auditing methods for robot learning algorithms to ensure safety and compliance with human intentions during autonomous operation, drawing inspiration from AI safety communities.
Contribution
It highlights the importance of auditing robot learning systems for failure modes and proposes high-level guidance and a conceptual approach without providing a concrete framework.
Findings
Auditing is crucial for safe autonomous robot operation.
Current lack of formal auditing frameworks for robot learning.
Proposes a high-level approach to guide future auditing efforts.
Abstract
Robots of the future are going to exhibit increasingly human-like and super-human intelligence in a myriad of different tasks. They are also likely going to fail and be incompliant with human preferences in increasingly subtle ways. Towards the goal of achieving autonomous robots, the robot learning community has made rapid strides in applying machine learning techniques to train robots through data and interaction. This makes the study of how best to audit these algorithms for checking their compatibility with humans, pertinent and urgent. In this paper, we draw inspiration from the AI Safety and Alignment communities and make the case that we need to urgently consider ways in which we can best audit our robot learning algorithms to check for failure modes, and ensure that when operating autonomously, they are indeed behaving in ways that the human algorithm designers intend them to.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Reinforcement Learning in Robotics
