BaTCAVe: Trustworthy Explanations for Robot Behaviors

Som Sagar; Aditya Taparia; Harsh Mankodiya; Pranav Bidare; Yifan Zhou; Ransalu Senanayake

arXiv:2409.10733·cs.RO·October 10, 2025

BaTCAVe: Trustworthy Explanations for Robot Behaviors

Som Sagar, Aditya Taparia, Harsh Mankodiya, Pranav Bidare, Yifan Zhou, Ransalu Senanayake

PDF

Open Access 1 Repo

TL;DR

This paper introduces BaTCAVe, a method for providing trustworthy, human-interpretable explanations with uncertainty scores for neural network decisions in robots, enhancing transparency and trustworthiness in real-world applications.

Contribution

It presents a novel explainability technique grounded in high-level concepts that offers uncertainty-aware explanations for robot neural networks, addressing gaps in existing explainable AI methods.

Findings

01

Effective in simulated and real-world robot models

02

Provides human-interpretable explanations with uncertainty scores

03

Enhances trust and diagnostics in robotic decision-making

Abstract

Black box neural networks are an indispensable part of modern robots. Nevertheless, deploying such high-stakes systems in real-world scenarios poses significant challenges when the stakeholders, such as engineers and legislative bodies, lack insights into the neural networks' decision-making process. Presently, explainable AI is primarily tailored to natural language processing and computer vision, falling short in two critical aspects when applied in robots: grounding in decision-making tasks and the ability to assess trustworthiness of their explanations. In this paper, we introduce a trustworthy explainable robotics technique based on human-interpretable, high-level concepts that attribute to the decisions made by the neural network. Our proposed technique provides explanations with associated uncertainty scores for the explanation by matching neural network's activations with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aditya-taparia/batcave
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning