Understanding a Robot's Guiding Ethical Principles via Automatically   Generated Explanations

Benjamin Krarup; Felix Lindner; Senka Krivic; Derek Long

arXiv:2206.10038·cs.AI·June 22, 2022·1 cites

Understanding a Robot's Guiding Ethical Principles via Automatically Generated Explanations

Benjamin Krarup, Felix Lindner, Senka Krivic, Derek Long

PDF

Open Access

TL;DR

This paper explores how contrastive and non-contrastive explanations can help humans understand the ethical principles guiding robot decision-making, enhancing transparency and trust.

Contribution

It introduces a method for automatically generating explanations based on an ethical framework, improving human understanding of robot ethics.

Findings

01

Generated explanations aid human comprehension of robot ethics

02

User study shows explanations improve trust in robot decisions

03

Contrastive explanations are particularly effective

Abstract

The continued development of robots has enabled their wider usage in human surroundings. Robots are more trusted to make increasingly important decisions with potentially critical outcomes. Therefore, it is essential to consider the ethical principles under which robots operate. In this paper we examine how contrastive and non-contrastive explanations can be used in understanding the ethics of robot action plans. We build upon an existing ethical framework to allow users to make suggestions about plans and receive automatically generated contrastive explanations. Results of a user study indicate that the generated explanations help humans to understand the ethical principles that underlie a robot's plan.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)