Understanding and Avoiding AI Failures: A Practical Guide

Heather M. Williams; Roman V. Yampolskiy

arXiv:2104.12582·cs.CY·March 13, 2024

Understanding and Avoiding AI Failures: A Practical Guide

Heather M. Williams, Roman V. Yampolskiy

PDF

Open Access

TL;DR

This paper presents a comprehensive framework combining accident theory and safety principles to understand and mitigate risks in AI systems, emphasizing system properties over root causes for current AI safety.

Contribution

It introduces a novel integrated framework for AI risk assessment based on accident and safety theories, focusing on system properties and near-accident analysis.

Findings

01

Identifies key system properties linked to AI failures.

02

Provides a risk quantification method for AI safety.

03

Highlights safety focus areas for current AI systems.

Abstract

As AI technologies increase in capability and ubiquity, AI accidents are becoming more common. Based on normal accident theory, high reliability theory, and open systems theory, we create a framework for understanding the risks associated with AI applications. In addition, we also use AI safety principles to quantify the unique risks of increased intelligence and human-like qualities in AI. Together, these two fields give a more complete picture of the risks of contemporary AI. By focusing on system properties near accidents instead of seeking a root cause of accidents, we identify where attention should be paid to safety for current generation AI systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI