Affirmative safety: An approach to risk management for high-risk AI

Akash R. Wasil; Joshua Clymer; David Krueger; Emily Dardaman; Simeon; Campos; Evan R. Murphy

arXiv:2406.15371·cs.CY·June 25, 2024·1 cites

Affirmative safety: An approach to risk management for high-risk AI

Akash R. Wasil, Joshua Clymer, David Krueger, Emily Dardaman, Simeon, Campos, Evan R. Murphy

PDF

Open Access

TL;DR

This paper advocates for a proactive safety assurance approach for high-risk AI, requiring developers to demonstrate that their systems maintain risks below acceptable levels through technical and organizational evidence.

Contribution

It introduces the concept of affirmative safety cases for AI, outlining technical and operational evidence needed to support risk thresholds and proposing a framework aligned with existing risk management principles.

Findings

01

Proposes a new safety assurance framework for high-risk AI.

02

Identifies technical and organizational evidence types for safety cases.

03

Aligns AI safety practices with established high-risk industry standards.

Abstract

Prominent AI experts have suggested that companies developing high-risk AI systems should be required to show that such systems are safe before they can be developed or deployed. The goal of this paper is to expand on this idea and explore its implications for risk management. We argue that entities developing or deploying high-risk AI systems should be required to present evidence of affirmative safety: a proactive case that their activities keep risks below acceptable thresholds. We begin the paper by highlighting global security risks from AI that have been acknowledged by AI experts and world governments. Next, we briefly describe principles of risk management from other high-risk fields (e.g., nuclear safety). Then, we propose a risk management approach for advanced AI in which model developers must provide evidence that their activities keep certain risks below regulator-set…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Safety Analysis · Adversarial Robustness in Machine Learning · Occupational Health and Safety Research