Risk Alignment in Agentic AI Systems

Hayley Clatterbuck; Clinton Castro; Arvo Mu\~noz Mor\'an

arXiv:2410.01927·cs.CY·October 4, 2024·3 cites

Risk Alignment in Agentic AI Systems

Hayley Clatterbuck, Clinton Castro, Arvo Mu\~noz Mor\'an

PDF

Open Access

TL;DR

This paper explores the importance of aligning risk attitudes in agentic AI systems to ensure safety, trust, and societal benefit, addressing normative and technical challenges in designing such systems.

Contribution

It introduces a framework for understanding and designing risk alignment in agentic AIs, highlighting key normative and technical considerations.

Findings

01

Risk attitudes influence AI safety and trust.

02

Designing calibrated risk profiles can mitigate societal risks.

03

Addressing responsibility gaps is crucial for accountability.

Abstract

Agentic AIs $-$ AIs that are capable and permitted to undertake complex actions with little supervision $-$ mark a new frontier in AI capabilities and raise new questions about how to safely create and align such systems with users, developers, and society. Because agents' actions are influenced by their attitudes toward risk, one key aspect of alignment concerns the risk profiles of agentic AIs. Risk alignment will matter for user satisfaction and trust, but it will also have important ramifications for society more broadly, especially as agentic AIs become more autonomous and are allowed to control key aspects of our lives. AIs with reckless attitudes toward risk (either because they are calibrated to reckless human users or are poorly designed) may pose significant threats. They might also open 'responsibility gaps' in which there is no agent who can be held accountable for harmful…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · AI-based Problem Solving and Planning · Reinforcement Learning in Robotics

MethodsALIGN