AI Risk Management Should Incorporate Both Safety and Security

Xiangyu Qi; Yangsibo Huang; Yi Zeng; Edoardo Debenedetti; Jonas; Geiping; Luxi He; Kaixuan Huang; Udari Madhushani; Vikash Sehwag; Weijia Shi,; Boyi Wei; Tinghao Xie; Danqi Chen; Pin-Yu Chen; Jeffrey Ding; Ruoxi Jia,; Jiaqi Ma; Arvind Narayanan; Weijie J Su; Mengdi Wang; Chaowei Xiao; Bo Li,; Dawn Song; Peter Henderson; Prateek Mittal

arXiv:2405.19524·cs.CR·May 31, 2024·1 cites

AI Risk Management Should Incorporate Both Safety and Security

Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas, Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi,, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia,, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang

PDF

Open Access

TL;DR

This paper emphasizes the importance of integrating both safety and security perspectives in AI risk management, highlighting their interplay, conceptual differences, and proposing a unified framework for better collaboration.

Contribution

It introduces a unified reference framework to clarify the distinctions and interactions between AI safety and security, promoting holistic risk mitigation.

Findings

01

Safety and security are often treated separately, causing gaps in risk management.

02

A unified framework helps clarify concepts and improve cross-disciplinary collaboration.

03

Integrating safety and security leads to more effective AI risk mitigation strategies.

Abstract

The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this paper, we advocate that stakeholders in AI risk management should be aware of the nuances, synergies, and interplay between safety and security, and unambiguously take into account the perspectives of both disciplines in order to devise mostly effective and holistic risk mitigation approaches. Unfortunately, this vision is often obfuscated, as the definitions of the basic concepts of "safety" and "security" themselves are often inconsistent and lack consensus across communities. With AI risk…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Ethics and Social Impacts of AI

MethodsAttentive Walk-Aggregating Graph Neural Network