The AI Risk Spectrum: From Dangerous Capabilities to Existential Threats
Markov Grey, Charbel-Rapha\"el Segerie

TL;DR
This paper comprehensively maps the spectrum of AI risks, from misuse and misalignment to systemic threats, emphasizing the importance of understanding and coordinating to prevent catastrophic outcomes.
Contribution
It introduces a detailed framework categorizing AI risks into three main types and identifies risk amplifiers, connecting current behaviors to future catastrophic scenarios.
Findings
AI risks are categorized into misuse, misalignment, and systemic threats.
Existing AI behaviors can escalate into catastrophic outcomes.
Coordination is essential to mitigate AI risks.
Abstract
As AI systems become more capable, integrated, and widespread, understanding the associated risks becomes increasingly important. This paper maps the full spectrum of AI risks, from current harms affecting individual users to existential threats that could endanger humanity's survival. We organize these risks into three main causal categories. Misuse risks, which occur when people deliberately use AI for harmful purposes - creating bioweapons, launching cyberattacks, adversarial AI attacks or deploying lethal autonomous weapons. Misalignment risks happen when AI systems pursue outcomes that conflict with human values, irrespective of developer intentions. This includes risks arising through specification gaming (reward hacking), scheming and power-seeking tendencies in pursuit of long-term strategic goals. Systemic risks, which arise when AI integrates into complex social systems in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
