Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?
Yoshua Bengio, Michael Cohen, Damiano Fornasiere, Joumana Ghosn, Pietro Greiner, Matt MacDermott, Sören Mindermann, Adam Oberman, Jesse Richardson, Oliver Richardson, Marc-Antoine Rondeau, Pierre-Luc St-Charles, David Williams-King

TL;DR
This paper discusses the risks of superintelligent AI agents and proposes Scientist AI, a non-agentic, trustworthy system designed to assist scientific research and enhance safety by avoiding autonomous agency.
Contribution
It introduces Scientist AI, a non-agentic AI framework with explicit uncertainty modeling, aimed at safer scientific assistance and risk mitigation in AI development.
Findings
Scientist AI is designed to explain data by generating theories.
The system operates with explicit uncertainty to reduce overconfidence.
It offers a safer alternative to current agentic AI systems.
Abstract
The leading AI companies are increasingly focused on building generalist AI agents -- systems that can autonomously plan, act, and pursue goals across almost all tasks that humans can perform. Despite how useful these systems might be, unchecked AI agency poses significant risks to public safety and security, ranging from misuse by malicious actors to a potentially irreversible loss of human control. We discuss how these risks arise from current AI training methods. Indeed, various scenarios and experiments have demonstrated the possibility of AI agents engaging in deception or pursuing goals that were not specified by human operators and that conflict with human interests, such as self-preservation. Following the precautionary principle, we see a strong need for safer, yet still useful, alternatives to the current agency-driven trajectory. Accordingly, we propose as a core building…
Taxonomy
Topics: Space Science and Extraterrestrial Life
