Modeling Transformative AI Risks (MTAIR) Project -- Summary Report
Sam Clarke, Ben Cottier, Aryeh Englander, Daniel Eth, David Manheim,, Samuel Dylan Martin, Issa Rice

TL;DR
This report presents a comprehensive model of key hypotheses, uncertainties, and debates surrounding catastrophic risks from advanced AI, aiming to improve understanding and decision-making through an integrated, software-based approach.
Contribution
It introduces a novel, integrated model of AI risk debates based on extensive literature review and expert input, with a software implementation for exploration and planning.
Findings
Mapped key hypotheses and uncertainties in AI risk debates
Developed a software tool for quantitative exploration of AI risk scenarios
Identified critical factors influencing AI safety and failure modes
Abstract
This report outlines work by the Modeling Transformative AI Risk (MTAIR) project, an attempt to map out the key hypotheses, uncertainties, and disagreements in debates about catastrophic risks from advanced AI, and the relationships between them. This builds on an earlier diagram by Ben Cottier and Rohin Shah which laid out some of the crucial disagreements ("cruxes") visually, with some explanation. Based on an extensive literature review and engagement with experts, the report explains a model of the issues involved, and the initial software-based implementation that can incorporate probability estimates or other quantitative factors to enable exploration, planning, and/or decision support. By gathering information from various debates and discussions into a single more coherent presentation, we hope to enable better discussions and debates about the issues involved. The model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Safety Analysis · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning
