Modeling Transformative AI Risks (MTAIR) Project -- Summary Report

Sam Clarke; Ben Cottier; Aryeh Englander; Daniel Eth; David Manheim,; Samuel Dylan Martin; Issa Rice

arXiv:2206.09360·cs.AI·June 22, 2022

Modeling Transformative AI Risks (MTAIR) Project -- Summary Report

Sam Clarke, Ben Cottier, Aryeh Englander, Daniel Eth, David Manheim,, Samuel Dylan Martin, Issa Rice

PDF

Open Access

TL;DR

This report presents a comprehensive model of key hypotheses, uncertainties, and debates surrounding catastrophic risks from advanced AI, aiming to improve understanding and decision-making through an integrated, software-based approach.

Contribution

It introduces a novel, integrated model of AI risk debates based on extensive literature review and expert input, with a software implementation for exploration and planning.

Findings

01

Mapped key hypotheses and uncertainties in AI risk debates

02

Developed a software tool for quantitative exploration of AI risk scenarios

03

Identified critical factors influencing AI safety and failure modes

Abstract

This report outlines work by the Modeling Transformative AI Risk (MTAIR) project, an attempt to map out the key hypotheses, uncertainties, and disagreements in debates about catastrophic risks from advanced AI, and the relationships between them. This builds on an earlier diagram by Ben Cottier and Rohin Shah which laid out some of the crucial disagreements ("cruxes") visually, with some explanation. Based on an extensive literature review and engagement with experts, the report explains a model of the issues involved, and the initial software-based implementation that can incorporate probability estimates or other quantitative factors to enable exploration, planning, and/or decision support. By gathering information from various debates and discussions into a single more coherent presentation, we hope to enable better discussions and debates about the issues involved. The model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Safety Analysis · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning