Exploring the Constraints on Artificial General Intelligence: A Game-Theoretic No-Go Theorem
Mehmet S. Ismail

TL;DR
This paper introduces a game-theoretic framework revealing fundamental constraints on developing superhuman AI, showing that certain assumptions about AI and human interactions cannot all hold simultaneously, guiding policy recommendations.
Contribution
It presents a novel game-theoretic no-go theorem demonstrating the incompatibility of key assumptions in superhuman AI development, informing policy and theoretical understanding.
Findings
Four key assumptions are mutually incompatible when combined.
Relaxing any assumption leads to a consistent framework.
Policy suggestions include controlling data access and research collaboration.
Abstract
The emergence of increasingly sophisticated artificial intelligence (AI) systems have sparked intense debate among researchers, policymakers, and the public due to their potential to surpass human intelligence and capabilities in all domains. In this paper, I propose a game-theoretic framework that captures the strategic interactions between a human agent and a potential superhuman machine agent. I identify four key assumptions: Strategic Unpredictability, Access to Machine's Strategy, Rationality, and Superhuman Machine. The main result of this paper is an impossibility theorem: these four assumptions are inconsistent when taken together, but relaxing any one of them results in a consistent set of assumptions. Two straightforward policy recommendations follow: first, policymakers should control access to specific human data to maintain Strategic Unpredictability; and second, they…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
