Safe Transformative AI via a Windfall Clause
Paolo Bova, Jonas Emanuel M\"uller, Benjamin Harack

TL;DR
This paper proposes a Windfall Clause mechanism where AI developers commit to donating large profits to good causes, encouraging safe AI development amidst fierce competition.
Contribution
It extends a TAI competition model to show how Windfall Clauses can be credible and beneficial, promoting safety and cooperation among firms.
Findings
Firms benefit from joining Windfall Clauses in various scenarios.
Higher competition danger increases firms' willingness to join.
Firms rarely withdraw support once committed.
Abstract
Society could soon see transformative artificial intelligence (TAI). Models of competition for TAI show firms face strong competitive pressure to deploy TAI systems before they are safe. This paper explores a proposed solution to this problem, a Windfall Clause, where developers commit to donating a significant portion of any eventual extremely large profits to good causes. However, a key challenge for a Windfall Clause is that firms must have reason to join one. Firms must also believe these commitments are credible. We extend a model of TAI competition with a Windfall Clause to show how firms and policymakers can design a Windfall Clause which overcomes these challenges. Encouragingly, firms benefit from joining a Windfall Clause under a wide range of scenarios. We also find that firms join the Windfall Clause more often when the competition is more dangerous. Even when firms learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Blockchain Technology Applications and Security · Digital Platforms and Economics
