Resource Rational Contractualism Should Guide AI Alignment

Sydney Levine; Matija Franklin; Tan Zhi-Xuan; Secil Yanik Guyot; Lionel Wong; Daniel Kilov; Yejin Choi; Joshua B. Tenenbaum; Noah Goodman; Seth Lazar; Iason Gabriel

arXiv:2506.17434·cs.AI·March 17, 2026

Resource Rational Contractualism Should Guide AI Alignment

Sydney Levine, Matija Franklin, Tan Zhi-Xuan, Secil Yanik Guyot, Lionel Wong, Daniel Kilov, Yejin Choi, Joshua B. Tenenbaum, Noah Goodman, Seth Lazar, Iason Gabriel

PDF

TL;DR

This paper introduces Resource-Rational Contractualism, a framework enabling AI systems to efficiently approximate human agreements by using heuristics, facilitating adaptive and scalable alignment in complex social environments.

Contribution

It proposes a novel resource-rational approach to contractualist AI alignment, combining heuristics and normative principles for scalable decision-making.

Findings

01

Framework enables efficient approximation of human agreements

02

Allows AI to adapt dynamically to social changes

03

Balances effort and accuracy in decision processes

Abstract

AI systems will soon have to navigate human environments and make decisions that affect people and other AI agents whose goals and values diverge. Contractualist alignment proposes grounding those decisions in agreements that diverse stakeholders would endorse under the right conditions, yet securing such agreement at scale remains costly and slow -- even for advanced AI. We therefore propose Resource-Rational Contractualism (RRC): a framework where AI systems approximate the agreements rational parties would form by drawing on a toolbox of normatively-grounded, cognitively-inspired heuristics that trade effort for accuracy. An RRC-aligned agent would not only operate efficiently, but also be equipped to dynamically adapt to and interpret the ever-changing human social world.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.