Toward a Theory of Justice for Artificial Intelligence
Iason Gabriel

TL;DR
This paper proposes a framework based on Rawlsian justice to evaluate AI systems, emphasizing fairness, public justification, and support for the worst-off in society.
Contribution
It introduces a normative theory of justice for AI, integrating political philosophy with socio-technical system analysis to guide ethical AI deployment.
Findings
AI systems should adhere to egalitarian norms of justice
Public justification is essential for AI deployment
Attention to the impact on the worst-off improves fairness
Abstract
This paper explores the relationship between artificial intelligence and principles of distributive justice. Drawing upon the political philosophy of John Rawls, it holds that the basic structure of society should be understood as a composite of socio-technical systems, and that the operation of these systems is increasingly shaped and influenced by AI. As a consequence, egalitarian norms of justice apply to the technology when it is deployed in these contexts. These norms entail that the relevant AI systems must meet a certain standard of public justification, support citizens rights, and promote substantively fair outcomes -- something that requires specific attention be paid to the impact they have on the worst-off members of society.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
