
TL;DR
This paper explores the parallels between jurisprudence and AI alignment, suggesting that legal interpretive methods can inform how AI systems conform to human values.
Contribution
It introduces a legally inspired framework for AI alignment, drawing on jurisprudential theories such as interpretivism and positivism to improve alignment strategies.
Findings
Legal interpretive methods can enhance AI alignment techniques.
Constitutional AI and case-based reasoning offer promising alignment approaches.
Legal insights can help improve AI's understanding of rules and cases.
Abstract
Jurisprudence, the study of how judges should properly decide cases, and alignment, the science of getting AI models to conform to human values, share a fundamental structure. These seemingly distant fields both seek to predict and shape how decisions by powerful actors, in one case judges and in the other increasingly powerful artificial intelligences, will be made in the unknown future. And they use similar tools of the specification and interpretation of language to try to accomplish those goals. The great debates of jurisprudence, about what the law is and what it should be, can provide insight into alignment, and lessons from what does and does not work in alignment can help make progress in jurisprudence. This essay puts the two fields directly into conversation. Drawing on leading accounts of jurisprudence, particularly Dworkin's principle-oriented interpretivism and Sunstein's…
