
TL;DR
This paper proposes Admissibility Alignment, a probabilistic decision-theoretic framework for AI alignment, operationalized through MAP-AI, which evaluates policies over outcome distributions to ensure aligned decision-making under uncertainty.
Contribution
It introduces a novel system architecture, MAP-AI, that formalizes alignment as a probabilistic property and operationalizes it through Monte Carlo estimation and admissibility-controlled policy selection.
Findings
MAP-AI enforces alignment via outcome distribution evaluation.
Distributional properties guide decision-making under uncertainty.
Framework enables practical governance of AI systems based on policy behavior.
Abstract
This paper introduces Admissibility Alignment: a reframing of AI alignment as a property of admissible action and decision selection over distributions of outcomes under uncertainty, evaluated through the behavior of candidate policies. We present MAP-AI (Monte Carlo Alignment for Policy) as a canonical system architecture for operationalizing admissibility alignment, formalizing alignment as a probabilistic, decision-theoretic property rather than a static or binary condition. MAP-AI, a new control-plane system architecture for aligned decision-making under uncertainty, enforces alignment through Monte Carlo estimation of outcome distributions and admissibility-controlled policy selection rather than static model-level constraints. The framework evaluates decision policies across ensembles of plausible futures, explicitly modeling uncertainty, intervention effects, value ambiguity,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Big Data and Business Intelligence
