Leveraging Contextual Counterfactuals Toward Belief Calibration
Qiuyi (Richard) Zhang, Michael S. Lee, Sherol Chen

TL;DR
This paper introduces a belief calibration framework that uses contextual counterfactuals and multi-objective optimization to better align AI systems with diverse human beliefs across different contexts.
Contribution
It proposes a novel belief calibration cycle framework that incorporates epistemic uncertainty and counterfactual reasoning to improve belief alignment in AI systems.
Findings
Demonstrates the framework's effectiveness on a toy credit decision dataset.
Achieves a Pareto frontier of optimal belief strengths across contexts.
Enhances belief calibration by considering subjectivity and epistemic uncertainty.
Abstract
Beliefs and values are increasingly being incorporated into our AI systems through alignment processes, such as carefully curating data collection principles or regularizing the loss function used for training. However, the meta-alignment problem is that these human beliefs are diverse and not aligned across populations; furthermore, the implicit strength of each belief may not be well calibrated even among humans, especially when trying to generalize across contexts. Specifically, in high regret situations, we observe that contextual counterfactuals and recourse costs are particularly important in updating a decision maker's beliefs and the strengths to which such beliefs are held. Therefore, we argue that including counterfactuals is key to an accurate calibration of beliefs during alignment. To do this, we first segment belief diversity into two categories: subjectivity (across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Decision-Making and Behavioral Economics
MethodsCounterfactuals Explanations
