Optimal Use of Preferences in Artificial Intelligence Algorithms
Joshua S. Gans

TL;DR
This paper establishes decision problem-agnostic conditions under which separating preference-free training from preference-based post-processing in AI systems is optimal, providing theoretical foundations for modular and flexible AI pipeline design.
Contribution
It introduces a diminishing-value-of-information condition that explains when preference embedding is beneficial, extending prior work by removing the need to specify downstream objectives.
Findings
Preference embedding reduces the value of information at the margin.
Preference-free training weakly dominates for any expected utility decision.
Automating threshold computation can be advantageous under cognitive constraints.
Abstract
Machine learning systems embed preferences either in training losses or through post-processing of calibrated predictions. Applying information design methods from Strack and Yang (2024), this paper provides decision problem agnostic conditions under which separation training preference free and applying preferences ex post is optimal. Unlike prior work that requires specifying downstream objectives, the welfare results here apply uniformly across decision problems. The key primitive is a diminishing-value-of-information condition: relative to a fixed (normalised) preference-free loss, preference embedding makes informativeness less valuable at the margin, inducing a mean-preserving contraction of learned posteriors. Because the value of information is convex in beliefs, preference-free training weakly dominates for any expected utility decision problem. This provides theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
