Contextual Preference Distribution Learning

Benjamin Hudson; Laurent Charlin; Emma Frejinger

arXiv:2603.17139·cs.LG·March 19, 2026

Contextual Preference Distribution Learning

Benjamin Hudson, Laurent Charlin, Emma Frejinger

PDF

Open Access

TL;DR

This paper introduces a sequential learning pipeline that models human preference distributions conditioned on context, improving decision-making under uncertainty in risk-averse settings, demonstrated in a ridesharing simulation.

Contribution

It develops a novel method to learn and generate preference distributions conditioned on context, surpassing existing inverse optimization techniques in capturing shifts for risk-averse decisions.

Findings

01

Reduces average post-decision surprise significantly in simulations.

02

Outperforms risk-neutral and risk-averse baselines.

03

Provides a scalable approach for context-dependent preference modeling.

Abstract

Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization pipeline to learn preference distributions and leverage them to solve downstream problems, for example risk-averse formulations. We focus on human choice settings that can be formulated as (integer) linear programs. In such settings, existing inverse optimization and choice modelling methods infer preferences from observed choices but typically produce point estimates or fail to capture contextual shifts, making them unsuitable for risk-averse decision-making. Using a bounded-variance score function gradient estimator, we train a predictive model mapping contextual features to a rich class of parameterizable distributions. This approach yields a maximum likelihood estimate. The model generates scenarios…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Risk and Portfolio Optimization · Bayesian Modeling and Causal Inference