Beyond Bayes-optimality: meta-learning what you know you don't know
Jordi Grau-Moya, Gr\'egoire Del\'etang, Markus Kunesch, Tim Genewein,, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang,, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

TL;DR
This paper extends meta-learning agents beyond Bayes-optimality by developing algorithms that produce risk- and ambiguity-sensitive behaviors, aligning more closely with human decision-making in safety-critical contexts.
Contribution
It introduces modified meta-training algorithms that generate risk- and ambiguity-sensitive agents through altered experience-generation processes.
Findings
Agents exhibit risk sensitivity in decision-making tasks.
Agents demonstrate ambiguity sensitivity in novel situations.
Modified algorithms successfully produce non-Bayes-optimal behaviors.
Abstract
Meta-training agents with memory has been shown to culminate in Bayes-optimal agents, which casts Bayes-optimality as the implicit solution to a numerical optimization problem rather than an explicit modeling assumption. Bayes-optimal agents are risk-neutral, since they solely attune to the expected return, and ambiguity-neutral, since they act in new situations as if the uncertainty were known. This is in contrast to risk-sensitive agents, which additionally exploit the higher-order moments of the return, and ambiguity-sensitive agents, which act differently when recognizing situations in which they lack knowledge. Humans are also known to be averse to ambiguity and sensitive to risk in ways that aren't Bayes-optimal, indicating that such sensitivity can confer advantages, especially in safety-critical situations. How can we extend the meta-learning protocol to generate risk- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning
MethodsTest
