Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks

Joery A. de Vries; Jinke He; Mathijs M. de Weerdt; Matthijs T.J. Spaan

arXiv:2505.18591 · cs.LG · November 20, 2025

TL;DR

This paper introduces a Laplace approximation method for Bayesian meta-reinforcement learning, enhancing uncertainty estimation in recurrent neural network-based agents without altering their architecture.

Contribution

The paper proposes a Laplace-based approach that turns the point estimate of a recurrent meta-RL agent into a full posterior distribution, improving uncertainty quantification while using fewer parameters than full Bayesian approaches.

Findings

1. The Laplace approximation recovers distribution statistics, such as the entropy, from non-Bayesian agents.

2. Point-estimate methods tend to produce overconfident estimators.

3. Performance matches full Bayesian approaches while using fewer parameters.

Abstract

Meta-reinforcement learning trains a single reinforcement learning agent on a distribution of tasks to quickly generalize to new tasks outside of the training set at test time. From a Bayesian perspective, one can interpret this as performing amortized variational inference on the posterior distribution over training tasks. Among the various meta-reinforcement learning approaches, a common method is to represent this distribution with a point-estimate using a recurrent neural network. We show how one can augment this point estimate to give full distributions through the Laplace approximation, either at the start of, during, or after learning, without modifying the base model architecture. With our approximation, we are able to estimate distribution statistics (e.g., the entropy) of non-Bayesian agents and observe that point-estimate based methods produce overconfident estimators while…
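
To make the central idea concrete, the following is a minimal sketch of a Laplace approximation in plain Python. It is not the authors' implementation: the toy density `neg_log_posterior`, the helper names, and the 2-D latent are all assumptions chosen for illustration. The sketch treats a given point estimate as the mode of a posterior, uses the curvature (Hessian) of the negative log-density at that point as a Gaussian precision matrix, and reads off the entropy of the resulting Gaussian.

```python
import math


def neg_log_posterior(z):
    """Toy negative log-density over a 2-D latent z (hypothetical, not from the paper)."""
    z1, z2 = z
    return 0.5 * (4.0 * z1 * z1 + 2.0 * z1 * z2 + 2.0 * z2 * z2) + 0.25 * z1 ** 4


def hessian_2d(f, x, eps=1e-4):
    """Central finite-difference Hessian of f at a 2-D point x."""
    def shifted(i, di, j, dj):
        p = list(x)
        p[i] += di
        p[j] += dj
        return f(p)

    hess = [[0.0, 0.0], [0.0, 0.0]]
    for i in range(2):
        for j in range(2):
            hess[i][j] = (
                shifted(i, eps, j, eps) - shifted(i, eps, j, -eps)
                - shifted(i, -eps, j, eps) + shifted(i, -eps, j, -eps)
            ) / (4.0 * eps * eps)
    return hess


def laplace_approximation(f, mode):
    """Gaussian approximation N(mode, H^-1), with H the Hessian of f at the mode.

    Returns (precision, covariance, entropy) for the 2-D case.
    """
    precision = hessian_2d(f, mode)
    (a, b), (c, d) = precision
    det = a * d - b * c
    if a <= 0.0 or det <= 0.0:
        raise ValueError("Hessian is not positive definite; the point is not a mode.")
    # Closed-form inverse of a 2x2 matrix gives the covariance.
    covariance = [[d / det, -b / det], [-c / det, a / det]]
    # Entropy of a 2-D Gaussian: 0.5 * log det(2*pi*e * covariance).
    entropy = 0.5 * (2.0 * math.log(2.0 * math.pi * math.e) - math.log(det))
    return precision, covariance, entropy


# Assumption: the point estimate plays the role of the agent's deterministic output.
point_estimate = [0.0, 0.0]
precision, covariance, entropy = laplace_approximation(neg_log_posterior, point_estimate)
print("precision:", precision)
print("covariance:", covariance)
print("entropy:", entropy)
```

A sharper curvature at the point estimate yields a smaller covariance and a lower entropy, which is the kind of distribution statistic the abstract refers to. In a real agent the Hessian would more plausibly come from automatic differentiation than from finite differences; the finite-difference version is used here only to keep the sketch dependency-free.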

Code & Models

Repositories

joeryjoery/laplace-vrnn (JAX, official)

Taxonomy

Topics: Fault Detection and Control Systems

Methods: Variational Inference · Balanced Selection · Sparse Evolutionary Training