On the Equivalence of Random Network Distillation, Deep Ensembles, and Bayesian Inference

Moritz A. Zanger; Yijun Wu; Pascal R. Van der Vaart; Wendelin Böhmer; Matthijs T. J. Spaan

arXiv:2602.19964 · cs.LG · February 27, 2026

Open Access

TL;DR

This paper establishes theoretical connections between Random Network Distillation, deep ensembles, and Bayesian inference, showing they are equivalent in the infinite-width neural network limit, and introduces a Bayesian RND method for uncertainty quantification.

Contribution

It provides a rigorous theoretical framework linking RND with Bayesian inference and deep ensembles, and proposes a new Bayesian RND approach for sampling from the posterior.

Findings

01 RND squared error equals deep ensemble predictive variance

02 RND error distribution can mirror Bayesian posterior predictive distribution

03 Introduces Bayesian RND for exact Bayesian posterior sampling

Abstract

Uncertainty quantification is central to the safe and efficient deployment of deep learning models, yet many computationally practical methods lack rigorous theoretical motivation. Random network distillation (RND) is a lightweight technique that measures novelty via prediction errors against a fixed random target. While empirically effective, it has remained unclear what uncertainties RND measures and how its estimates relate to other approaches, e.g. Bayesian inference or deep ensembles. This paper establishes these missing theoretical connections by analyzing RND within the neural tangent kernel framework in the limit of infinite network width. Our analysis reveals two central findings in this limit: (1) The uncertainty signal from RND -- its squared self-predictive error -- is equivalent to the predictive variance of a deep ensemble. (2) By constructing a specific RND target…
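To make the RND mechanism described in the abstract concrete, here is a minimal sketch, not the paper's implementation. It assumes a fixed random Fourier feature map as a stand-in for a wide network, a linear random target on those features, and a predictor fitted by closed-form minimum-norm least squares rather than gradient descent. The names `make_features` and `rnd_novelty` are hypothetical.

```python
import numpy as np

def make_features(in_dim, width, rng):
    # Hypothetical fixed random feature map (random Fourier features),
    # used here as a crude stand-in for a very wide network.
    W = rng.normal(size=(in_dim, width))
    b = rng.uniform(0.0, 2.0 * np.pi, size=width)
    return lambda X: np.sqrt(2.0 / width) * np.cos(X @ W + b)

def rnd_novelty(X_train, X_query, width=512, seed=0):
    rng = np.random.default_rng(seed)
    phi = make_features(X_train.shape[1], width, rng)
    # Fixed random target: drawn once and never trained.
    w_target = rng.normal(size=width)
    Phi_train = phi(X_train)
    y_target = Phi_train @ w_target
    # Predictor: minimum-norm least-squares fit to the target's outputs
    # on the training inputs (an assumption replacing gradient descent).
    w_pred, *_ = np.linalg.lstsq(Phi_train, y_target, rcond=None)
    # RND signal: squared error between target and predictor at the queries.
    err = phi(X_query) @ (w_target - w_pred)
    return err ** 2

data_rng = np.random.default_rng(1)
X_train = data_rng.normal(size=(50, 2))
X_far = data_rng.normal(size=(50, 2)) + 6.0  # inputs far from the training data

seen = rnd_novelty(X_train, X_train)   # error on inputs the predictor was fit on
novel = rnd_novelty(X_train, X_far)    # error on unseen, distant inputs
print("mean squared error on seen inputs: ", seen.mean())
print("mean squared error on novel inputs:", novel.mean())
```

In this toy setting the squared error is near zero on inputs the predictor was fitted on and clearly larger on distant inputs, which is the novelty signal the abstract refers to. The sketch does not demonstrate the equivalence to deep ensembles or Bayesian inference.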

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis