Decoupled PFNs: Identifiable Epistemic-Aleatoric Decomposition via Structured Synthetic Priors

Richard Bergna; Stefan Depeweg; Jos\'e Miguel Hern\'andez-Lobato

arXiv:2605.06413·stat.ML·May 8, 2026

Decoupled PFNs: Identifiable Epistemic-Aleatoric Decomposition via Structured Synthetic Priors

Richard Bergna, Stefan Depeweg, Jos\'e Miguel Hern\'andez-Lobato

PDF

TL;DR

This paper introduces decoupled PFNs that explicitly separate epistemic uncertainty from aleatoric noise using synthetic priors, improving decision-making in noisy, heteroscedastic settings.

Contribution

It presents a novel decoupled PFN architecture with separate signal and noise heads, enabling better epistemic uncertainty estimation from synthetic task data.

Findings

01

Decoupled PFNs outperform baseline models in hyperparameter optimization.

02

Epistemic-only acquisition reduces failure modes in noisy environments.

03

Decoupled models achieve the best average rank in synthetic Bayesian optimization.

Abstract

Prior-Fitted Networks (PFNs) amortize Bayesian prediction by meta-learning over a synthetic task prior, but their standard output is a posterior predictive distribution over noisy observations. For sequential decision-making, such as active learning and Bayesian optimization, acquisition should prioritize epistemic uncertainty about the latent signal rather than irreducible aleatoric observation noise. We show that this epistemic--aleatoric split is not identifiable in general from the posterior predictive distribution alone, even when that distribution is known exactly. We then exploit a distinctive advantage of PFNs: because the synthetic data-generating process is under our control, each task can contain an explicit latent signal and noise function, and the generator can provide query-level labels for both the noiseless target and the observation-noise variance. We use these labels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.