Decoupled PFNs: Identifiable Epistemic-Aleatoric Decomposition via Structured Synthetic Priors
Richard Bergna, Stefan Depeweg, Jos\'e Miguel Hern\'andez-Lobato

TL;DR
This paper introduces decoupled PFNs that explicitly separate epistemic uncertainty from aleatoric noise using synthetic priors, improving decision-making in noisy, heteroscedastic settings.
Contribution
It presents a novel decoupled PFN architecture with separate signal and noise heads, enabling better epistemic uncertainty estimation from synthetic task data.
Findings
Decoupled PFNs outperform baseline models in hyperparameter optimization.
Epistemic-only acquisition reduces failure modes in noisy environments.
Decoupled models achieve the best average rank in synthetic Bayesian optimization.
Abstract
Prior-Fitted Networks (PFNs) amortize Bayesian prediction by meta-learning over a synthetic task prior, but their standard output is a posterior predictive distribution over noisy observations. For sequential decision-making, such as active learning and Bayesian optimization, acquisition should prioritize epistemic uncertainty about the latent signal rather than irreducible aleatoric observation noise. We show that this epistemic--aleatoric split is not identifiable in general from the posterior predictive distribution alone, even when that distribution is known exactly. We then exploit a distinctive advantage of PFNs: because the synthetic data-generating process is under our control, each task can contain an explicit latent signal and noise function, and the generator can provide query-level labels for both the noiseless target and the observation-noise variance. We use these labels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
