Bayesian Meta-Learning Through Variational Gaussian Processes
Vivek Myers, Nikhil Sardana

TL;DR
This paper introduces VMGP, a Bayesian meta-learning method based on Gaussian processes that can model complex, non-Gaussian uncertainties, outperforming existing methods on challenging benchmarks.
Contribution
We develop VMGP, extending Gaussian-process meta-learning to produce high-quality, non-Gaussian predictive uncertainties, addressing limitations of traditional Gaussian process models.
Findings
VMGP outperforms existing Bayesian meta-learning baselines on complex benchmarks.
The method effectively models non-Gaussian, discontinuous uncertainty structures.
Experimental results demonstrate significant improvements in predictive accuracy and uncertainty quantification.
Abstract
Recent advances in the field of meta-learning have tackled domains consisting of large numbers of small ("few-shot") supervised learning tasks. Meta-learning algorithms must be able to rapidly adapt to any individual few-shot task, fitting to a small support set within a task and using it to predict the labels of the task's query set. This problem setting can be extended to the Bayesian context, wherein rather than predicting a single label for each query data point, a model predicts a distribution of labels capturing its uncertainty. Successful methods in this domain include Bayesian ensembling of MAML-based models, Bayesian neural networks, and Gaussian processes with learned deep kernel and mean functions. While Gaussian processes have a robust Bayesian interpretation in the meta-learning context, they do not naturally model non-Gaussian predictive posteriors for expressing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
