Gaussian Prototypical Networks for Few-Shot Learning on Omniglot
Stanislav Fort

TL;DR
This paper introduces Gaussian prototypical networks, an extension of prototypical networks that models uncertainty with Gaussian covariances, leading to improved few-shot classification performance on Omniglot.
Contribution
The paper proposes Gaussian prototypical networks that incorporate uncertainty estimates into the embedding space, enhancing classification accuracy over standard prototypical networks.
Findings
Achieved state-of-the-art results in 1-shot and 5-shot classification on Omniglot.
Gaussian prototypical networks outperform vanilla prototypical networks with similar parameters.
Down-sampling training images further improves performance, indicating robustness in noisier datasets.
Abstract
We propose a novel architecture for -shot classification on the Omniglot dataset. Building on prototypical networks, we extend their architecture to what we call Gaussian prototypical networks. Prototypical networks learn a map between images and embedding vectors, and use their clustering for classification. In our model, a part of the encoder output is interpreted as a confidence region estimate about the embedding point, and expressed as a Gaussian covariance matrix. Our network then constructs a direction and class dependent distance metric on the embedding space, using uncertainties of individual data points as weights. We show that Gaussian prototypical networks are a preferred architecture over vanilla prototypical networks with an equivalent number of parameters. We report state-of-the-art performance in 1-shot and 5-shot classification both in 5-way and 20-way regime (for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
