Few-Shot Learning by Dimensionality Reduction in Gradient Space

Martin Gauch; Maximilian Beck; Thomas Adler; Dmytro Kotsur; Stefan; Fiel; Hamid Eghbal-zadeh; Johannes Brandstetter; Johannes Kofler; Markus; Holzleitner; Werner Zellinger; Daniel Klotz; Sepp Hochreiter; Sebastian; Lehner

arXiv:2206.03483·cs.LG·January 30, 2023·1 cites

Few-Shot Learning by Dimensionality Reduction in Gradient Space

Martin Gauch, Maximilian Beck, Thomas Adler, Dmytro Kotsur, Stefan, Fiel, Hamid Eghbal-zadeh, Johannes Brandstetter, Johannes Kofler, Markus, Holzleitner, Werner Zellinger, Daniel Klotz, Sepp Hochreiter, Sebastian, Lehner

PDF

Open Access 1 Repo

TL;DR

This paper presents SubGD, a new few-shot learning method that leverages low-dimensional gradient subspaces identified through eigendecomposition, improving generalization and efficiency across various dynamical systems.

Contribution

SubGD introduces a novel approach to few-shot learning by identifying low-dimensional gradient subspaces, combining theoretical insights and empirical validation.

Findings

01

SubGD outperforms existing methods in sample efficiency.

02

It effectively identifies suitable subspaces for diverse dynamical systems.

03

The method generalizes well across different tasks.

Abstract

We introduce SubGD, a novel few-shot learning method which is based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that models confined to a suitable predefined subspace generalize well for few-shot learning. A suitable subspace fulfills three criteria across the given tasks: it (a) allows to reduce the training error by gradient flow, (b) leads to models that generalize well, and (c) can be identified by stochastic gradient descent. SubGD identifies these subspaces from an eigendecomposition of the auto-correlation matrix of update directions across different tasks. Demonstrably, we can identify low-dimensional suitable subspaces for few-shot learning of dynamical systems, which have varying properties described by one or few parameters of the analytical system…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ml-jku/subgd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Gaussian Processes and Bayesian Inference