Controlling the Inductive Bias of Wide Neural Networks by Modifying the   Kernel's Spectrum

Amnon Geifman; Daniel Barzilai; Ronen Basri; Meirav Galun

arXiv:2307.14531·cs.LG·March 21, 2024

Controlling the Inductive Bias of Wide Neural Networks by Modifying the Kernel's Spectrum

Amnon Geifman, Daniel Barzilai, Ronen Basri, Meirav Galun

PDF

Open Access

TL;DR

This paper introduces Modified Spectrum Kernels (MSKs) to adjust the inductive bias of wide neural networks by manipulating the kernel's spectrum, enabling faster training without affecting the final learned function.

Contribution

The paper proposes a novel family of kernels, MSKs, and a preconditioned gradient descent method to modify neural network bias and accelerate training.

Findings

01

Enables polynomial and exponential training speedups

02

Maintains the final solution despite bias modification

03

Efficient and easy to implement method

Abstract

Wide neural networks are biased towards learning certain functions, influencing both the rate of convergence of gradient descent (GD) and the functions that are reachable with GD in finite training time. As such, there is a great need for methods that can modify this bias according to the task at hand. To that end, we introduce Modified Spectrum Kernels (MSKs), a novel family of constructed kernels that can be used to approximate kernels with desired eigenvalues for which no closed form is known. We leverage the duality between wide neural networks and Neural Tangent Kernels and propose a preconditioned gradient descent method, which alters the trajectory of GD. As a result, this allows for a polynomial and, in some cases, exponential training speedup without changing the final solution. Our method is both computationally efficient and simple to implement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Neural Networks and Applications · Advanced Neural Network Applications