Implicit Regularization via Neural Feature Alignment

Aristide Baratin; Thomas George; C\'esar Laurent; R Devon Hjelm,; Guillaume Lajoie; Pascal Vincent; Simon Lacoste-Julien

arXiv:2008.00938·cs.LG·March 18, 2021·6 cites

Implicit Regularization via Neural Feature Alignment

Aristide Baratin, Thomas George, C\'esar Laurent, R Devon Hjelm,, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

PDF

Open Access 1 Repo

TL;DR

This paper investigates how neural networks implicitly regularize during training by aligning their features along task-relevant directions, leading to feature selection and compression, supported by a new complexity measure based on tangent kernel analysis.

Contribution

It introduces a geometric perspective on implicit regularization, highlighting feature alignment effects and proposing a heuristic complexity measure related to tangent kernel evolution.

Findings

01

Neural tangent features align along task-relevant directions during training.

02

Feature alignment acts as a form of implicit regularization.

03

A new complexity measure based on tangent kernel sequences explains this phenomenon.

Abstract

We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al, along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rademacher complexity bounds for linear models, we motivate and study a heuristic complexity measure that captures this phenomenon, in terms of sequences of tangent kernel classes along optimization paths.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tfjgeorge/ntk_alignment
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms

MethodsFeature Selection