Align, then memorise: the dynamics of learning with feedback alignment

Maria Refinetti; Stéphane d'Ascoli; Ruben Ohana; Sebastian Goldt

arXiv:2011.12428·stat.ML·June 11, 2021

TL;DR

This paper develops a theory of why Direct Feedback Alignment (DFA) successfully trains some neural networks but not others. Learning proceeds through an alignment phase followed by a memorisation phase, and its success depends on data structure and network architecture.

Contribution

The paper introduces a two-phase theory of learning with DFA (alignment, then memorisation), highlights the role of gradient alignment and data structure, and explains why DFA fails to train convolutional networks.

Findings

01 · DFA converges to solutions with maximum gradient alignment.

02 · Alignment in deep networks occurs sequentially from bottom to top layers.

03 · Data structure influences the success of DFA in training neural networks.

Abstract

Direct Feedback Alignment (DFA) is emerging as an efficient and biologically plausible alternative to the ubiquitous backpropagation algorithm for training deep neural networks. Despite relying on random feedback weights for the backward pass, DFA successfully trains state-of-the-art models such as Transformers. On the other hand, it notoriously fails to train convolutional networks. An understanding of the inner workings of DFA to explain these diverging results remains elusive. Here, we propose a theory for the success of DFA. We first show that learning in shallow networks proceeds in two steps: an alignment phase, where the model adapts its weights to align the approximate gradient with the true gradient of the loss function, is followed by a memorisation phase, where the model focuses on fitting the data. This two-step process has a degeneracy breaking effect: out of all the…
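The mechanism the abstract describes, random feedback weights replacing the transposed forward weights in the backward pass, can be illustrated with a small NumPy sketch. This is not the authors' code: the teacher-student regression task, the tanh activation, the layer sizes, the learning rate, and the names `gradients`, `cosine`, and `train_dfa` are all assumptions chosen for illustration. The sketch trains a two-layer network with DFA and records, at every step, the cosine between the DFA update of the first layer and the backpropagation gradient it approximates.

```python
import numpy as np

def forward(W1, W2, X):
    """Two-layer tanh network: returns hidden activities H and outputs Y."""
    H = np.tanh(X @ W1.T)   # (N, K)
    Y = H @ W2.T            # (N, 1)
    return H, Y

def gradients(W1, W2, X, y, feedback):
    """Weight updates for the loss 0.5 * mean((Y - y)**2).

    `feedback` has shape (1, K) and carries the output error back to the
    hidden layer. Passing W2 gives backpropagation; passing a fixed random
    matrix gives the DFA update.
    """
    N = X.shape[0]
    H, Y = forward(W1, W2, X)
    e = Y - y                               # (N, 1) output error
    g_W2 = e.T @ H / N                      # (1, K), same for BP and DFA
    delta = (e @ feedback) * (1.0 - H**2)   # (N, K) hidden error signal
    g_W1 = delta.T @ X / N                  # (K, D)
    return g_W1, g_W2

def cosine(a, b):
    """Cosine similarity between two arrays, flattened."""
    a, b = a.ravel(), b.ravel()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def train_dfa(steps=500, lr=0.05, N=200, D=10, K=16, seed=0):
    """Train with DFA; return per-step loss and gradient alignment."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((N, D))
    # Assumed task: labels come from a random "teacher" of the same shape.
    _, y = forward(rng.standard_normal((K, D)) / np.sqrt(D),
                   rng.standard_normal((1, K)) / np.sqrt(K), X)
    W1 = rng.standard_normal((K, D)) / np.sqrt(D)
    W2 = 0.1 * rng.standard_normal((1, K))
    B = rng.standard_normal((1, K))         # fixed random feedback weights
    losses, alignments = [], []
    for _ in range(steps):
        g_W1_dfa, g_W2 = gradients(W1, W2, X, y, feedback=B)
        g_W1_bp, _ = gradients(W1, W2, X, y, feedback=W2)
        _, Y = forward(W1, W2, X)
        losses.append(0.5 * float(np.mean((Y - y) ** 2)))
        alignments.append(cosine(g_W1_dfa, g_W1_bp))
        W1 -= lr * g_W1_dfa                 # DFA update, not the true gradient
        W2 -= lr * g_W2
    return np.array(losses), np.array(alignments)

if __name__ == "__main__":
    losses, alignments = train_dfa()
    print("loss: first", losses[0], "last", losses[-1])
    print("alignment: first", alignments[0], "last", alignments[-1])
```

Plotting `alignments` against `losses` over the training steps is one way to inspect whether the two curves change on different time scales in this toy setting. The second-layer update needs no feedback matrix because the output error reaches it directly, so only the first-layer update differs between DFA and backpropagation here.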

Code & Models

Repositories

sdascoli/dfa-dynamics (PyTorch, official)

Videos

Align, then memorise: the dynamics of learning with feedback alignment · SlidesLive

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Model Reduction and Neural Networks · Neural Networks and Applications

Methods: Direct Feedback Alignment · Feedback Alignment