Align, then memorise: the dynamics of learning with feedback alignment
Maria Refinetti, St\'ephane d'Ascoli, Ruben Ohana, Sebastian Goldt

TL;DR
This paper develops a theory explaining why Feedback Alignment (DFA) successfully trains some neural networks by showing it undergoes an alignment and memorization process, with success depending on data structure and network architecture.
Contribution
It introduces a two-phase learning theory for DFA, highlighting the importance of gradient alignment and data structure, and explains its limitations with convolutional networks.
Findings
DFA converges to solutions with maximum gradient alignment.
Alignment in deep networks occurs sequentially from bottom to top layers.
Data structure influences the success of DFA in training neural networks.
Abstract
Direct Feedback Alignment (DFA) is emerging as an efficient and biologically plausible alternative to the ubiquitous backpropagation algorithm for training deep neural networks. Despite relying on random feedback weights for the backward pass, DFA successfully trains state-of-the-art models such as Transformers. On the other hand, it notoriously fails to train convolutional networks. An understanding of the inner workings of DFA to explain these diverging results remains elusive. Here, we propose a theory for the success of DFA. We first show that learning in shallow networks proceeds in two steps: an alignment phase, where the model adapts its weights to align the approximate gradient with the true gradient of the loss function, is followed by a memorisation phase, where the model focuses on fitting the data. This two-step process has a degeneracy breaking effect: out of all the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Model Reduction and Neural Networks · Neural Networks and Applications
MethodsDirect Feedback Alignment · Feedback Alignment
