Putting An End to End-to-End: Gradient-Isolated Learning of   Representations

Sindy L\"owe; Peter O'Connor; Bastiaan S. Veeling

arXiv:1905.11786·cs.LG·January 28, 2020·49 cites

Putting An End to End-to-End: Gradient-Isolated Learning of Representations

Sindy L\"owe, Peter O'Connor, Bastiaan S. Veeling

PDF

Open Access 1 Repo

TL;DR

This paper introduces a biologically inspired, gradient-isolated training method for deep neural networks that enables scalable, label-free, self-supervised learning by training modules independently to preserve input information, achieving competitive results.

Contribution

It presents a novel module-based training approach that avoids end-to-end backpropagation, inspired by biological neural networks, allowing scalable, asynchronous training on unlabelled data.

Findings

01

Modules improve upon their predecessors' outputs

02

Achieves competitive downstream classification results

03

Enables large-scale distributed training

Abstract

We propose a novel deep learning method for local self-supervised representation learning that does not require labels nor end-to-end backpropagation but exploits the natural order in data instead. Inspired by the observation that biological neural networks appear to learn without backpropagating a global error signal, we split a deep neural network into a stack of gradient-isolated modules. Each module is trained to maximally preserve the information of its inputs using the InfoNCE bound from Oord et al. [2018]. Despite this greedy training, we demonstrate that each module improves upon the output of its predecessor, and that the representations created by the top module yield highly competitive results on downstream classification tasks in the audio and visual domain. The proposal enables optimizing modules asynchronously, allowing large-scale distributed training of very deep neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

loeweX/Greedy_InfoMax
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Music and Audio Processing · Neural Networks and Applications

Methods1x1 Convolution · Dense Connections · Feedforward Network · Bottleneck Residual Block · Bitcoin Customer Service Number +1-833-534-1729 · Residual Connection · Residual Block · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Global Average Pooling