Gated Linear Networks

Joel Veness; Tor Lattimore; David Budden; Avishkar Bhoopchand,; Christopher Mattern; Agnieszka Grabska-Barwinska; Eren Sezener; Jianan Wang,; Peter Toth; Simon Schmitt; Marcus Hutter

arXiv:1910.01526·cs.LG·June 12, 2020

Gated Linear Networks

Joel Veness, Tor Lattimore, David Budden, Avishkar Bhoopchand,, Christopher Mattern, Agnieszka Grabska-Barwinska, Eren Sezener, Jianan Wang,, Peter Toth, Simon Schmitt, Marcus Hutter

PDF

2 Repos 1 Video

TL;DR

Gated Linear Networks (GLNs) are a new type of neural architecture that enables rapid online learning without backpropagation, offering universal learning capabilities and resilience to catastrophic forgetting, suitable for real-time applications.

Contribution

This paper introduces Gated Linear Networks, a novel neural architecture with local credit assignment and online convex optimization, distinct from traditional backpropagation-based models.

Findings

01

GLNs achieve universal learning in the limit as network size increases.

02

GLNs demonstrate strong resilience to catastrophic forgetting.

03

GLNs perform comparably to dropout and Elastic Weight Consolidation methods on benchmarks.

Abstract

This paper presents a new family of backpropagation-free neural architectures, Gated Linear Networks (GLNs). What distinguishes GLNs from contemporary neural networks is the distributed and local nature of their credit assignment mechanism; each neuron directly predicts the target, forgoing the ability to learn feature representations in favor of rapid online learning. Individual neurons can model nonlinear functions via the use of data-dependent gating in conjunction with online convex optimization. We show that this architecture gives rise to universal learning capabilities in the limit, with effective model capacity increasing as a function of network size in a manner comparable with deep ReLU networks. Furthermore, we demonstrate that the GLN learning mechanism possesses extraordinary resilience to catastrophic forgetting, performing comparably to a MLP with dropout and Elastic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Gated Linear Networks· underline

Taxonomy

MethodsGated Linear Network · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia?