Gaussian Gated Linear Networks
David Budden, Adam Marblestone, Eren Sezener, Tor Lattimore, Greg, Wayne, Joel Veness

TL;DR
The paper introduces Gaussian Gated Linear Networks (G-GLN), a novel neural network model that learns efficiently without backpropagation, offering universality, interpretability, and robustness, with strong performance on regression and density estimation tasks.
Contribution
It extends the GLN framework to regression and density modeling using Gaussian mixing, demonstrating competitive performance and practical applicability.
Findings
Achieves state-of-the-art results on regression benchmarks.
Effective in online learning and density estimation tasks.
Robust and interpretable model with local credit assignment.
Abstract
We propose the Gaussian Gated Linear Network (G-GLN), an extension to the recently proposed GLN family of deep neural networks. Instead of using backpropagation to learn features, GLNs have a distributed and local credit assignment mechanism based on optimizing a convex objective. This gives rise to many desirable properties including universality, data-efficient online learning, trivial interpretability and robustness to catastrophic forgetting. We extend the GLN framework from classification to multiple regression and density modelling by generalizing geometric mixing to a product of Gaussian densities. The G-GLN achieves competitive or state-of-the-art performance on several univariate and multivariate regression benchmarks, and we demonstrate its applicability to practical tasks including online contextual bandits and density estimation via denoising.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Bayesian Modeling and Causal Inference
MethodsGated Linear Network · G-GLN Neuron · Gaussian Gated Linear Network · Interpretability
