Low-rank extended Kalman filtering for online learning of neural   networks from streaming data

Peter G. Chang; Gerardo Dur\'an-Mart\'in; Alexander Y Shestopaloff,; Matt Jones; Kevin Murphy

arXiv:2305.19535·stat.ML·June 29, 2023·2 cites

Low-rank extended Kalman filtering for online learning of neural networks from streaming data

Peter G. Chang, Gerardo Dur\'an-Mart\'in, Alexander Y Shestopaloff,, Matt Jones, Kevin Murphy

PDF

Open Access 1 Repo

TL;DR

This paper introduces a low-rank extended Kalman filtering method for online neural network learning from streaming data, offering a deterministic, efficient, and adaptive Bayesian inference approach that improves learning speed and responsiveness.

Contribution

The paper presents a novel low-rank plus diagonal EKF-based algorithm for online neural network parameter estimation, eliminating the need for step-size tuning and enhancing adaptation to non-stationary data.

Findings

01

Faster and more sample-efficient learning compared to stochastic variational inference.

02

Improved adaptation to changing data distributions.

03

Enhanced reward accumulation in contextual bandit applications.

Abstract

We propose an efficient online approximate Bayesian inference algorithm for estimating the parameters of a nonlinear function from a potentially non-stationary data stream. The method is based on the extended Kalman filter (EKF), but uses a novel low-rank plus diagonal decomposition of the posterior precision matrix, which gives a cost per step which is linear in the number of model parameters. In contrast to methods based on stochastic variational inference, our method is fully deterministic, and does not require step-size tuning. We show experimentally that this results in much faster (more sample efficient) learning, which results in more rapid adaptation to changing distributions, and faster accumulation of reward when used as part of a contextual bandit algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

probml/rebayes
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Gaussian Processes and Bayesian Inference · Blind Source Separation Techniques