Training recurrent networks online without backtracking

Yann Ollivier; Corentin Tallec; Guillaume Charpiat

arXiv:1507.07680·cs.NE·November 24, 2015·26 cites

Training recurrent networks online without backtracking

Yann Ollivier, Corentin Tallec, Guillaume Charpiat

PDF

Open Access

TL;DR

The paper introduces NoBackTrack, an online, memoryless algorithm for training recurrent neural networks that avoids backpropagation through time, scales linearly, and performs well on long dependency tasks.

Contribution

It presents a novel unbiased gradient estimation method for online training of recurrent networks without backpropagation through time.

Findings

01

Scales linearly with the number of parameters.

02

Performs better than BPTT on long dependency tasks.

03

Maintains unbiased gradient estimates over time.

Abstract

We introduce the "NoBackTrack" algorithm to train the parameters of dynamical systems such as recurrent neural networks. This algorithm works in an online, memoryless setting, thus requiring no backpropagation through time, and is scalable, avoiding the large computational and memory cost of maintaining the full gradient of the current state with respect to the parameters. The algorithm essentially maintains, at each time, a single search direction in parameter space. The evolution of this search direction is partly stochastic and is constructed in such a way to provide, at every time, an unbiased random estimate of the gradient of the loss function with respect to the parameters. Because the gradient estimate is unbiased, on average over time the parameter is updated as it should. The resulting gradient estimate can then be fed to a lightweight Kalman-like filter to yield an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks