Learning Representations by Stochastic Meta-Gradient Descent in Neural   Networks

Vivek Veeriah; Shangtong Zhang; Richard S. Sutton

arXiv:1612.02879·cs.LG·April 28, 2017

Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Vivek Veeriah, Shangtong Zhang, Richard S. Sutton

PDF

Open Access

TL;DR

This paper introduces crossprop, a novel incremental learning algorithm that uses meta-gradient descent to improve feature reuse in neural networks, enhancing adaptability to new tasks.

Contribution

The paper presents crossprop, a new meta-gradient based algorithm that generalizes backprop and promotes feature reuse in neural networks for incremental learning.

Findings

01

Crossprop effectively reuses features for new tasks.

02

Crossprop outperforms backprop in continual learning scenarios.

03

The algorithm introduces a memory parameter for each weight.

Abstract

Representations are fundamental to artificial intelligence. The performance of a learning system depends on the type of representation used for representing the data. Typically, these representations are hand-engineered using domain knowledge. More recently, the trend is to learn these representations through stochastic gradient descent in multi-layer neural networks, which is called backprop. Learning the representations directly from the incoming data stream reduces the human labour involved in designing a learning system. More importantly, this allows in scaling of a learning system for difficult tasks. In this paper, we introduce a new incremental learning algorithm called crossprop, which learns incoming weights of hidden units based on the meta-gradient descent approach, that was previously introduced by Sutton (1992) and Schraudolph (1999) for learning step-sizes. The final…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Neural Networks and Applications · Stochastic Gradient Optimization Techniques