On Learning to Think: Algorithmic Information Theory for Novel   Combinations of Reinforcement Learning Controllers and Recurrent Neural World   Models

Juergen Schmidhuber

arXiv:1511.09249·cs.AI·December 1, 2015·40 cites

On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models

Juergen Schmidhuber

PDF

Open Access

TL;DR

This paper proposes a novel RNN-based AI framework inspired by brains, enabling learning, reasoning, and planning through a predictive world model guided by algorithmic information theory.

Contribution

It introduces RNNAIs that actively query their world models for abstract reasoning and decision making, advancing beyond previous RNN RL models.

Findings

01

RNNAIs can learn from continuous task sequences

02

They can self-invent tasks to improve their models

03

The approach enables active reasoning and planning

Abstract

This paper addresses the general problem of reinforcement learning (RL) in partially observable environments. In 2013, our large RL recurrent neural networks (RNNs) learned from scratch to drive simulated cars from high-dimensional video input. However, real brains are more powerful in many ways. In particular, they learn a predictive model of their initially unknown environment, and somehow use it for abstract (e.g., hierarchical) planning and reasoning. Guided by algorithmic information theory, we describe RNN-based AIs (RNNAIs) designed to do the same. Such an RNNAI can be trained on never-ending sequences of tasks, some of them provided by the user, others invented by the RNNAI itself in a curious, playful fashion, to improve its RNN-based world model. Unlike our previous model-building RNN-based RL machines dating back to 1990, the RNNAI learns to actively query its model for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Advanced Memory and Neural Computing