Evolutionary Training and Abstraction Yields Algorithmic Generalization   of Neural Computers

Daniel Tanneberg; Elmar Rueckert; Jan Peters

arXiv:2105.07957·cs.NE·May 18, 2021

Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers

Daniel Tanneberg, Elmar Rueckert, Jan Peters

PDF

TL;DR

This paper introduces the Neural Harvard Computer, a memory-augmented neural network architecture that employs abstraction and evolutionary training to learn scalable, transferable, and robust algorithmic solutions with strong generalization across diverse tasks.

Contribution

The paper presents the Neural Harvard Computer, a novel architecture that decouples algorithmic operations from data manipulations, enabling scalable and transferable learning of algorithms.

Findings

01

Achieves perfect generalization to unseen task configurations.

02

Learns scalable solutions beyond training complexities.

03

Independent of data representation and domain.

Abstract

A key feature of intelligent behaviour is the ability to learn abstract strategies that scale and transfer to unfamiliar problems. An abstract strategy solves every sample from a problem class, no matter its representation or complexity -- like algorithms in computer science. Neural networks are powerful models for processing sensory data, discovering hidden patterns, and learning complex functions, but they struggle to learn such iterative, sequential or hierarchical algorithmic strategies. Extending neural networks with external memories has increased their capacities in learning such strategies, but they are still prone to data variations, struggle to learn scalable and transferable solutions, and require massive training data. We present the Neural Harvard Computer (NHC), a memory-augmented network based architecture, that employs abstraction by decoupling algorithmic operations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.