HyperENTM: Evolving Scalable Neural Turing Machines through HyperNEAT

Jakob Merrild; Mikkel Angaju Rasmussen; Sebastian Risi

arXiv:1710.04748·cs.AI·October 16, 2017

HyperENTM: Evolving Scalable Neural Turing Machines through HyperNEAT

Jakob Merrild, Mikkel Angaju Rasmussen, Sebastian Risi

PDF

Open Access 1 Repo

TL;DR

This paper introduces HyperENTM, a scalable neural memory model that leverages HyperNEAT encoding to efficiently learn and generalize memory tasks to larger sizes without retraining.

Contribution

It presents a HyperNEAT-based Neural Turing Machine that encodes memory access geometrically, enabling scalable training and transfer to larger memory sizes.

Findings

01

Networks trained on small memory vectors can be scaled to larger sizes without retraining.

02

HyperNEAT encoding facilitates generalization to larger memory sizes in memory-augmented neural networks.

03

Results suggest potential for handling larger, more complex memory tasks in neural networks.

Abstract

Recent developments within memory-augmented neural networks have solved sequential problems requiring long-term memory, which are intractable for traditional neural networks. However, current approaches still struggle to scale to large memory sizes and sequence lengths. In this paper we show how access to memory can be encoded geometrically through a HyperNEAT-based Neural Turing Machine (HyperENTM). We demonstrate that using the indirect HyperNEAT encoding allows for training on small memory vectors in a bit-vector copy task and then applying the knowledge gained from such training to speed up training on larger size memory vectors. Additionally, we demonstrate that in some instances, networks trained to copy bit-vectors of size 9 can be scaled to sizes of 1,000 without further training. While the task in this paper is simple, these results could open up the problems amendable to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kalanzai/ENTM_CSharpPort
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Neural Networks and Reservoir Computing

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings