# Neuromorphic Hardware learns to learn

**Authors:** Thomas Bohnstingl, Franz Scherr, Christian Pehle, Karlheinz Meier,, Wolfgang Maass

arXiv: 1903.06493 · 2019-06-11

## TL;DR

This paper introduces a method for optimizing neuromorphic hardware's learning algorithms using gradient-free techniques, enabling efficient reward-based learning and the first demonstration of Learning-to-Learn capabilities on such hardware.

## Contribution

It applies evolutionary and cross-entropy optimization methods to neuromorphic hardware, enhancing its learning efficiency and demonstrating Learning-to-Learn for the first time.

## Key findings

- Neuromorphic agents learn efficiently from rewards.
- Meta-plasticity significantly improves learning capabilities.
- First demonstration of Learning-to-Learn on neuromorphic hardware.

## Abstract

Hyperparameters and learning algorithms for neuromorphic hardware are usually chosen by hand. In contrast, the hyperparameters and learning algorithms of networks of neurons in the brain, which they aim to emulate, have been optimized through extensive evolutionary and developmental processes for specific ranges of computing and learning tasks. Occasionally this process has been emulated through genetic algorithms, but these require themselves hand-design of their details and tend to provide a limited range of improvements. We employ instead other powerful gradient-free optimization tools, such as cross-entropy methods and evolutionary strategies, in order to port the function of biological optimization processes to neuromorphic hardware. As an example, we show that this method produces neuromorphic agents that learn very efficiently from rewards. In particular, meta-plasticity, i.e., the optimization of the learning rule which they use, substantially enhances reward-based learning capability of the hardware. In addition, we demonstrate for the first time Learning-to-Learn benefits from such hardware, in particular, the capability to extract abstract knowledge from prior learning experiences that speeds up the learning of new but related tasks. Learning-to-Learn is especially suited for accelerated neuromorphic hardware, since it makes it feasible to carry out the required very large number of network computations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.06493/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1903.06493/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/1903.06493/full.md

---
Source: https://tomesphere.com/paper/1903.06493