# Neural Network Architecture Search with Differentiable Cartesian Genetic   Programming for Regression

**Authors:** Marcus M\"artens, Dario Izzo

arXiv: 1907.01939 · 2019-07-04

## TL;DR

This paper introduces a differentiable Cartesian Genetic Programming approach combined with a memetic algorithm to automatically design neural network architectures optimized for regression tasks, reducing parameters and improving accuracy.

## Contribution

It presents a novel method that integrates differentiable genetic programming with evolutionary search to optimize neural network topology and parameters simultaneously.

## Key findings

- Evolved architectures have fewer parameters and better accuracy.
- The method effectively rewires, prunes, and adds skip connections.
- Improved training efficiency and model performance on regression tasks.

## Abstract

The ability to design complex neural network architectures which enable effective training by stochastic gradient descent has been the key for many achievements in the field of deep learning. However, developing such architectures remains a challenging and resourceintensive process full of trial-and-error iterations. All in all, the relation between the network topology and its ability to model the data remains poorly understood. We propose to encode neural networks with a differentiable variant of Cartesian Genetic Programming (dCGPANN) and present a memetic algorithm for architecture design: local searches with gradient descent learn the network parameters while evolutionary operators act on the dCGPANN genes shaping the network architecture towards faster learning. Studying a particular instance of such a learning scheme, we are able to improve the starting feed forward topology by learning how to rewire and prune links, adapt activation functions and introduce skip connections for chosen regression tasks. The evolved network architectures require less space for network parameters and reach, given the same amount of time, a significantly lower error on average.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.01939/full.md

## Figures

30 figures with captions in the complete paper: https://tomesphere.com/paper/1907.01939/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1907.01939/full.md

---
Source: https://tomesphere.com/paper/1907.01939