# A Hybrid GA-PSO Method for Evolving Architecture and Short Connections   of Deep Convolutional Neural Networks

**Authors:** Bin Wang, Yanan Sun, Bing Xue, Mengjie Zhang

arXiv: 1903.03893 · 2019-03-12

## TL;DR

This paper introduces a hybrid evolutionary algorithm combining genetic algorithms and particle swarm optimization to automatically design CNN architectures with optimized shortcut connections, improving image classification accuracy.

## Contribution

It presents a novel encoding strategy and a hybrid EC method for automatic CNN architecture and shortcut connection evolution, enhancing design efficiency.

## Key findings

- Outperforms 12 non-EC competitors in accuracy
- Achieves superior results on three benchmark datasets
- Demonstrates effectiveness of hybrid EC approach

## Abstract

Image classification is a difficult machine learning task, where Convolutional Neural Networks (CNNs) have been applied for over 20 years in order to solve the problem. In recent years, instead of the traditional way of only connecting the current layer with its next layer, shortcut connections have been proposed to connect the current layer with its forward layers apart from its next layer, which has been proved to be able to facilitate the training process of deep CNNs. However, there are various ways to build the shortcut connections, it is hard to manually design the best shortcut connections when solving a particular problem, especially given the design of the network architecture is already very challenging.   In this paper, a hybrid evolutionary computation (EC) method is proposed to \textit{automatically} evolve both the architecture of deep CNNs and the shortcut connections. Three major contributions of this work are: Firstly, a new encoding strategy is proposed to encode a CNN, where the architecture and the shortcut connections are encoded separately; Secondly, a hybrid two-level EC method, which combines particle swarm optimisation and genetic algorithms, is developed to search for the optimal CNNs; Lastly, an adjustable learning rate is introduced for the fitness evaluations, which provides a better learning rate for the training process given a fixed number of epochs. The proposed algorithm is evaluated on three widely used benchmark datasets of image classification and compared with 12 peer Non-EC based competitors and one EC based competitor. The experimental results demonstrate that the proposed method outperforms all of the peer competitors in terms of classification accuracy.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.03893/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1903.03893/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1903.03893/full.md

---
Source: https://tomesphere.com/paper/1903.03893