Particle Swarm Optimisation for Evolving Deep Neural Networks for Image Classification by Evolving and Stacking Transferable Blocks
Bin Wang, Bing Xue, Mengjie Zhang

TL;DR
This paper introduces EPSOCNN, an efficient particle swarm optimization method that evolves CNN architectures by focusing on transferable blocks, significantly reducing computational costs while maintaining high classification accuracy on CIFAR-10.
Contribution
The paper presents a novel, cost-effective NAS approach using particle swarm optimization that evolves transferable CNN blocks and stacks them for image classification.
Findings
EPSOCNN outperforms 13 peer methods in accuracy, parameters, and computational cost.
Reduces search space by focusing on a single block, lowering computational requirements.
Maintains competitive accuracy by stacking evolved blocks for the full dataset.
Abstract
Deep Convolutional Neural Networks (CNNs) have been widely used in image classification tasks, but the process of designing CNN architectures is very complex, so Neural Architecture Search (NAS), automatically searching for optimal CNN architectures, has attracted more and more research interests. However, the computational cost of NAS is often too high to apply NAS on real-life applications. In this paper, an efficient particle swarm optimisation method named EPSOCNN is proposed to evolve CNN architectures inspired by the idea of transfer learning. EPSOCNN successfully reduces the computation cost by minimising the search space to a single block and utilising a small subset of the training set to evaluate CNNs during evolutionary process. Meanwhile, EPSOCNN also keeps very competitive classification accuracy by stacking the evolved block multiple times to fit the whole dataset. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Domain Adaptation and Few-Shot Learning
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
