# Efficient Network Construction through Structural Plasticity

**Authors:** Xiaocong Du, Zheng Li, Yufei Ma, Yu Cao

arXiv: 1905.11530 · 2019-12-19

## TL;DR

This paper introduces CGaP, a training scheme inspired by biological structural plasticity, which starts with a small network, continuously grows important units, and prunes secondary ones, leading to more efficient neural networks.

## Contribution

It proposes a novel training approach that dynamically adjusts network structure during training, improving efficiency and adaptability over traditional fixed-structure methods.

## Key findings

- Reduces FLOPs by 63.3% on ResNet-110
- Decreases model size and energy consumption significantly
- Achieves 40.2% reduction in inference latency

## Abstract

Deep Neural Networks (DNNs) on hardware is facing excessive computation cost due to the massive number of parameters. A typical training pipeline to mitigate over-parameterization is to pre-define a DNN structure first with redundant learning units (filters and neurons) under the goal of high accuracy, then to prune redundant learning units after training with the purpose of efficient inference. We argue that it is sub-optimal to introduce redundancy into training for the purpose of reducing redundancy later in inference. Moreover, the fixed network structure further results in poor adaption to dynamic tasks, such as lifelong learning. In contrast, structural plasticity plays an indispensable role in mammalian brains to achieve compact and accurate learning. Throughout the lifetime, active connections are continuously created while those no longer important are degenerated. Inspired by such observation, we propose a training scheme, namely Continuous Growth and Pruning (CGaP), where we start the training from a small network seed, then literally execute continuous growth by adding important learning units and finally prune secondary ones for efficient inference. The inference model generated from CGaP is sparse in the structure, largely decreasing the inference power and latency when deployed on hardware platforms. With popular DNN structures on representative datasets, the efficacy of CGaP is benchmarked by both algorithm simulation and architectural modeling on Field-programmable Gate Arrays (FPGA). For example, CGaP decreases the FLOPs, model size, DRAM access energy and inference latency by 63.3%, 64.0%, 11.8% and 40.2%, respectively, for ResNet-110 on CIFAR-10.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.11530/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/1905.11530/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/1905.11530/full.md

---
Source: https://tomesphere.com/paper/1905.11530