Weight Evolution: Improving Deep Neural Networks Training through Evolving Inferior Weight Values
Zhenquan Lin, Kailing Guo, Xiaofen Xing, Xiangmin Xu

TL;DR
This paper introduces weight evolution, a novel method inspired by genetic algorithms, to improve neural network training by selectively reactivating and updating unimportant weights through crossover with important weights, enhancing performance especially in lightweight networks.
Contribution
The paper proposes a new weight reactivation method called weight evolution that combines global and local selection with crossover strategies, outperforming existing methods in neural network training.
Findings
WE outperforms existing reactivation methods.
WE improves lightweight network performance.
The method is compatible with various architectures.
Abstract
To obtain good performance, convolutional neural networks are usually over-parameterized. This phenomenon has stimulated two interesting topics: pruning the unimportant weights for compression and reactivating the unimportant weights to make full use of network capability. However, current weight reactivation methods usually reactivate the entire filters, which may not be precise enough. Looking back in history, the prosperity of filter pruning is mainly due to its friendliness to hardware implementation, but pruning at a finer structure level, i.e., weight elements, usually leads to better network performance. We study the problem of weight element reactivation in this paper. Motivated by evolution, we select the unimportant filters and update their unimportant elements by combining them with the important elements of important filters, just like gene crossover to produce better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsPruning
