A Directed-Evolution Method for Sparsification and Compression of Neural Networks with Application to Object Identification and Segmentation and considerations of optimal quantization using small number of bits
Luiz M Franca-Neto

TL;DR
This paper presents Directed-Evolution, a novel sparsification method for neural networks that assesses parameter relevance, combined with quantization techniques to significantly compress models while maintaining high accuracy, demonstrated on various datasets including COCO.
Contribution
The paper introduces the Directed-Evolution method for neural network sparsification that efficiently identifies parameters to zero out and combines it with optimal quantization strategies for compression.
Findings
Achieved 80% sparsification on YOLOv3 with minimal accuracy loss.
Demonstrated 40x to 80x compression ratios on large networks.
Maintained high object detection and segmentation performance after compression.
Abstract
This work introduces Directed-Evolution (DE) method for sparsification of neural networks, where the relevance of parameters to the network accuracy is directly assessed and the parameters that produce the least effect on accuracy when tentatively zeroed are indeed zeroed. DE method avoids a potentially combinatorial explosion of all possible candidate sets of parameters to be zeroed in large networks by mimicking evolution in the natural world. DE uses a distillation context [5]. In this context, the original network is the teacher and DE evolves the student neural network to the sparsification goal while maintaining minimal divergence between teacher and student. After the desired sparsification level is reached in each layer of the network by DE, a variety of quantization alternatives are used on the surviving parameters to find the lowest number of bits for their representation with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Image Processing Techniques and Applications · CCD and CMOS Imaging Sensors
MethodsBNB Customer Service Number +1-833-534-1729 · Average Pooling · Global Average Pooling · 1x1 Convolution · Batch Normalization · Convolution · Softmax · Residual Connection · k-Means Clustering · Logistic Regression
