Optimizing Convolutional Neural Network Architecture
Luis Balderas, Miguel Lastra, Jos\'e M. Ben\'itez

TL;DR
This paper introduces OCNNA, a novel CNN optimization method using pruning and knowledge distillation, which improves deployment efficiency on resource-limited devices while maintaining high accuracy.
Contribution
The paper presents a new CNN optimization technique, OCNNA, that outperforms over 20 existing algorithms in reducing complexity and energy consumption.
Findings
OCNNA achieves competitive accuracy with fewer parameters.
It outperforms 20+ existing CNN simplification algorithms.
Effective on datasets like CIFAR-10, CIFAR-100, and ImageNet.
Abstract
Convolutional Neural Networks (CNN) are widely used to face challenging tasks like speech recognition, natural language processing or computer vision. As CNN architectures get larger and more complex, their computational requirements increase, incurring significant energetic costs and challenging their deployment on resource-restricted devices. In this paper, we propose Optimizing Convolutional Neural Network Architecture (OCNNA), a novel CNN optimization and construction method based on pruning and knowledge distillation designed to establish the importance of convolutional layers. The proposal has been evaluated though a thorough empirical study including the best known datasets (CIFAR-10, CIFAR-100 and Imagenet) and CNN architectures (VGG-16, ResNet-50, DenseNet-40 and MobileNet), setting Accuracy Drop and Remaining Parameters Ratio as objective metrics to compare the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Human Pose and Action Recognition
