Building Efficient Lightweight CNN Models
Nathan Isong

TL;DR
This paper presents a novel methodology for constructing lightweight CNNs that maintain high accuracy while significantly reducing computational and memory requirements, suitable for resource-constrained environments.
Contribution
It introduces a two-stage training approach combining dual-input-output models and transfer learning with progressive unfreezing to enhance lightweight CNN performance.
Findings
Achieved 99% accuracy on MNIST with 14,862 parameters
Reached 89% accuracy on Fashion MNIST with low model size
Attained 65% accuracy on CIFAR-10 with fewer than 20,000 parameters
Abstract
Convolutional Neural Networks (CNNs) are pivotal in image classification tasks due to their robust feature extraction capabilities. However, their high computational and memory requirements pose challenges for deployment in resource-constrained environments. This paper introduces a methodology to construct lightweight CNNs while maintaining competitive accuracy. The approach integrates two stages of training; dual-input-output model and transfer learning with progressive unfreezing. The dual-input-output model train on original and augmented datasets, enhancing robustness. Progressive unfreezing is applied to the unified model to optimize pre-learned features during fine-tuning, enabling faster convergence and improved model accuracy. The methodology was evaluated on three benchmark datasets; handwritten digit MNIST, fashion MNIST, and CIFAR-10. The proposed model achieved a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Neural Networks and Applications · Advanced Neural Network Applications
