Massive Dimensions Reduction and Hybridization with Meta-heuristics in Deep Learning
Rasa Khosrowshahli, Shahryar Rahnamayan, Beatrice Ombuki-Berman

TL;DR
This paper introduces Histogram-based Blocking Differential Evolution (HBDE), a hybrid optimization method that significantly reduces parameters in deep neural networks like ResNet-18, improving training efficiency and performance on CIFAR datasets.
Contribution
The study proposes a novel hybrid meta-heuristic algorithm combining gradient-based and gradient-free methods with blocking to optimize large-scale DNNs, reducing parameters drastically.
Findings
HBDE reduces ResNet-18 parameters from 11 million to 3,000.
HBDE outperforms baseline algorithms on CIFAR-10 and CIFAR-100.
The approach demonstrates effective large-scale optimization with lower computational costs.
Abstract
Deep learning is mainly based on utilizing gradient-based optimization for training Deep Neural Network (DNN) models. Although robust and widely used, gradient-based optimization algorithms are prone to getting stuck in local minima. In this modern deep learning era, the state-of-the-art DNN models have millions and billions of parameters, including weights and biases, making them huge-scale optimization problems in terms of search space. Tuning a huge number of parameters is a challenging task that causes vanishing/exploding gradients and overfitting; likewise, utilized loss functions do not exactly represent our targeted performance metrics. A practical solution to exploring large and complex solution space is meta-heuristic algorithms. Since DNNs exceed thousands and millions of parameters, even robust meta-heuristic algorithms, such as Differential Evolution, struggle to efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Face and Expression Recognition · Advanced Vision and Imaging
