BGADAM: Boosting based Genetic-Evolutionary ADAM for Neural Network Optimization
Jiyang Bai, Yuxiang Ren, Jiawei Zhang

TL;DR
BGADAM is a novel optimization algorithm that combines boosting strategies with genetic algorithms and ADAM to improve neural network training, helping models escape local optima and achieve better convergence.
Contribution
The paper introduces BGADAM, integrating boosting with genetic algorithms and ADAM, providing a new approach for neural network optimization with theoretical and empirical validation.
Findings
Boosting enhances genetic algorithm effectiveness in neural network training.
BGADAM helps models escape local optima and find better solutions.
Empirical results confirm improved convergence and performance.
Abstract
For various optimization methods, gradient descent-based algorithms can achieve outstanding performance and have been widely used in various tasks. Among those commonly used algorithms, ADAM owns many advantages such as fast convergence with both the momentum term and the adaptive learning rate. However, since the loss functions of most deep neural networks are non-convex, ADAM also shares the drawback of getting stuck in local optima easily. To resolve such a problem, the idea of combining genetic algorithm with base learners is introduced to rediscover the best solutions. Nonetheless, from our analysis, the idea of combining genetic algorithm with a batch of base learners still has its shortcomings. The effectiveness of genetic algorithm can hardly be guaranteed if the unit models converge to close or the same solutions. To resolve this problem and further maximize the advantages of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Stochastic Gradient Optimization Techniques
MethodsAdam
