BGADAM: Boosting based Genetic-Evolutionary ADAM for Neural Network   Optimization

Jiyang Bai; Yuxiang Ren; Jiawei Zhang

arXiv:1908.08015·cs.NE·May 5, 2021

BGADAM: Boosting based Genetic-Evolutionary ADAM for Neural Network Optimization

Jiyang Bai, Yuxiang Ren, Jiawei Zhang

PDF

Open Access

TL;DR

BGADAM is a novel optimization algorithm that combines boosting strategies with genetic algorithms and ADAM to improve neural network training, helping models escape local optima and achieve better convergence.

Contribution

The paper introduces BGADAM, integrating boosting with genetic algorithms and ADAM, providing a new approach for neural network optimization with theoretical and empirical validation.

Findings

01

Boosting enhances genetic algorithm effectiveness in neural network training.

02

BGADAM helps models escape local optima and find better solutions.

03

Empirical results confirm improved convergence and performance.

Abstract

For various optimization methods, gradient descent-based algorithms can achieve outstanding performance and have been widely used in various tasks. Among those commonly used algorithms, ADAM owns many advantages such as fast convergence with both the momentum term and the adaptive learning rate. However, since the loss functions of most deep neural networks are non-convex, ADAM also shares the drawback of getting stuck in local optima easily. To resolve such a problem, the idea of combining genetic algorithm with base learners is introduced to rediscover the best solutions. Nonetheless, from our analysis, the idea of combining genetic algorithm with a batch of base learners still has its shortcomings. The effectiveness of genetic algorithm can hardly be guaranteed if the unit models converge to close or the same solutions. To resolve this problem and further maximize the advantages of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and ELM · Stochastic Gradient Optimization Techniques

MethodsAdam