Binary Search and First Order Gradient Based Method for Stochastic Optimization
Vijay Pandey

TL;DR
This paper introduces Binary Search Gradient Optimization (BSG), a novel stochastic method combining binary search with first-order gradients to improve convergence and generalization in neural network training.
Contribution
The paper proposes BSG, a new optimization algorithm that efficiently identifies convex regions in non-convex surfaces using binary search, enhancing training stability and performance.
Findings
BSG outperforms traditional gradient methods on MNIST, IMDB, and CIFAR10.
BSG effectively handles vanishing and exploding gradients in deep networks.
The method generalizes better on unseen data.
Abstract
In this paper, we present a novel stochastic optimization method, which uses the binary search technique with first order gradient based optimization method, called Binary Search Gradient Optimization (BSG) or BiGrad. In this optimization setup, a non-convex surface is treated as a set of convex surfaces. In BSG, at first, a region is defined, assuming region is convex. If region is not convex, then the algorithm leaves the region very fast and defines a new one, otherwise, it tries to converge at the optimal point of the region. In BSG, core purpose of binary search is to decide, whether region is convex or not in logarithmic time, whereas, first order gradient based method is primarily applied, to define a new region. In this paper, Adam is used as a first order gradient based method, nevertheless, other methods of this class may also be considered. In deep neural network setup, it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Neural Network Applications
MethodsLogistic Regression · Adam
