Incremental Gauss-Newton Descent for Machine Learning
Mikalai Korbit, Mario Zanon

TL;DR
This paper introduces Incremental Gauss-Newton Descent (IGND), a novel optimization method that incorporates approximate second-order information to improve convergence speed and robustness in machine learning tasks, with comparable computational cost to SGD.
Contribution
The paper proposes IGND, a new incremental optimization algorithm leveraging approximate second-order information, which enhances convergence and robustness over standard SGD.
Findings
IGND converges faster than SGD on certain problems.
IGND can be accelerated and tuned more easily.
IGND performs well in supervised and reinforcement learning tasks.
Abstract
Stochastic Gradient Descent (SGD) is a popular technique used to solve problems arising in machine learning. While very effective, SGD also has some weaknesses and various modifications of the basic algorithm have been proposed in order to at least partially tackle them, mostly yielding accelerated versions of SGD. Filling a gap in the literature, we present a modification of the SGD algorithm exploiting approximate second-order information based on the Gauss-Newton approach. The new method, which we call Incremental Gauss-Newton Descent (IGND), has essentially the same computational burden as standard SGD, appears to converge faster on certain classes of problems, and can also be accelerated. The key intuition making it possible to implement IGND efficiently is that, in the incremental case, approximate second-order information can be condensed into a scalar value that acts as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Neural Networks and Applications
MethodsStochastic Gradient Descent
