Convergence of ease-controlled Random Reshuffling gradient Algorithms under Lipschitz smoothness
Ruggiero Seccia, Corrado Coppola, Giampaolo Liuzzi, Laura Palagi

TL;DR
This paper introduces ease-controlled modifications of Incremental Gradient and Random Reshuffling algorithms that adaptively manage stepsizes and convergence criteria, improving efficiency in large-scale non-convex optimization.
Contribution
It proposes novel ease-controlled IG/RR schemes with watchdog and linesearch mechanisms that ensure convergence without pre-set stepsize rules, validated through theoretical proofs and extensive experiments.
Findings
Algorithms achieve convergence under Lipschitz smoothness.
Comparable computational effort to existing online methods.
Potential for faster objective function decrease.
Abstract
In this work, we consider minimizing the average of a very large number of smooth and possibly non-convex functions, and we focus on two widely used minibatch frameworks to tackle this optimization problem: Incremental Gradient (IG) and Random Reshuffling (RR). We define ease-controlled modifications of the IG/RR schemes, which require a light additional computational effort {but} can be proved to converge under {weak} and standard assumptions. In particular, we define two algorithmic schemes in which the IG/RR iteration is controlled by using a watchdog rule and a derivative-free linesearch that activates only sporadically to guarantee convergence. The two schemes differ in the watchdog and the linesearch, which are performed using either a monotonic or a non-monotonic rule. The two schemes also allow controlling the updating of the stepsize used in the main IG/RR iteration, avoiding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques
