Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle
Shaocong Ma, Yi Zhou

TL;DR
This paper investigates how model incoherence influences the convergence behavior of SGD with random reshuffle, revealing that certain incoherence measures can improve convergence speed and accuracy.
Contribution
It introduces the concept of model incoherence to analyze its effect on SGD convergence, providing theoretical insights into the benefits of random reshuffle.
Findings
Model incoherence affects convergence error and rate.
SGD with random reshuffle converges faster under certain incoherence conditions.
Results justify the practical superiority of random reshuffle in SGD.
Abstract
Although SGD with random reshuffle has been widely-used in machine learning applications, there is a limited understanding of how model characteristics affect the convergence of the algorithm. In this work, we introduce model incoherence to characterize the diversity of model characteristics and study its impact on convergence of SGD with random reshuffle under weak strong convexity. Specifically, minimizer incoherence measures the discrepancy between the global minimizers of a sample loss and those of the total loss and affects the convergence error of SGD with random reshuffle. In particular, we show that the variable sequence generated by SGD with random reshuffle converges to a certain global minimizer of the total loss under full minimizer coherence. The other curvature incoherence measures the quality of condition numbers of the sample losses and determines the convergence rate of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Statistical Methods and Inference
MethodsStochastic Gradient Descent
