Compositional Stochastic Average Gradient for Machine Learning and Related Applications
Tsung-Yu Hsieh, Yasser EL-Manzalawy, Yiwei Sun, Vasant Honavar

TL;DR
This paper introduces C-SAG, a novel stochastic gradient method for compositional finite-sum problems, demonstrating faster convergence and lower complexity than existing methods like C-SVRG through theoretical analysis and experiments.
Contribution
C-SAG extends the SAG method to compositional problems, offering improved convergence speed and lower query complexity compared to C-SVRG.
Findings
C-SAG achieves linear convergence for strongly convex objectives.
C-SAG has lower oracle query complexity per iteration than C-SVRG.
Experiments show C-SAG converges faster than full gradient and C-SVRG.
Abstract
Many machine learning, statistical inference, and portfolio optimization problems require minimization of a composition of expected value functions (CEVF). Of particular interest is the finite-sum versions of such compositional optimization problems (FS-CEVF). Compositional stochastic variance reduced gradient (C-SVRG) methods that combine stochastic compositional gradient descent (SCGD) and stochastic variance reduced gradient descent (SVRG) methods are the state-of-the-art methods for FS-CEVF problems. We introduce compositional stochastic average gradient descent (C-SAG) a novel extension of the stochastic average gradient method (SAG) to minimize composition of finite-sum functions. C-SAG, like SAG, estimates gradient by incorporating memory of previous gradient information. We present theoretical analyses of C-SAG which show that C-SAG, like SAG, and C-SVRG, achieves a linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
