Stochastic Momentum Methods for Non-smooth Non-Convex Finite-Sum Coupled Compositional Optimization
Xingyu Chen, Bokun Wang, Ming Yang, Qihang Lin, Tianbao Yang

TL;DR
This paper introduces stochastic momentum methods for non-smooth non-convex FCCO problems, achieving improved iteration complexity and practical effectiveness in machine learning applications.
Contribution
The paper proposes novel stochastic momentum algorithms with convergence guarantees for non-smooth FCCO, improving iteration complexity to $O(1/)$ and demonstrating practical benefits.
Findings
Achieved a new iteration complexity of $O(1/)$ for non-smooth FCCO.
Demonstrated effectiveness on three machine learning tasks.
Applied algorithms to inequality constrained non-convex optimization.
Abstract
Finite-sum Coupled Compositional Optimization (FCCO), characterized by its coupled compositional objective structure, emerges as an important optimization paradigm for addressing a wide range of machine learning problems. In this paper, we focus on a challenging class of non-convex non-smooth FCCO, where the outer functions are non-smooth weakly convex or convex and the inner functions are smooth or weakly convex. Existing state-of-the-art result face two key limitations: (1) a high iteration complexity of under the assumption that the stochastic inner functions are Lipschitz continuous in expectation; (2) reliance on vanilla SGD-type updates, which are not suitable for deep learning applications. Our main contributions are two fold: (i) We propose stochastic momentum methods tailored for non-smooth FCCO that come with provable convergence guarantees; (ii) We establish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Geochemistry and Geologic Mapping · Advanced Optimization Algorithms Research
