Stochastic Nested Compositional Bi-level Optimization for Robust Feature Learning
Xuxing Chen, Krishnakumar Balasubramanian, Saeed Ghadimi

TL;DR
This paper introduces stochastic algorithms for nested compositional bi-level optimization problems, achieving efficient solutions without matrix inversions, and demonstrates their application in robust feature learning under covariate shift.
Contribution
The paper develops a novel stochastic approximation algorithm for complex nested bi-level problems, addressing multiple sources of bias and providing theoretical convergence guarantees.
Findings
Achieves $ ilde{O}_T(1/epsilon^{2})$ oracle complexity for $epsilon$-stationary solutions.
Handles three sources of bias in stochastic gradients due to composition, bi-level structure, and Neumann series approximation.
Demonstrates effectiveness in robust feature learning for deep neural networks under covariate shift.
Abstract
We develop and analyze stochastic approximation algorithms for solving nested compositional bi-level optimization problems. These problems involve a nested composition of potentially non-convex smooth functions in the upper-level, and a smooth and strongly convex function in the lower-level. Our proposed algorithm does not rely on matrix inversions or mini-batches and can achieve an -stationary solution with an oracle complexity of approximately , assuming the availability of stochastic first-order oracles for the individual functions in the composition and the lower-level, which are unbiased and have bounded moments. Here, hides polylog factors and constants that depend on . The key challenge we address in establishing this result relates to handling three distinct sources of bias in the stochastic gradients. The first source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques
