TRSVR: An Adaptive Stochastic Trust-Region Method with Variance Reduction
Yuchen Fang, Xinshou Zheng, Javad Lavaei

TL;DR
This paper introduces TRSVR, a stochastic trust-region method with variance reduction for nonconvex optimization, achieving faster convergence without function evaluations, and demonstrating superior performance over SGD and Adam in experiments.
Contribution
The paper presents a novel stochastic trust-region algorithm that incorporates variance reduction, enabling efficient convergence in nonconvex optimization without function value computations.
Findings
Converges to a first-order stationary point under mild assumptions.
Achieves iteration and sample complexity bounds comparable to SVRG-based methods.
Outperforms SGD and Adam in machine learning tasks.
Abstract
We propose a stochastic trust-region method for unconstrained nonconvex optimization that incorporates stochastic variance-reduced gradients (SVRG) to accelerate convergence. Unlike classical trust-region methods, the proposed algorithm relies solely on stochastic gradient information and does not require function value evaluations. The trust-region radius is adaptively adjusted based on a radius-control parameter and the stochastic gradient estimate. Under mild assumptions, we establish that the algorithm converges in expectation to a first-order stationary point. Moreover, the method achieves iteration and sample complexity bounds that match those of SVRG-based first-order methods, while allowing stochastic and potentially gradient-dependent second-order information. Extensive numerical experiments demonstrate that incorporating SVRG accelerates convergence, and that the use of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Sparse and Compressive Sensing Techniques
