Boosting Data-Driven Mirror Descent with Randomization, Equivariance, and Acceleration
Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Carola-Bibiane, Sch\"onlieb

TL;DR
This paper advances learned mirror descent (LMD) by introducing accelerated, stochastic, and equivariant variants, improving scalability, convergence, and efficiency for large-scale optimization in data science applications.
Contribution
It presents novel accelerated, stochastic, and equivariant extensions of LMD, enhancing its stability, scalability, and applicability to high-dimensional problems.
Findings
Accelerated LMD improves convergence rates.
Stochastic LMD reduces computational complexity.
Equivariant parameterization enhances efficiency in neural network training.
Abstract
Learning-to-optimize (L2O) is an emerging research area in large-scale optimization with applications in data science. Recently, researchers have proposed a novel L2O framework called learned mirror descent (LMD), based on the classical mirror descent (MD) algorithm with learnable mirror maps parameterized by input-convex neural networks. The LMD approach has been shown to significantly accelerate convex solvers while inheriting the convergence properties of the classical MD algorithm. This work proposes several practical extensions of the LMD algorithm, addressing its instability, scalability, and feasibility for high-dimensional problems. We first propose accelerated and stochastic variants of LMD, leveraging classical momentum-based acceleration and stochastic optimization techniques for improving the convergence rate and per-iteration computational complexity. Moreover, for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Neural Networks and Applications
