Boosting Data-Driven Mirror Descent with Randomization, Equivariance,   and Acceleration

Hong Ye Tan; Subhadip Mukherjee; Junqi Tang; Carola-Bibiane; Sch\"onlieb

arXiv:2308.05045·math.OC·May 13, 2024·Trans. Mach. Learn. Res.

Boosting Data-Driven Mirror Descent with Randomization, Equivariance, and Acceleration

Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Carola-Bibiane, Sch\"onlieb

PDF

Open Access

TL;DR

This paper advances learned mirror descent (LMD) by introducing accelerated, stochastic, and equivariant variants, improving scalability, convergence, and efficiency for large-scale optimization in data science applications.

Contribution

It presents novel accelerated, stochastic, and equivariant extensions of LMD, enhancing its stability, scalability, and applicability to high-dimensional problems.

Findings

01

Accelerated LMD improves convergence rates.

02

Stochastic LMD reduces computational complexity.

03

Equivariant parameterization enhances efficiency in neural network training.

Abstract

Learning-to-optimize (L2O) is an emerging research area in large-scale optimization with applications in data science. Recently, researchers have proposed a novel L2O framework called learned mirror descent (LMD), based on the classical mirror descent (MD) algorithm with learnable mirror maps parameterized by input-convex neural networks. The LMD approach has been shown to significantly accelerate convex solvers while inheriting the convergence properties of the classical MD algorithm. This work proposes several practical extensions of the LMD algorithm, addressing its instability, scalability, and feasibility for high-dimensional problems. We first propose accelerated and stochastic variants of LMD, leveraging classical momentum-based acceleration and stochastic optimization techniques for improving the convergence rate and per-iteration computational complexity. Moreover, for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Neural Networks and Applications