A Geometry-Aware Efficient Algorithm for Compositional Entropic Risk Minimization
Xiyuan Wei, Linli Zhou, Bokun Wang, Chih-Jen Lin, Tianbao Yang

TL;DR
This paper introduces SCENT, a geometry-aware stochastic algorithm for efficient optimization of compositional entropic risk minimization problems, addressing limitations of existing methods with theoretical guarantees and empirical success.
Contribution
The paper proposes a novel geometry-aware stochastic algorithm, SCENT, with a stochastic proximal mirror descent update for dual variables, achieving improved convergence and stability.
Findings
Achieves an $O(1/ oot 2 T)$ convergence rate for convex problems.
Demonstrates superior empirical performance on classification and robustness tasks.
Outperforms existing baselines in multiple machine learning applications.
Abstract
This paper studies optimization for a family of problems termed , in which each data's loss is formulated as a Log-Expectation-Exponential (Log-E-Exp) function. The Log-E-Exp formulation serves as an abstraction of the Log-Sum-Exponential (LogSumExp) function when the explicit summation inside the logarithm is taken over a gigantic number of items and is therefore expensive to evaluate. While entropic risk objectives of this form arise in many machine learning problems, existing optimization algorithms suffer from several fundamental limitations including non-convergence, numerical instability, and slow convergence rates. To address these limitations, we propose a geometry-aware stochastic algorithm, termed , for the dual formulation of entropic risk minimization cast as a min--min optimization problem. The key to our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Risk and Portfolio Optimization · Gaussian Processes and Bayesian Inference
