Online Self-Concordant and Relatively Smooth Minimization, With Applications to Online Portfolio Selection and Learning Quantum States
Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

TL;DR
This paper develops regret bounds for online convex optimization with self-concordant barriers, improving existing results and applying to online portfolio selection and quantum state learning.
Contribution
It introduces a unified analysis of online mirror descent with self-concordant functions, leading to improved regret bounds in portfolio selection and quantum learning.
Findings
Improved regret bound for exponentiated gradient: O(T^{2/3} d^{1/3})
Optimal regret for mirror descent with logarithmic barrier: O(\,sqrt{T d})
Shorter per-iteration time for quantum state learning algorithms
Abstract
Consider an online convex optimization problem where the loss functions are self-concordant barriers, smooth relative to a convex function , and possibly non-Lipschitz. We analyze the regret of online mirror descent with . Then, based on the result, we prove the following in a unified manner. Denote by the time horizon and the parameter dimension. 1. For online portfolio selection, the regret of , a variant of exponentiated gradient due to Helmbold et al., is when . This improves on the original regret bound for . 2. For online portfolio selection, the regret of online mirror descent with the logarithmic barrier is . The regret bound is the same as that of Soft-Bayes due to Orseau et al. up to logarithmic terms. 3. For…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Quantum Computing Algorithms and Architecture · Sparse and Compressive Sensing Techniques
