Exp-Concavity of Proper Composite Losses
Parameswaran Kamalaruban, Robert C. Williamson, Xinhua Zhang

TL;DR
This paper characterizes the exp-concavity of proper composite losses and demonstrates how to transform any $eta$-mixable binary proper loss into a $eta$-exp-concave loss, bridging two key concepts in online prediction.
Contribution
It provides a complete characterization of exp-concavity for proper composite losses and shows how to convert $eta$-mixable losses into $eta$-exp-concave ones, enhancing theoretical understanding.
Findings
Complete characterization of exp-concavity for proper composite losses
Transformation method for binary $eta$-mixable proper losses into $eta$-exp-concave losses
Approximate transformation approach for multi-class losses
Abstract
The goal of online prediction with expert advice is to find a decision strategy which will perform almost as well as the best expert in a given pool of experts, on any sequence of outcomes. This problem has been widely studied and and regret bounds can be achieved for convex losses (\cite{zinkevich2003online}) and strictly convex losses with bounded first and second derivatives (\cite{hazan2007logarithmic}) respectively. In special cases like the Aggregating Algorithm (\cite{vovk1995game}) with mixable losses and the Weighted Average Algorithm (\cite{kivinen1999averaging}) with exp-concave losses, it is possible to achieve regret bounds. \cite{van2012exp} has argued that mixability and exp-concavity are roughly equivalent under certain conditions. Thus by understanding the underlying relationship between these two notions we can gain the best of both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
