Proper losses regret at least 1/2-order
Han Bao, Asuka Takatsu

TL;DR
This paper investigates the convergence rates of proper losses in machine learning, proving that the p-norm error cannot decrease faster than the square root of the surrogate regret for a broad class of losses, highlighting the optimality of strongly proper losses.
Contribution
It establishes a fundamental lower bound of 1/2-order convergence for p-norm errors in proper loss frameworks and confirms the optimality of strongly proper losses.
Findings
Surrogate regret bounds are linked to strict properness of losses.
The p-norm error cannot converge faster than the 1/2-order of surrogate regret.
Strongly proper losses achieve the optimal convergence rate.
Abstract
A fundamental challenge in machine learning is the choice of a loss as it characterizes our learning task, is minimized in the training phase, and serves as an evaluation criterion for estimators. Proper losses are commonly chosen, ensuring minimizers of the full risk match the true probability vector. Estimators induced from a proper loss are widely used to construct forecasters for downstream tasks such as classification and ranking. In this procedure, how does the forecaster based on the obtained estimator perform well under a given downstream task? This question is substantially relevant to the behavior of the -norm between the estimated and true probability vectors when the estimator is updated. In the proper loss framework, the suboptimality of the estimated probability vector from the true probability vector is measured by a surrogate regret. First, we analyze a surrogate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Forecasting Techniques and Applications · Advanced Bandit Algorithms Research
