Fundamental Novel Consistency Theory: $H$-Consistency Bounds
Yutao Zhong

TL;DR
This paper introduces $H$-consistency bounds that provide stronger, more informative guarantees on the estimation error between surrogate and target losses in machine learning, applicable to various classification settings and loss functions.
Contribution
It develops a comprehensive framework for deriving $H$-consistency bounds for multiple loss functions, including new bounds for multi-class and comp-sum losses, advancing theoretical understanding of surrogate-target loss relationships.
Findings
Established tight distribution-dependent and -independent bounds for binary classification.
Derived the first $H$-consistency bounds for multi-class max, sum, and constrained losses.
Proposed smooth adversarial variants of losses for robust learning.
Abstract
In machine learning, the loss functions optimized during training often differ from the target loss that defines task performance due to computational intractability or lack of differentiability. We present an in-depth study of the target loss estimation error relative to the surrogate loss estimation error. Our analysis leads to -consistency bounds, which are guarantees accounting for the hypothesis set . These bounds offer stronger guarantees than Bayes-consistency or -calibration and are more informative than excess error bounds. We begin with binary classification, establishing tight distribution-dependent and -independent bounds. We provide explicit bounds for convex surrogates (including linear models and neural networks) and analyze the adversarial setting for surrogates like -margin and sigmoid loss. Extending to multi-class classification, we present the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Ethics and Social Impacts of AI
