Multiclass Classification, Information, Divergence, and Surrogate Risk
John C. Duchi, Khashayar Khosravi, Feng Ruan

TL;DR
This paper unifies various information measures, divergence concepts, and loss functions in multi-class classification, extending binary results to more general settings and characterizing optimal loss functions for joint classification and data representation.
Contribution
It introduces a comprehensive framework linking divergences, information measures, and losses for multiclass problems, extending existing binary results to multiple classes and joint optimization of classifiers and data representations.
Findings
Established equivalence between divergences, information measures, and losses in multiclass settings.
Characterized convex losses that are consistent for joint classification and data representation.
Extended binary classification calibration results to multiclass scenarios.
Abstract
We provide a unifying view of statistical information measures, multi-way Bayesian hypothesis testing, loss functions for multi-class classification problems, and multi-distribution -divergences, elaborating equivalence results between all of these objects, and extending existing results for binary outcome spaces to more general ones. We consider a generalization of -divergences to multiple distributions, and we provide a constructive equivalence between divergences, statistical information (in the sense of DeGroot), and losses for multiclass classification. A major application of our results is in multi-class classification problems in which we must both infer a discriminant function ---for making predictions on a label from datum ---and a data representation (or, in the setting of a hypothesis testing problem, an experimental design), represented as a quantizer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Statistical Process Monitoring · Advanced Statistical Methods and Models
