Confusion Matrix Stability Bounds for Multiclass Classification
Pierre Machart (LIF, LSIS), Liva Ralaivola (LIF)

TL;DR
This paper introduces new theoretical bounds on the generalization of multiclass classifiers by analyzing the confusion matrix's stability, proposing a novel evaluation measure, and demonstrating its application to SVMs.
Contribution
It is the first work to analyze the confusion matrix from a theoretical perspective and derive generalization bounds using matrix concentration inequalities.
Findings
Derived bounds on confusion matrix size using stability analysis.
Showed two SVM procedures are confusion-friendly.
Introduced a new evaluation measure for multiclass classifiers.
Abstract
In this paper, we provide new theoretical results on the generalization properties of learning algorithms for multiclass classification problems. The originality of our work is that we propose to use the confusion matrix of a classifier as a measure of its quality; our contribution is in the line of work which attempts to set up and study the statistical properties of new evaluation measures such as, e.g. ROC curves. In the confusion-based learning framework we propose, we claim that a targetted objective is to minimize the size of the confusion matrix C, measured through its operator norm ||C||. We derive generalization bounds on the (size of the) confusion matrix in an extended framework of uniform stability, adapted to the case of matrix valued loss. Pivotal to our study is a very recent matrix concentration inequality that generalizes McDiarmid's inequality. As an illustration of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Imbalanced Data Classification Techniques
