Estimating the Accuracies of Multiple Classifiers Without Labeled Data

Ariel Jaffe; Boaz Nadler; Yuval Kluger

arXiv:1407.7644·stat.ML·October 31, 2014·28 cites

Estimating the Accuracies of Multiple Classifiers Without Labeled Data

Ariel Jaffe, Boaz Nadler, Yuval Kluger

PDF

Open Access

TL;DR

This paper introduces efficient spectral algorithms to estimate classifier accuracies and build improved unsupervised ensemble classifiers using only unlabeled predictions, under independence assumptions.

Contribution

It presents novel spectral methods for accuracy estimation and ensemble construction without labeled data, with proven consistency and asymptotic analysis.

Findings

01

Algorithms are computationally efficient and scalable.

02

Methods achieve competitive accuracy in experiments.

03

Theoretical guarantees under classifier independence.

Abstract

In various situations one is given only the predictions of multiple classifiers over a large unlabeled test data. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the reliability of these different classifiers, is it possible to consistently and computationally efficiently estimate their accuracies? Furthermore, also in a completely unsupervised manner, can one construct a more accurate unsupervised ensemble classifier? In this paper, focusing on the binary case, we present simple, computationally efficient algorithms to solve these questions. Furthermore, under standard classifier independence assumptions, we prove our methods are consistent and study their asymptotic error. Our approach is spectral, based on the fact that the off-diagonal entries of the classifiers' covariance matrix and 3-d tensor are rank-one. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Imbalanced Data Classification Techniques