Classification at the Accuracy Limit -- Facing the Problem of Data Ambiguity
Claus Metzner, Achim Schilling, Maximilian Traxdorf, Konstantin, Tziridis, Holger Schulze, Patrick Krauss

TL;DR
This paper investigates the fundamental limits of classification accuracy imposed by data ambiguity, showing that different classifiers reach this universal limit regardless of their principles, and that data transformations do not improve accuracy beyond this boundary.
Contribution
The authors derive the theoretical accuracy limit caused by data overlap and demonstrate that diverse classifiers perform at this limit, unaffected by non-linear data transformations.
Findings
All classifiers tested reach the accuracy limit set by data ambiguity.
Non-reversible non-linear transformations do not improve classification accuracy.
Categories remain distinguishable after unsupervised dimensionality reduction.
Abstract
Data classification, the process of analyzing data and organizing it into categories, is a fundamental computing problem of natural and artificial information processing systems. Ideally, the performance of classifier models would be evaluated using unambiguous data sets, where the 'correct' assignment of category labels to the input data vectors is unequivocal. In real-world problems, however, a significant fraction of actually occurring data vectors will be located in a boundary zone between or outside of all categories, so that perfect classification cannot even in principle be achieved. We derive the theoretical limit for classification accuracy that arises from the overlap of data categories. By using a surrogate data generation model with adjustable statistical properties, we show that sufficiently powerful classifiers based on completely different principles, such as perceptrons…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces · Blind Source Separation Techniques
