Classifying real-world data with the $DD\alpha$-procedure
Pavlo Mozharovskyi, Karl Mosler, Tatjana Lange

TL;DR
The paper introduces the $DD ext{ extalpha}$-classifier, a robust, nonparametric method that transforms data into a depth space and classifies using the $ ext{ extalpha}$-procedure, demonstrating effectiveness across diverse real-world datasets.
Contribution
It presents a novel, robust classification approach combining data depth transformations with the $ ext{ extalpha}$-procedure, applicable to various real-world problems.
Findings
Tukey depth yields the lowest error rates and highest robustness.
The method effectively handles high-dimensional data with appropriate feature space extension.
The $DD ext{ extalpha}$-classifier is implemented as an R-package for practical use.
Abstract
The -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions' shape best and is most robust, `outsiders', that is data points having zero depth in all classes, need an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Advanced Statistical Process Monitoring
