Validation of Soft Classification Models using Partial Class Memberships: An Extended Concept of Sensitivity & Co. applied to the Grading of Astrocytoma Tissues
Claudia Beleites, Reiner Salzer, and Valter Sergo

TL;DR
This paper extends performance evaluation metrics for soft classification models using partial class memberships, allowing for more accurate assessment of classifiers in uncertain and borderline cases, demonstrated through astrocytoma tissue grading.
Contribution
It introduces new performance measures for soft classification that account for partial memberships and applies them to medical tissue grading, implemented in an R package.
Findings
Extended sensitivity and performance measures for soft classification.
Application to astrocytoma tissue grading with borderline cases.
Implementation available in the 'softclassval' R package.
Abstract
We use partial class memberships in soft classification to model uncertain labelling and mixtures of classes. Partial class memberships are not restricted to predictions, but may also occur in reference labels (ground truth, gold standard diagnosis) for training and validation data. Classifier performance is usually expressed as fractions of the confusion matrix, such as sensitivity, specificity, negative and positive predictive values. We extend this concept to soft classification and discuss the bias and variance properties of the extended performance measures. Ambiguity in reference labels translates to differences between best-case, expected and worst-case performance. We show a second set of measures comparing expected and ideal performance which is closely related to regression performance, namely the root mean squared error RMSE and the mean absolute error MAE. All…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
