Silhouettes and quasi residual plots for neural nets and tree-based   classifiers

Jakob Raymaekers; Peter J. Rousseeuw

arXiv:2106.08814·stat.ML·July 8, 2024·1 cites

Silhouettes and quasi residual plots for neural nets and tree-based classifiers

Jakob Raymaekers, Peter J. Rousseeuw

PDF

Open Access

TL;DR

This paper introduces visualization techniques like silhouette and quasi residual plots to analyze and interpret neural network and tree-based classifiers by examining classification confidence and potential label bias.

Contribution

It proposes new visualization methods to analyze classifier decisions and label bias using PAC-based silhouette plots and residual plots, enhancing interpretability.

Findings

01

Silhouette plots help compare classification quality.

02

Quasi residual plots reveal data feature influences.

03

Visualizations applied to diverse benchmark datasets.

Abstract

Classification by neural nets and by tree-based methods are powerful tools of machine learning. There exist interesting visualizations of the inner workings of these and other classifiers. Here we pursue a different goal, which is to visualize the cases being classified, either in training data or in test data. An important aspect is whether a case has been classified to its given class (label) or whether the classifier wants to assign it to different class. This is reflected in the (conditional and posterior) probability of the alternative class (PAC). A high PAC indicates label bias, i.e. the possibility that the case was mislabeled. The PAC is used to construct a silhouette plot which is similar in spirit to the silhouette plot for cluster analysis (Rousseeuw, 1987). The average silhouette width can be used to compare different classifications of the same dataset. We will also draw…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Neural Networks and Applications