ConfusionFlow: A model-agnostic visualization for temporal analysis of classifier confusion
Andreas Hinterreiter, Peter Ruch, Holger Stitz, Martin Ennemoser,, J\"urgen Bernard, Hendrik Strobelt, Marc Streit

TL;DR
ConfusionFlow is an interactive visualization tool that enables temporal and comparative analysis of classifier confusion matrices, aiding data scientists in model evaluation and debugging across different models and datasets.
Contribution
It introduces ConfusionFlow, a model-agnostic visualization system that combines confusion matrices with temporal analysis for improved classifier performance assessment.
Findings
Effective in comparing models over time
Scalable to large datasets and complex models
Useful in active learning and neural network pruning contexts
Abstract
Classifiers are among the most widely used supervised machine learning algorithms. Many classification models exist, and choosing the right one for a given task is difficult. During model selection and debugging, data scientists need to assess classifiers' performances, evaluate their learning behavior over time, and compare different models. Typically, this analysis is based on single-number performance measures such as accuracy. A more detailed evaluation of classifiers is possible by inspecting class errors. The confusion matrix is an established way for visualizing these class errors, but it was not designed with temporal or comparative analysis in mind. More generally, established performance analysis systems do not allow a combined temporal and comparative analysis of class-level information. To address this issue, we propose ConfusionFlow, an interactive, comparative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
