ConfusionFlow: A model-agnostic visualization for temporal analysis of   classifier confusion

Andreas Hinterreiter; Peter Ruch; Holger Stitz; Martin Ennemoser,; J\"urgen Bernard; Hendrik Strobelt; Marc Streit

arXiv:1910.00969·cs.LG·October 15, 2020

ConfusionFlow: A model-agnostic visualization for temporal analysis of classifier confusion

Andreas Hinterreiter, Peter Ruch, Holger Stitz, Martin Ennemoser,, J\"urgen Bernard, Hendrik Strobelt, Marc Streit

PDF

TL;DR

ConfusionFlow is an interactive visualization tool that enables temporal and comparative analysis of classifier confusion matrices, aiding data scientists in model evaluation and debugging across different models and datasets.

Contribution

It introduces ConfusionFlow, a model-agnostic visualization system that combines confusion matrices with temporal analysis for improved classifier performance assessment.

Findings

01

Effective in comparing models over time

02

Scalable to large datasets and complex models

03

Useful in active learning and neural network pruning contexts

Abstract

Classifiers are among the most widely used supervised machine learning algorithms. Many classification models exist, and choosing the right one for a given task is difficult. During model selection and debugging, data scientists need to assess classifiers' performances, evaluate their learning behavior over time, and compare different models. Typically, this analysis is based on single-number performance measures such as accuracy. A more detailed evaluation of classifiers is possible by inspecting class errors. The confusion matrix is an established way for visualizing these class errors, but it was not designed with temporal or comparative analysis in mind. More generally, established performance analysis systems do not allow a combined temporal and comparative analysis of class-level information. To address this issue, we propose ConfusionFlow, an interactive, comparative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest