Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning   Models

Zhenge Zhao; Panpan Xu; Carlos Scheidegger; Liu Ren

arXiv:2108.03738·cs.HC·August 10, 2021·6 cites

Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning Models

Zhenge Zhao, Panpan Xu, Carlos Scheidegger, Liu Ren

PDF

Open Access

TL;DR

This paper introduces a human-in-the-loop method using active learning to extract interpretable visual concepts from deep neural networks, aiding model understanding and improving performance through targeted data augmentation.

Contribution

The paper presents a novel interactive system, ConceptExtract, that efficiently combines human feedback with active learning to identify meaningful concepts affecting model decisions.

Findings

01

Active learning accurately extracts visual concepts

02

Identified concepts that negatively impact performance

03

Data augmentation based on concepts improves model accuracy

Abstract

The interpretation of deep neural networks (DNNs) has become a key topic as more and more people apply them to solve various problems and making critical decisions. Concept-based explanations have recently become a popular approach for post-hoc interpretation of DNNs. However, identifying human-understandable visual concepts that affect model decisions is a challenging task that is not easily addressed with automatic approaches. We present a novel human-in-the-loop approach to generate user-defined concepts for model interpretation and diagnostics. Central to our proposal is the use of active learning, where human knowledge and feedback are combined to train a concept extractor with very little human labeling effort. We integrate this process into an interactive system, ConceptExtract. Through two case studies, we show how our approach helps analyze model behavior and extract…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications