Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning Models
Zhenge Zhao, Panpan Xu, Carlos Scheidegger, Liu Ren

TL;DR
This paper introduces a human-in-the-loop method using active learning to extract interpretable visual concepts from deep neural networks, aiding model understanding and improving performance through targeted data augmentation.
Contribution
The paper presents a novel interactive system, ConceptExtract, that efficiently combines human feedback with active learning to identify meaningful concepts affecting model decisions.
Findings
Active learning accurately extracts visual concepts
Identified concepts that negatively impact performance
Data augmentation based on concepts improves model accuracy
Abstract
The interpretation of deep neural networks (DNNs) has become a key topic as more and more people apply them to solve various problems and making critical decisions. Concept-based explanations have recently become a popular approach for post-hoc interpretation of DNNs. However, identifying human-understandable visual concepts that affect model decisions is a challenging task that is not easily addressed with automatic approaches. We present a novel human-in-the-loop approach to generate user-defined concepts for model interpretation and diagnostics. Central to our proposal is the use of active learning, where human knowledge and feedback are combined to train a concept extractor with very little human labeling effort. We integrate this process into an interactive system, ConceptExtract. Through two case studies, we show how our approach helps analyze model behavior and extract…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
