TCNL: Transparent and Controllable Network Learning Via Embedding Human-Guided Concepts
Zhihao Wang, Chuang Zhu

TL;DR
TCNL introduces a human-guided, concept-based approach to enhance transparency and controllability in CNN models, enabling better understanding and manipulation of learned features for specific tasks.
Contribution
The paper proposes a novel concept-based method, TCNL, integrating human-defined concepts into CNNs to improve interpretability and control over the model's learned representations.
Findings
Improved transparency-interpretability through concept encoding.
Enhanced controllability of CNNs with human-guided concepts.
Generalizable approach applicable to various classification tasks.
Abstract
Explaining deep learning models is of vital importance for understanding artificial intelligence systems, improving safety, and evaluating fairness. To better understand and control the CNN model, many methods for transparency-interpretability have been proposed. However, most of these works are less intuitive for human understanding and have insufficient human control over the CNN model. We propose a novel method, Transparent and Controllable Network Learning (TCNL), to overcome such challenges. Towards the goal of improving transparency-interpretability, in TCNL, we define some concepts for specific classification tasks through scientific human-intuition study and incorporate concept information into the CNN model. In TCNL, the shallow feature extractor gets preliminary features first. Then several concept feature extractors are built right after the shallow feature extractor to learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Topic Modeling
