Restyling Unsupervised Concept Based Interpretable Networks with Generative Models
Jayneel Parekh, Quentin Bouniot, Pavlo Mozharovskyi, Alasdair Newson,, Florence d'Alch\'e-Buc

TL;DR
This paper introduces a novel approach that maps high-level concepts in interpretable neural networks to the latent space of pretrained generative models, enabling better visualization, interpretation, and training efficiency for large-scale image recognition tasks.
Contribution
The method leverages pretrained generative models to improve visualization, interpretability, and training efficiency of unsupervised concept-based neural networks.
Findings
High-quality visualization of concepts achieved
Improved interpretability and fidelity of learned concepts
Efficient training through leveraging pretrained generative models
Abstract
Developing inherently interpretable models for prediction has gained prominence in recent years. A subclass of these models, wherein the interpretable network relies on learning high-level concepts, are valued because of closeness of concept representations to human communication. However, the visualization and understanding of the learnt unsupervised dictionary of concepts encounters major limitations, especially for large-scale images. We propose here a novel method that relies on mapping the concept features to the latent space of a pretrained generative model. The use of a generative model enables high quality visualization, and lays out an intuitive and interactive procedure for better interpretation of the learnt concepts by imputing concept activations and visualizing generated modifications. Furthermore, leveraging pretrained generative models has the additional advantage of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI) · Advanced Image and Video Retrieval Techniques
