Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks
Sunil Kumar Vengalil, Neelam Sinha

TL;DR
This paper proposes a method to analyze and extract visual concepts from trained deep neural networks, specifically using MNIST data, to improve explainability and data augmentation.
Contribution
It introduces a technique to derive primitive visual concepts from feature maps, enhancing model explainability and enabling effective data augmentation.
Findings
Reduced reconstruction loss from 120 to 60 with augmentation
Generated about 60,000 new images using visual concepts
Demonstrated improved explainability of deep models
Abstract
Deep neural networks have become the default choice for many applications like image and video recognition, segmentation and other image and video related tasks.However, a critical challenge with these models is the lack of explainability.This requirement of generating explainable predictions has motivated the research community to perform various analysis on trained models.In this study, we analyze the learned feature maps of trained models using MNIST images for achieving more explainable predictions.Our study is focused on deriving a set of primitive elements, here called visual concepts, that can be used to generate any arbitrary sample from the data generating distribution.We derive the primitive elements from the feature maps learned by the model.We illustrate the idea by generating visual concepts from a Variational Autoencoder trained using MNIST images.We augment the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
