Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic   Filter Attention

Yu Yang; Seungbae Kim; Jungseock Joo

arXiv:2204.04601·cs.CV·April 12, 2022·1 cites

Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention

Yu Yang, Seungbae Kim, Jungseock Joo

PDF

Open Access 1 Repo

TL;DR

This paper introduces LaViSE, a framework that enables convolutional neural networks to generate textual explanations of their internal filters, aiding interpretability and bias detection without requiring semantic labels during training.

Contribution

LaViSE is a novel modular framework that maps visual features to semantic descriptions at the filter level, applicable to any trained CNN regardless of original training data.

Findings

01

Generates meaningful descriptions for CNN filters beyond training categories

02

Enables unsupervised bias analysis in datasets

03

Works with any trained CNN without needing original training data

Abstract

Interpretability is an important property for visual models as it helps researchers and users understand the internal mechanism of a complex model. However, generating semantic explanations about the learned representation is challenging without direct supervision to produce such explanations. We propose a general framework, Latent Visual Semantic Explainer (LaViSE), to teach any existing convolutional neural network to generate text descriptions about its own latent representations at the filter level. Our method constructs a mapping between the visual and semantic spaces using generic image datasets, using images and category names. It then transfers the mapping to the target domain which does not have semantic labels. The proposed framework employs a modular structure and enables to analyze any trained network whether or not its original training data is available. We show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuyang0901/lavise
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Human Pose and Action Recognition