LICO: Explainable Models with Language-Image Consistency

Yiming Lei; Zilong Li; Yangyang Li; Junping Zhang; Hongming Shan

arXiv:2310.09821·cs.CV·October 17, 2023·5 cites

LICO: Explainable Models with Language-Image Consistency

Yiming Lei, Zilong Li, Yangyang Li, Junping Zhang, Hongming Shan

PDF

Open Access

TL;DR

LICO introduces a novel explainable image classification approach that aligns language prompts with visual features, producing more accurate saliency maps and improving classification performance without extra inference costs.

Contribution

The paper proposes LICO, a new model that correlates linguistic prompts with visual features using coarse-to-fine alignment and optimal transport, enhancing interpretability and accuracy.

Findings

01

LICO generates more explainable attention maps than existing methods.

02

LICO improves classification accuracy without additional inference overhead.

03

Experimental results on eight datasets validate LICO's effectiveness.

Abstract

Interpreting the decisions of deep learning models has been actively studied since the explosion of deep neural networks. One of the most convincing interpretation approaches is salience-based visual interpretation, such as Grad-CAM, where the generation of attention maps depends merely on categorical labels. Although existing interpretation methods can provide explainable decision clues, they often yield partial correspondence between image and saliency maps due to the limited discriminative information from one-hot labels. This paper develops a Language-Image COnsistency model for explainable image classification, termed LICO, by correlating learnable linguistic prompts with corresponding visual features in a coarse-to-fine manner. Specifically, we first establish a coarse global manifold structure alignment by minimizing the distance between the distributions of image and language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Advanced Neural Network Applications