LG-CAV: Train Any Concept Activation Vector with Language Guidance

Qihan Huang; Jie Song; Mengqi Xue; Haofei Zhang; Bingde Hu; Huiqiong; Wang; Hao Jiang; Xingen Wang; Mingli Song

arXiv:2410.10308·cs.CV·October 15, 2024

LG-CAV: Train Any Concept Activation Vector with Language Guidance

Qihan Huang, Jie Song, Mengqi Xue, Haofei Zhang, Bingde Hu, Huiqiong, Wang, Hao Jiang, Xingen Wang, Mingli Song

PDF

Open Access 1 Reviews

TL;DR

LG-CAV leverages pre-trained vision-language models to train concept activation vectors without labeled data, enabling explainability and model correction across diverse datasets and architectures.

Contribution

Proposes a novel language-guided approach to train CAVs without labeled data using vision-language models and introduces a model correction technique called activation sample reweighting.

Findings

01

LG-CAV outperforms previous CAV methods in quality across datasets.

02

The model correction method achieves state-of-the-art results.

03

Applicable to various architectures and datasets.

Abstract

Concept activation vector (CAV) has attracted broad research interest in explainable AI, by elegantly attributing model predictions to specific concepts. However, the training of CAV often necessitates a large number of high-quality images, which are expensive to curate and thus limited to a predefined set of concepts. To address this issue, we propose Language-Guided CAV (LG-CAV) to harness the abundant concept knowledge within the certain pre-trained vision-language models (e.g., CLIP). This method allows training any CAV without labeled data, by utilizing the corresponding concept descriptions as guidance. To bridge the gap between vision-language model and the target model, we calculate the activation values of concept descriptions on a common pool of images (probe images) with vision-language model and utilize them as language guidance to train the LG-CAV. Furthermore, after…

Peer Reviews

Decision·NeurIPS 2024 poster

Reviewer 01Rating 5Confidence 3

Strengths

1. The paper is well-written. 2. The use of vision-language models allows for training CAVs without the need for labeled data. 3. The introduction of GA, CE, and DSR modules improves the quality of LG-CAVs. 4. Beyond generating explanations, the method is applied to model correction, leading to improved performance in target models. 5. The results show substantial improvements in both concept accuracy and concept-to-class accuracy compared to existing methods.

Weaknesses

1. The method proposed in this paper does not clearly address the data scarcity problem of the original CAV methods, which was highlighted at the beginning. Although the method is effective, it is not evident why it successfully addresses the data scarcity issue. 2. The method heavily relies on the availability and performance of pre-trained vision-language models like CLIP, which may not always be accessible or optimal for all tasks. 3. The introduction of multiple enhancement modules increases

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies

MethodsSparse Evolutionary Training