Variational Information Pursuit with Large Language and Multimodal   Models for Interpretable Predictions

Kwan Ho Ryan Chan; Aditya Chattopadhyay; Benjamin David Haeffele; Rene; Vidal

arXiv:2308.12562·cs.LG·August 25, 2023

Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions

Kwan Ho Ryan Chan, Aditya Chattopadhyay, Benjamin David Haeffele, Rene, Vidal

PDF

Open Access

TL;DR

This paper extends Variational Information Pursuit by integrating Large Language and Multimodal Models to generate interpretable concepts, enabling scalable, interpretable, and high-performing predictive models without manual concept annotation.

Contribution

The work introduces a novel FM+V-IP framework that leverages foundational models to automatically generate and annotate concepts, eliminating the need for manual dense labeling and filtering.

Findings

01

FM+V-IP achieves better test performance than V-IP with human concepts.

02

The method requires fewer concepts/queries than CBMs for similar performance.

03

No concept filtering is necessary with sufficiently informative query sets.

Abstract

Variational Information Pursuit (V-IP) is a framework for making interpretable predictions by design by sequentially selecting a short chain of task-relevant, user-defined and interpretable queries about the data that are most informative for the task. While this allows for built-in interpretability in predictive models, applying V-IP to any task requires data samples with dense concept-labeling by domain experts, limiting the application of V-IP to small-scale tasks where manual data annotation is feasible. In this work, we extend the V-IP framework with Foundational Models (FMs) to address this limitation. More specifically, we use a two-step process, by first leveraging Large Language Models (LLMs) to generate a sufficiently large candidate set of task-relevant interpretable concepts, then using Large Multimodal Models to annotate each data sample by semantic similarity with each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI)