Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Kwan Ho Ryan Chan, Aditya Chattopadhyay, Benjamin David Haeffele, Rene, Vidal

TL;DR
This paper extends Variational Information Pursuit by integrating Large Language and Multimodal Models to generate interpretable concepts, enabling scalable, interpretable, and high-performing predictive models without manual concept annotation.
Contribution
The work introduces a novel FM+V-IP framework that leverages foundational models to automatically generate and annotate concepts, eliminating the need for manual dense labeling and filtering.
Findings
FM+V-IP achieves better test performance than V-IP with human concepts.
The method requires fewer concepts/queries than CBMs for similar performance.
No concept filtering is necessary with sufficiently informative query sets.
Abstract
Variational Information Pursuit (V-IP) is a framework for making interpretable predictions by design by sequentially selecting a short chain of task-relevant, user-defined and interpretable queries about the data that are most informative for the task. While this allows for built-in interpretability in predictive models, applying V-IP to any task requires data samples with dense concept-labeling by domain experts, limiting the application of V-IP to small-scale tasks where manual data annotation is feasible. In this work, we extend the V-IP framework with Foundational Models (FMs) to address this limitation. More specifically, we use a two-step process, by first leveraging Large Language Models (LLMs) to generate a sufficiently large candidate set of task-relevant interpretable concepts, then using Large Multimodal Models to annotate each data sample by semantic similarity with each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI)
