LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multimodal Large Language Models

Mengdan Zhu; Raasikh Kanjiani; Jiahui Lu; Andrew Choi; Qirui Ye; Liang Zhao

arXiv:2406.14862·cs.LG·December 22, 2025·1 cites

LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multimodal Large Language Models

Mengdan Zhu, Raasikh Kanjiani, Jiahui Lu, Andrew Choi, Qirui Ye, Liang Zhao

PDF

Open Access 1 Repo

TL;DR

LatentExplainer is a novel framework that uses multimodal large language models to generate human-understandable explanations of latent variables in deep generative models, improving interpretability and understanding of these complex models.

Contribution

This paper introduces LatentExplainer, the first method to leverage multimodal large language models for explaining latent variables in deep generative models, addressing interpretability challenges.

Findings

01

Outperforms existing methods in explanation quality

02

Effectively interprets latent variables across datasets

03

Enhances model transparency and trustworthiness

Abstract

Deep generative models like VAEs and diffusion models have advanced various generation tasks by leveraging latent variables to learn data distributions and generate high-quality samples. Despite the field of explainable AI making strides in interpreting machine learning models, understanding latent variables in generative models remains challenging. This paper introduces LatentExplainer, a framework for automatically generating semantically meaningful explanations of latent variables in deep generative models. LatentExplainer tackles three main challenges: inferring the meaning of latent variables, aligning explanations with inductive biases, and handling varying degrees of explainability. Our approach perturbs latent variables, interprets changes in generated data, and uses multimodal large language models (MLLMs) to produce human-understandable explanations. We evaluate our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mengdanzhu/latentexplainer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies

MethodsDiffusion