LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs
Benjamin J. Lengerich, Sebastian Bordt, Harsha Nori, Mark E. Nunnally,, Yin Aphinyanaphongs, Manolis Kellis, Rich Caruana

TL;DR
This paper demonstrates that large language models can interpret, analyze, and repair glass-box models like GAMs by leveraging hierarchical reasoning and background knowledge, enhancing data science tasks.
Contribution
It introduces a hierarchical reasoning approach enabling LLMs to work with interpretable models, automating anomaly detection, explanation, and repair in data science.
Findings
LLMs effectively interpret and analyze GAMs.
LLMs can detect and explain anomalies in models.
The paper provides an open-source tool for LLM-GAM interaction.
Abstract
We show that large language models (LLMs) are remarkably good at working with interpretable models that decompose complex outcomes into univariate graph-represented components. By adopting a hierarchical approach to reasoning, LLMs can provide comprehensive model-level summaries without ever requiring the entire model to fit in context. This approach enables LLMs to apply their extensive background knowledge to automate common tasks in data science such as detecting anomalies that contradict prior knowledge, describing potential reasons for the anomalies, and suggesting repairs that would remove the anomalies. We use multiple examples in healthcare to demonstrate the utility of these new capabilities of LLMs, with particular emphasis on Generalized Additive Models (GAMs). Finally, we present the package as an open-source LLM-GAM interface.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
