Augmenting Interpretable Models with LLMs during Training
Chandan Singh, Armin Askari, Rich Caruana, Jianfeng Gao

TL;DR
This paper introduces Augmented Interpretable Models (Aug-imodels) that leverage LLMs during training to create highly efficient, interpretable models for NLP tasks, achieving superior performance and transparency compared to traditional models and large LLMs.
Contribution
The paper proposes a novel framework, Aug-imodels, that uses LLMs during training to enhance interpretability and efficiency without sacrificing accuracy.
Findings
Aug-imodels outperform non-augmented models on text classification.
Aug-GAM can surpass larger models like GPT-J in performance.
Aug-imodels provide interpretable insights in scientific NLP applications.
Abstract
Recent large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their proliferation into high-stakes domains (e.g. medicine) and compute-limited settings has created a burgeoning need for interpretability and efficiency. We address this need by proposing Augmented Interpretable Models (Aug-imodels), a framework for leveraging the knowledge learned by LLMs to build extremely efficient and interpretable models. Aug-imodels use LLMs during fitting but not during inference, allowing complete transparency and often a speed/memory improvement of greater than 1,000x for inference compared to LLMs. We explore two instantiations of Aug-imodels in natural-language processing: (i) Aug-GAM, which augments a generalized additive model with decoupled embeddings from an LLM and (ii) Aug-Tree, which augments a decision tree with LLM feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science
