Augmenting Interpretable Models with LLMs during Training

Chandan Singh; Armin Askari; Rich Caruana; Jianfeng Gao

arXiv:2209.11799·cs.AI·December 5, 2023·1 cites

Augmenting Interpretable Models with LLMs during Training

Chandan Singh, Armin Askari, Rich Caruana, Jianfeng Gao

PDF

Open Access 4 Repos 5 Models

TL;DR

This paper introduces Augmented Interpretable Models (Aug-imodels) that leverage LLMs during training to create highly efficient, interpretable models for NLP tasks, achieving superior performance and transparency compared to traditional models and large LLMs.

Contribution

The paper proposes a novel framework, Aug-imodels, that uses LLMs during training to enhance interpretability and efficiency without sacrificing accuracy.

Findings

01

Aug-imodels outperform non-augmented models on text classification.

02

Aug-GAM can surpass larger models like GPT-J in performance.

03

Aug-imodels provide interpretable insights in scientific NLP applications.

Abstract

Recent large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their proliferation into high-stakes domains (e.g. medicine) and compute-limited settings has created a burgeoning need for interpretability and efficiency. We address this need by proposing Augmented Interpretable Models (Aug-imodels), a framework for leveraging the knowledge learned by LLMs to build extremely efficient and interpretable models. Aug-imodels use LLMs during fitting but not during inference, allowing complete transparency and often a speed/memory improvement of greater than 1,000x for inference compared to LLMs. We explore two instantiations of Aug-imodels in natural-language processing: (i) Aug-GAM, which augments a generalized additive model with decoupled embeddings from an LLM and (ii) Aug-Tree, which augments a decision tree with LLM feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science