Interpreto: An Explainability Library for Transformers

Antonin Poch\'e; Thomas Mullor; Gabriele Sarti; Fr\'ed\'eric Boisnard; Corentin Friedrich; Charlotte Claye; Fran\c{c}ois Hoofd; Raphael Bernas; C\'eline Hudelot; Fanny Jourdan

arXiv:2512.09730·cs.CL·February 24, 2026

Interpreto: An Explainability Library for Transformers

Antonin Poch\'e, Thomas Mullor, Gabriele Sarti, Fr\'ed\'eric Boisnard, Corentin Friedrich, Charlotte Claye, Fran\c{c}ois Hoofd, Raphael Bernas, C\'eline Hudelot, Fanny Jourdan

PDF

Open Access

TL;DR

Interpreto is an open-source Python library that offers comprehensive interpretability tools for HuggingFace language models, including attribution and concept-based explanations, with an end-to-end pipeline for deeper insights.

Contribution

It introduces a unified API for explanation workflows and features an end-to-end concept-based pipeline, advancing interpretability methods for transformer models.

Findings

01

Supports a wide range of models from BERT to LLMs

02

Provides a unified API for classification and text generation

03

Includes an end-to-end concept-based explanation pipeline

Abstract

Interpreto is an open-source Python library for interpreting HuggingFace language models, from early BERT variants to LLMs. It provides two complementary families of methods: attribution methods and concept-based explanations. The library bridges recent research and practical tooling by exposing explanation workflows through a unified API for both classification and text generation. A key differentiator is its end-to-end concept-based pipeline (from activation extraction to concept learning, interpretation, and scoring), which goes beyond feature-level attributions and is uncommon in existing libraries.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Scientific Computing and Data Management · Machine Learning in Materials Science