Captum: A unified and generic model interpretability library for PyTorch

Narine Kokhlikyan; Vivek Miglani; Miguel Martin; Edward Wang; Bilal Alsallakh; Jonathan Reynolds; Alexander Melnikov; Natalia Kliushkina; Carlos Araya; Siqi Yan; Orion Reblitz-Richardson

arXiv:2009.07896 · cs.LG · September 18, 2020 · 635 cites

TL;DR

Captum is an open-source, versatile library for interpreting PyTorch models, supporting various attribution algorithms, modalities, and evaluation metrics, with an interactive visualization tool for model debugging.

Contribution

It introduces a unified, extensible interpretability library for PyTorch models, including a visualization tool, supporting multiple data modalities and scalable computations.

Findings

01 Supports multiple data modalities, including images, text, audio, and video

02 Provides scalable, memory-efficient attribution algorithms

03 Includes an interactive visualization tool for model debugging
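One way to picture the memory-efficiency finding is a gradient-based attribution whose integration steps are processed in small chunks instead of all at once. The following is a minimal sketch of integrated gradients in plain Python, not Captum's implementation: the function names, the `chunk_size` parameter, and the toy quadratic model with its hand-written gradient are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence


def integrated_gradients(
    grad_fn: Callable[[Sequence[float]], List[float]],
    inputs: Sequence[float],
    baseline: Sequence[float],
    n_steps: int = 50,
    chunk_size: int = 10,
) -> List[float]:
    """Approximate integrated gradients with a midpoint Riemann sum.

    Interpolated points between `baseline` and `inputs` are built
    `chunk_size` at a time, so at most `chunk_size` of them exist at once.
    """
    diffs = [x - b for x, b in zip(inputs, baseline)]
    grad_sums = [0.0] * len(inputs)
    for start in range(0, n_steps, chunk_size):
        stop = min(start + chunk_size, n_steps)
        # Midpoints of the sub-intervals covered by this chunk.
        alphas = [(k + 0.5) / n_steps for k in range(start, stop)]
        points = [[b + a * d for b, d in zip(baseline, diffs)] for a in alphas]
        for point in points:
            for i, g in enumerate(grad_fn(point)):
                grad_sums[i] += g
    # Scale the averaged gradient by the input-baseline difference.
    return [d * s / n_steps for d, s in zip(diffs, grad_sums)]


def toy_model(x: Sequence[float]) -> float:
    # Hypothetical model used only for this sketch: f(x) = sum of squares.
    return sum(v * v for v in x)


def toy_grad(x: Sequence[float]) -> List[float]:
    # Analytic gradient of toy_model.
    return [2.0 * v for v in x]


inputs = [1.0, -2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
chunked = integrated_gradients(toy_grad, inputs, baseline, n_steps=50, chunk_size=10)
unchunked = integrated_gradients(toy_grad, inputs, baseline, n_steps=50, chunk_size=50)
print(chunked)
```

Chunking changes only how many interpolated points are held in memory at a time, not the result, so `chunked` and `unchunked` agree up to floating-point rounding. In a tensor library the same idea would be applied to batches of interpolated inputs passed through the model together.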

Abstract

In this paper we introduce a novel, unified, open-source model interpretability library for PyTorch [12]. The library contains generic implementations of a number of gradient and perturbation-based attribution algorithms, also known as feature, neuron and layer importance algorithms, as well as a set of evaluation metrics for these algorithms. It can be used for both classification and non-classification models, including graph-structured models built on Neural Networks (NN). In this paper we give a high-level overview of supported attribution algorithms and show how to perform memory-efficient and scalable computations. We emphasize that the three main characteristics of the library are multimodality, extensibility and ease of use. Multimodality means support for inputs of different modalities, such as image, text, audio or video. Extensibility allows adding new algorithms and features. The library…
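To make the notion of a perturbation-based attribution algorithm concrete, here is a minimal sketch of feature ablation in plain Python. It is an illustration under stated assumptions rather than the library's code: `feature_ablation`, `linear_model`, and the weights are hypothetical names and values chosen for this example, and the model is a simple function of a list of floats instead of a PyTorch module.

```python
from typing import Callable, List, Sequence


def feature_ablation(
    forward: Callable[[Sequence[float]], float],
    inputs: Sequence[float],
    baseline: float = 0.0,
) -> List[float]:
    """Replace one feature at a time with `baseline` and record how much
    the model output changes relative to the unperturbed input."""
    reference_output = forward(inputs)
    attributions = []
    for i in range(len(inputs)):
        perturbed = list(inputs)
        perturbed[i] = baseline  # ablate feature i only
        attributions.append(reference_output - forward(perturbed))
    return attributions


def linear_model(x: Sequence[float]) -> float:
    # Hypothetical model for this sketch: a fixed weighted sum.
    weights = [2.0, -1.0, 0.5]
    return sum(w * v for w, v in zip(weights, x))


scores = feature_ablation(linear_model, [1.0, 3.0, 4.0])
print(scores)  # [2.0, -3.0, 2.0]
```

For a linear model with a zero baseline, each score equals weight times input value, which makes the output easy to check by hand. Nonlinear models generally give scores that depend on the choice of baseline.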

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks

Methods: Interpretability