i-Algebra: Towards Interactive Interpretability of Deep Neural Networks
Xinyang Zhang, Ren Pang, Shouling Ji, Fenglong Ma, Ting Wang

TL;DR
i-Algebra introduces an interactive framework for interpreting deep neural networks, enabling users to perform flexible, composable analyses that improve understanding and usability of DNNs in critical domains.
Contribution
This paper presents i-Algebra, the first interactive interpretability framework for DNNs using composable operators and a declarative query language.
Findings
User studies show improved interpretability and usability.
Effective analysis of adversarial inputs and data cleansing.
Demonstrated flexibility in various interpretability tasks.
Abstract
Providing explanations for deep neural networks (DNNs) is essential for their use in domains wherein the interpretability of decisions is a critical prerequisite. Despite the plethora of work on interpreting DNNs, most existing solutions offer interpretability in an ad hoc, one-shot, and static manner, without accounting for the perception, understanding, or response of end-users, resulting in their poor usability in practice. In this paper, we argue that DNN interpretability should be implemented as the interactions between users and models. We present i-Algebra, a first-of-its-kind interactive framework for interpreting DNNs. At its core is a library of atomic, composable operators, which explain model behaviors at varying input granularity, during different inference stages, and from distinct interpretation perspectives. Leveraging a declarative query language, users are enabled to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications
