Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction

Darius Afchar; Romain Hennequin

arXiv:2008.11406·cs.LG·August 27, 2020

TL;DR

This paper introduces a neural network architecture for the attribution task that is interpretable by design: masked weights let hidden features be attributed to specific inputs throughout the network, with accuracy reported as comparable to non-interpretable models.

Contribution

The paper presents a neural network design that is interpretable by construction rather than explained post hoc: masked weights split the network into input-restricted sub-networks, which are trained as a boosted mixture of experts.

Findings

01 Achieves comparable accuracy to non-interpretable models

02 Provides detailed attribution explanations

03 Effective on synthetic and real-world data

Abstract

Explaining recommendations enables users to understand whether recommended items are relevant to their needs and has been shown to increase their trust in the system. More generally, designing explainable machine learning models is key to checking the sanity and robustness of a decision process and to improving their efficiency, yet it remains a challenge for complex architectures, especially deep neural networks, which are often deemed "black-box". In this paper, we propose a novel formulation of interpretable deep neural networks for the attribution task. Unlike popular post-hoc methods, our approach is interpretable by design. Using masked weights, hidden features can be deeply attributed, split into several input-restricted sub-networks and trained as a boosted mixture of experts. Experimental results on synthetic data and real-world recommendation tasks demonstrate that our…
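The masked-weight idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation (their official repository uses TensorFlow); it is a dependency-free toy that assumes two hypothetical input groups, block-structured binary masks, and a plain sum of sub-network outputs in place of the boosted mixture-of-experts training, which is omitted here. All names (`masked_linear`, `block_mask`, `forward`, the group assignments) are illustrative assumptions.

```python
import random

def masked_linear(x, weights, mask):
    """Compute y_j = sum_i x_i * W[i][j] * M[i][j] (bias omitted for brevity)."""
    n_out = len(weights[0])
    return [
        sum(x[i] * weights[i][j] * mask[i][j] for i in range(len(x)))
        for j in range(n_out)
    ]

def relu(v):
    return [max(0.0, a) for a in v]

def block_mask(in_groups, out_groups):
    """mask[i][j] is 1 only when input unit i and output unit j share a group."""
    return [[1.0 if gi == gj else 0.0 for gj in out_groups] for gi in in_groups]

def random_weights(n_in, n_out, rng):
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_out)] for _ in range(n_in)]

def forward(x, params):
    """Return (prediction, per-group contributions)."""
    hidden = relu(masked_linear(x, params["w1"], params["m1"]))
    # One scalar per group, reading only that group's hidden units.
    contributions = masked_linear(hidden, params["w2"], params["m2"])
    return sum(contributions), contributions

rng = random.Random(0)
input_groups = [0, 0, 1, 1]          # hypothetical: features 0-1 vs. features 2-3
hidden_groups = [0, 0, 0, 1, 1, 1]   # each hidden unit is assigned to one group
output_groups = [0, 1]               # one output scalar per sub-network

params = {
    "w1": random_weights(4, 6, rng),
    "m1": block_mask(input_groups, hidden_groups),
    "w2": random_weights(6, 2, rng),
    "m2": block_mask(hidden_groups, output_groups),
}

x = [0.5, -1.0, 2.0, 0.3]
pred, contrib = forward(x, params)

# Changing only group 1's inputs must leave group 0's contribution untouched.
x_perturbed = [0.5, -1.0, -3.0, 4.0]
pred_perturbed, contrib_perturbed = forward(x_perturbed, params)
```

Because the masks zero every cross-group weight, each hidden unit depends on a single input group, so the per-group outputs in `contrib` can be read directly as attributions and they sum to the prediction by construction.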

Code & Models

Repositories

deezer/interpretable_nn_attribution (TensorFlow, official)