Hide-and-Seek: A Template for Explainable AI

Thanos Tagaris; Andreas Stafylopatis

arXiv:2005.00130·cs.LG·May 4, 2020·6 cites

Hide-and-Seek: A Template for Explainable AI

Thanos Tagaris, Andreas Stafylopatis

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Hide-and-Seek framework for training interpretable neural networks, providing a theoretical basis and demonstrating that high interpretability can be achieved without losing predictive accuracy.

Contribution

It presents a novel training framework for interpretable neural networks and offers a theoretical foundation for evaluating similar approaches.

Findings

01

Neural networks can be made highly interpretable without sacrificing accuracy

02

Theoretical analysis supports the effectiveness of the Hide-and-Seek framework

03

Experimental results validate the interpretability and performance of the proposed method

Abstract

Lack of transparency has been the Achilles heal of Neural Networks and their wider adoption in industry. Despite significant interest this shortcoming has not been adequately addressed. This study proposes a novel framework called Hide-and-Seek (HnS) for training Interpretable Neural Networks and establishes a theoretical foundation for exploring and comparing similar ideas. Extensive experimentation indicates that a high degree of interpretability can be imputed into Neural Networks, without sacrificing their predictive power.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

djib2011/hide-and-seek
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsInterpretability