Library network, a possible path to explainable neural networks

Jung Hoon Lee

arXiv:1909.13360·cs.LG·March 19, 2020

Library network, a possible path to explainable neural networks

Jung Hoon Lee

PDF

Open Access

TL;DR

This paper proposes a new algorithm that enhances understanding of deep neural networks' decision processes and detects adversarial attacks, addressing transparency and vulnerability issues in high-stakes applications.

Contribution

The paper introduces an algorithm that traces DNN decision pathways across layers and identifies adversarial attacks, improving interpretability and robustness.

Findings

01

Algorithm effectively traces decision processes across layers

02

Detects adversarial attacks reliably

03

Improves transparency of DNN decision-making

Abstract

Deep neural networks (DNNs) may outperform human brains in complex tasks, but the lack of transparency in their decision-making processes makes us question whether we could fully trust DNNs with high stakes problems. As DNNs' operations rely on a massive number of both parallel and sequential linear/nonlinear computations, predicting their mistakes is nearly impossible. Also, a line of studies suggests that DNNs can be easily deceived by adversarial attacks, indicating that their decisions can easily be corrupted by unexpected factors. Such vulnerability must be overcome if we intend to take advantage of DNNs' efficiency in high stakes problems. Here, we propose an algorithm that can help us better understand DNNs' decision-making processes. Our empirical evaluations suggest that this algorithm can effectively trace DNNs' decision processes from one layer to another and detect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications