Hidden Activations Are Not Enough: A General Approach to Neural Network   Predictions

Samuel Leblanc; Aiky Rasolomanana; Marco Armenta

arXiv:2409.13163·cs.LG·September 23, 2024

Hidden Activations Are Not Enough: A General Approach to Neural Network Predictions

Samuel Leblanc, Aiky Rasolomanana, Marco Armenta

PDF

Open Access 1 Repo

TL;DR

This paper presents a new mathematical framework using quiver representation theory to analyze neural network predictions, capturing more information than traditional methods and applicable across architectures and tasks.

Contribution

Introduces a novel, architecture- and task-agnostic framework based on quiver representations to analyze neural network predictions and detect adversarial examples.

Findings

01

Effective adversarial example detection on MNIST and FashionMNIST

02

Framework captures richer information than hidden activations

03

Applicable to various architectures and attack methods

Abstract

We introduce a novel mathematical framework for analyzing neural networks using tools from quiver representation theory. This framework enables us to quantify the similarity between a new data sample and the training data, as perceived by the neural network. By leveraging the induced quiver representation of a data sample, we capture more information than traditional hidden layer outputs. This quiver representation abstracts away the complexity of the computations of the forward pass into a single matrix, allowing us to employ simple geometric and statistical arguments in a matrix space to study neural network predictions. Our mathematical results are architecture-agnostic and task-agnostic, making them broadly applicable. As proof of concept experiments, we apply our results for the MNIST and FashionMNIST datasets on the problem of detecting adversarial examples on different MLP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marcoarmenta/hidden-activations-are-not-enough
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications