Towards Utilising a Range of Neural Activations for Comprehending   Representational Associations

Laura O'Mahony; Nikola S. Nikolov; David JP O'Sullivan

arXiv:2411.10019·cs.CV·November 18, 2024

Towards Utilising a Range of Neural Activations for Comprehending Representational Associations

Laura O'Mahony, Nikola S. Nikolov, David JP O'Sullivan

PDF

Open Access 1 Repo

TL;DR

This paper investigates the informational value of non-extremal neural activations in deep networks, demonstrating that analyzing a range of activation levels reveals complex representations and can improve model robustness against confounding factors.

Contribution

It introduces a novel approach to study mid-level neural activations, showing their usefulness in understanding representations and mitigating confounding concepts in neural networks.

Findings

01

Mid-level activations contain valuable information beyond maximal responses.

02

Analyzing a range of activations helps identify confounding concepts.

03

Method improves model robustness by curating data based on non-maximal activations.

Abstract

Recent efforts to understand intermediate representations in deep neural networks have commonly attempted to label individual neurons and combinations of neurons that make up linear directions in the latent space by examining extremal neuron activations and the highest direction projections. In this paper, we show that this approach, although yielding a good approximation for many purposes, fails to capture valuable information about the behaviour of a representation. Neural network activations are generally dense, and so a more complex, but realistic scenario is that linear directions encode information at various levels of stimulation. We hypothesise that non-extremal level activations contain complex information worth investigating, such as statistical associations, and thus may be used to locate confounding human interpretable concepts. We explore the value of studying a range of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lauraaisling/MID-level-activations
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications