NeurFlow: Interpreting Neural Networks through Neuron Groups and   Functional Interactions

Tue M. Cao; Nhat X. Hoang; Hieu H. Pham; Phi Le Nguyen; My T. Thai

arXiv:2502.16105·cs.CV·February 25, 2025

NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions

Tue M. Cao, Nhat X. Hoang, Hieu H. Pham, Phi Le Nguyen, My T. Thai

PDF

Open Access 1 Repo

TL;DR

NeurFlow is a novel framework that enhances neural network interpretability by analyzing neuron groups and their functional interactions, providing clearer insights into internal processes and practical applications.

Contribution

This paper introduces NeurFlow, a framework that shifts from individual neuron analysis to neuron group interactions, improving interpretability and reducing computational costs.

Findings

01

NeurFlow accurately identifies core neuron groups.

02

It constructs hierarchical circuits of neuron interactions.

03

Demonstrates utility in image debugging and concept labeling.

Abstract

Understanding the inner workings of neural networks is essential for enhancing model performance and interpretability. Current research predominantly focuses on examining the connection between individual neurons and the model's final predictions. Which suffers from challenges in interpreting the internal workings of the model, particularly when neurons encode multiple unrelated features. In this paper, we propose a novel framework that transitions the focus from analyzing individual neurons to investigating groups of neurons, shifting the emphasis from neuron-output relationships to functional interaction between neurons. Our automated framework, NeurFlow, first identifies core neurons and clusters them into groups based on shared functional relationships, enabling a more coherent and interpretable view of the network's internal processes. This approach facilitates the construction of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tue147/neurflow
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus