Graph-based Neural Modules to Inspect Attention-based Architectures: A Position Paper
Breno W. Carvalho, Artur D'Avilla Garcez, Luis C. Lamb

TL;DR
This position paper proposes a graph-based abstraction for encoder-decoder neural architectures to enhance interpretability, visualization, and integration with symbolic reasoning, aiming to improve understanding and performance of deep learning models.
Contribution
It introduces a novel two-way graph representation for encoder-decoder models that links network segments with tensor data, facilitating visualization and neuro-symbolic integration.
Findings
Proposes a graph-based abstraction for neural networks.
Enables visualization and editing of model components.
Facilitates integration of symbolic reasoning with deep learning.
Abstract
Encoder-decoder architectures are prominent building blocks of state-of-the-art solutions for tasks across multiple fields where deep learning (DL) or foundation models play a key role. Although there is a growing community working on the provision of interpretation for DL models as well as considerable work in the neuro-symbolic community seeking to integrate symbolic representations and DL, many open questions remain around the need for better tools for visualization of the inner workings of DL architectures. In particular, encoder-decoder models offer an exciting opportunity for visualization and editing by humans of the knowledge implicitly represented in model weights. In this work, we explore ways to create an abstraction for segments of the network as a two-way graph-based representation. Changes to this graph structure should be reflected directly in the underlying tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Parallel Computing and Optimization Techniques · Advanced Memory and Neural Computing
