The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention
Kazuki Irie, R\'obert Csord\'as, J\"urgen Schmidhuber

TL;DR
This paper revisits the dual form of neural networks, revealing how test predictions relate to training patterns through attention mechanisms, enabling visualization of training data influence at test time.
Contribution
It demonstrates the practical application of the dual formulation to interpret neural network predictions by visualizing attention over training data.
Findings
Dual form links test predictions to training data via attention.
Visualization of training data influence improves interpretability.
Experiments show potential and limitations of this approach.
Abstract
Linear layers in neural networks (NNs) trained by gradient descent can be expressed as a key-value memory system which stores all training datapoints and the initial weights, and produces outputs using unnormalised dot attention over the entire training experience. While this has been technically known since the 1960s, no prior work has effectively studied the operations of NNs in such a form, presumably due to prohibitive time and space complexities and impractical model sizes, all of them growing linearly with the number of training patterns which may get very large. However, this dual formulation offers a possibility of directly visualising how an NN makes use of training patterns at test time, by examining the corresponding attention weights. We conduct experiments on small scale supervised image classification tasks in single-task, multi-task, and continual learning settings, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Neural Networks and Applications · Machine Learning and Data Classification
