Routing brain traffic through the von Neumann bottleneck: Efficient cache usage in spiking neural network simulation code on general purpose computers
Jari Pronold, Jakob Jordan, Brian J. N. Wylie, Itaru Kitayama, Markus, Diesmann, Susanne Kunkel

TL;DR
This paper investigates how cache optimization techniques can significantly improve the efficiency of large-scale spiking neural network simulations on general-purpose computers, addressing the von Neumann bottleneck.
Contribution
It demonstrates that applying cache-aware techniques like prefetching and pipelining can halve simulation times in neural network models.
Findings
Algorithmic changes reduce simulation time by up to 50%.
Cache optimization techniques improve performance on many-core systems.
Irregular memory access patterns are a key bottleneck in neural simulations.
Abstract
Simulation is a third pillar next to experiment and theory in the study of complex dynamic systems such as biological neural networks. Contemporary brain-scale networks correspond to directed graphs of a few million nodes, each with an in-degree and out-degree of several thousands of edges, where nodes and edges correspond to the fundamental biological units, neurons and synapses, respectively. When considering a random graph, each node's edges are distributed across thousands of parallel processes. The activity in neuronal networks is also sparse. Each neuron occasionally transmits a brief signal, called spike, via its outgoing synapses to the corresponding target neurons. This spatial and temporal sparsity represents an inherent bottleneck for simulations on conventional computers: Fundamentally irregular memory-access patterns cause poor cache utilization. Using an established…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
