Routing brain traffic through the von Neumann bottleneck: Parallel sorting and refactoring
Jari Pronold, Jakob Jordan, Brian J. N. Wylie, Itaru Kitayama, Markus, Diesmann, Susanne Kunkel

TL;DR
This paper introduces a parallel sorting algorithm for spike delivery in neuronal network simulations, significantly reducing simulation time and improving scalability on many-core systems.
Contribution
It proposes a novel parallel sorting approach for spike dispatching that halves instruction count and enhances efficiency in large-scale neuronal simulations.
Findings
Spike delivery time reduced by up to 40%
Parallel sorting enables fully parallel spike dispatching
Algorithm scales well with network size from 100,000 to 1 billion neurons
Abstract
Generic simulation code for spiking neuronal networks spends the major part of time in the phase where spikes have arrived at a compute node and need to be delivered to their target neurons. These spikes were emitted over the last interval between communication steps by source neurons distributed across many compute nodes and are inherently irregular with respect to their targets. For finding the targets, the spikes need to be dispatched to a three-dimensional data structure with decisions on target thread and synapse type to be made on the way. With growing network size a compute node receives spikes from an increasing number of different source neurons until in the limit each synapse on the compute node has a unique source. Here we show analytically how this sparsity emerges over the practically relevant range of network sizes from a hundred thousand to a billion neurons. By profiling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
