Modeling and Optimizing Performance Bottlenecks for Neuromorphic Accelerators
Jason Yik, Walter Gallego Gomez, Andrew Cheng, Benedetto Leto, Alessandro Pierro, Noah Pacik-Nelson, Korneel Van den Berghe, Vittorio Fra, Andreea Danielescu, Gianvito Urgese, Vijay Janapa Reddi

TL;DR
This paper provides a comprehensive analysis of neuromorphic accelerators, revealing their performance bottlenecks and proposing an optimization methodology that significantly improves runtime and energy efficiency.
Contribution
It introduces the first detailed performance bound and bottleneck analysis for neuromorphic accelerators, along with a floorline performance model and an optimization approach.
Findings
Identified three distinct bottleneck states: memory-bound, compute-bound, traffic-bound.
Developed the floorline performance model to visualize and understand performance limits.
Achieved up to 3.86x runtime improvement and 3.38x energy reduction with the proposed optimization.
Abstract
Neuromorphic accelerators offer promising platforms for machine learning (ML) inference by leveraging event-driven, spatially-expanded architectures that naturally exploit unstructured sparsity through co-located memory and compute. However, their unique architectural characteristics create performance dynamics that differ fundamentally from conventional accelerators. Existing workload optimization approaches for neuromorphic accelerators rely on aggregate network-wide sparsity and operation counting, but the extent to which these metrics actually improve deployed performance remains unknown. This paper presents the first comprehensive performance bound and bottleneck analysis of neuromorphic accelerators, revealing the shortcomings of the conventional metrics and offering an understanding of what facets matter for workload performance. We present both theoretical analytical modeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
