An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC
Matthias Jobst, Tim Langer, Chen Liu, Mehmet Alici, Hector A. Gonzalez, Christian Mayr

TL;DR
This paper introduces a comprehensive framework that enables efficient end-to-end inference of large DNNs, including transformers, on the SpiNNaker2 neuromorphic chip, bridging PyTorch models to hardware execution.
Contribution
It extends OctopuScheduler with a multi-layer DNN scheduling framework and a front-end for quantization and lowering, facilitating edge-based inference on SpiNNaker2.
Findings
Supports large and complex DNNs including transformers
Enables end-to-end inference from PyTorch to SpiNNaker2
Improves efficiency of neuromorphic DNN deployment
Abstract
This work presents a multi-layer DNN scheduling framework as an extension of OctopuScheduler, providing an end-to-end flow from PyTorch models to inference on a single SpiNNaker2 chip. Together with a front-end comprised of quantization and lowering steps, the proposed framework enables the edge-based execution of large and complex DNNs up to transformer scale using the neuromorphic platform SpiNNaker2.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Memory and Neural Computing · Embedded Systems Design Techniques
