When Spike Sparsity Does Not Translate to Deployed Cost: VS-WNO on Jetson Orin Nano
Jason Yoo, Shailesh Garg, Souvik Chakraborty, Syed Bahauddin Alam

TL;DR
This study evaluates whether spike sparsity in neuromorphic neural networks reduces latency and energy on Jetson Orin Nano, finding that it does not due to dense runtime operations.
Contribution
It demonstrates that spike sparsity in VS-WNO does not translate into lower deployed cost on commodity GPU hardware.
Findings
Spike sparsity decreases mean spike rates significantly.
VS-WNO has higher latency and energy consumption than dense WNO.
Runtime analysis shows dense kernels dominate GPU time regardless of sparsity.
Abstract
Spiking neural operators are appealing for neuromorphic edge computing because event-driven substrates can, in principle, translate sparse activity into lower latency and energy. Whether that advantage survives deployment on commodity edge-GPU software stacks, however, remains unclear. We study this question on a Jetson Orin Nano 8 GB using five pretrained variable-spiking wavelet neural operator (VS-WNO) checkpoints and five matched dense wavelet neural operator (WNO) checkpoints on the Darcy rectangular benchmark. On a reference-aligned path, VS-WNO exhibits substantial algorithmic sparsity, with mean spike rates decreasing from 54.26% at the first spiking layer to 18.15% at the fourth. On a deployment-style request path, however, this sparsity does not reduce deployed cost: VS-WNO reaches 59.6 ms latency and 228.0 mJ dynamic energy per inference, whereas dense WNO reaches 53.2 ms and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
