Energy-Efficient Stochastic Computing (SC) Neural Networks for Internet of Things Devices With Layer-Wise Adjustable Sequence Length (ASL)

Ziheng Wang; Pedro Reviriego; Farzad Niknia; Zhen Gao; Javier Conde; Shanshan Liu; Fabrizio Lombardi

arXiv:2508.09163·cs.LG·August 14, 2025

Energy-Efficient Stochastic Computing (SC) Neural Networks for Internet of Things Devices With Layer-Wise Adjustable Sequence Length (ASL)

Ziheng Wang, Pedro Reviriego, Farzad Niknia, Zhen Gao, Javier Conde, Shanshan Liu, Fabrizio Lombardi

PDF

TL;DR

This paper introduces Adjustable Sequence Length (ASL), a mixed-precision scheme for stochastic computing neural networks that significantly reduces energy and latency in IoT devices while maintaining accuracy.

Contribution

The paper proposes a novel layer-wise mixed-precision approach called ASL for SC neural networks, supported by a theoretical model and validation methods.

Findings

01

ASL reduces energy and latency by over 60%.

02

Theoretical model accurately predicts truncation noise propagation.

03

Validation confirms negligible accuracy loss with ASL.

Abstract

Stochastic computing (SC) has emerged as an efficient low-power alternative for deploying neural networks (NNs) in resource-limited scenarios, such as the Internet of Things (IoT). By encoding values as serial bitstreams, SC significantly reduces energy dissipation compared to conventional floating-point (FP) designs; however, further improvement of layer-wise mixed-precision implementation for SC remains unexplored. This article introduces Adjustable Sequence Length (ASL), a novel scheme that applies mixed-precision concepts specifically to SC NNs. By introducing an operator-norm-based theoretical model, this article shows that truncation noise can cumulatively propagate through the layers by the estimated amplification factors. An extended sensitivity analysis is presented, using random forest (RF) regression to evaluate multilayer truncation effects and validate the alignment of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.