SIRA: Scaled-Integer Range Analysis for Optimizing FPGA Dataflow Neural Network Accelerators

Yaman Umuroglu; Christoph Berganski; Felix Jentzsch; Michal Danilowicz; Tomasz Kryjak; Charalampos Bezaitis; Magnus Sjalander; Ian Colbert; Thomas Preusser; Jakoba Petri-Koenig; Michaela Blott

arXiv:2508.21493·cs.AR·September 1, 2025

SIRA: Scaled-Integer Range Analysis for Optimizing FPGA Dataflow Neural Network Accelerators

Yaman Umuroglu, Christoph Berganski, Felix Jentzsch, Michal Danilowicz, Tomasz Kryjak, Charalampos Bezaitis, Magnus Sjalander, Ian Colbert, Thomas Preusser, Jakoba Petri-Koenig, Michaela Blott

PDF

Open Access

TL;DR

SIRA is a static analysis technique that optimizes FPGA neural network accelerators by tailoring precision and reducing resource usage, leading to significant improvements in LUTs, DSPs, and accumulator bitwidths.

Contribution

The paper introduces SIRA, a novel static analysis method for quantized neural networks that enables resource-efficient FPGA accelerator design through precision tailoring.

Findings

01

17% reduction in LUTs

02

66% reduction in DSPs

03

22% reduction in accumulator bitwidths

Abstract

While neural network quantization effectively reduces the cost of matrix multiplications, aggressive quantization can expose non-matrix-multiply operations as significant performance and resource bottlenecks on embedded systems. Addressing such bottlenecks requires a comprehensive approach to tailoring the precision across operations in the inference computation. To this end, we introduce scaled-integer range analysis (SIRA), a static analysis technique employing interval arithmetic to determine the range, scale, and bias for tensors in quantized neural networks. We show how this information can be exploited to reduce the resource footprint of FPGA dataflow neural network accelerators via tailored bitwidth adaptation for accumulators and downstream operations, aggregation of scales and biases, and conversion of consecutive elementwise operations to thresholding operations. We integrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms · Advanced Neural Network Applications · Low-power high-performance VLSI design