Kratos: An FPGA Benchmark for Unrolled DNNs with Fine-Grained Sparsity and Mixed Precision
Xilai Dai, Yuzong Chen, Mohamed S. Abdelfattah

TL;DR
Kratos is an FPGA benchmark designed for unrolled DNNs with fine-grained sparsity and mixed precision, demonstrating high-frequency operation and significant area reduction, enabling more efficient FPGA-based DNN inference.
Contribution
The paper introduces Kratos, a benchmark for unrolled DNN primitives on FPGAs, and shows how architectural tailoring can improve efficiency for sparse, low-precision DNNs.
Findings
Unrolled DNNs can operate at near maximum FPGA frequencies.
Fine-grained sparsity and low bit-width significantly reduce area.
Architectural modifications can achieve ~2× area reduction.
Abstract
FPGAs offer a flexible platform for accelerating deep neural network (DNN) inference, particularly for non-uniform workloads featuring fine-grained unstructured sparsity and mixed arithmetic precision. To leverage these redundancies, an emerging approach involves partially or fully unrolling computations for each DNN layer. That way, parameter-level and bit-level ineffectual operations can be completely skipped, thus saving the associated area and power. Regardless, unrolled implementations scale poorly and limit the size of a DNN that can be unrolled on an FPGA. This motivates the investigation of new reconfigurable architectures to improve the efficiency of unrolled DNNs, while taking advantage of sparsity and mixed precision. To enable this, we present Kratos: a focused FPGA benchmark of unrolled DNN primitives with varying levels of sparsity and different arithmetic precisions. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Brain Tumor Detection and Classification
