CGRA4ML: A Hardware/Software Framework to Implement Neural Networks for Scientific Edge Computing
G Abarajithan, Zhenghua Ma, Ravidu Munasinghe, Francesco Restuccia, and Ryan Kastner

TL;DR
cgra4ml is an open-source framework that enables the design and deployment of customizable neural network accelerators on scientific edge computing devices, simplifying hardware integration and optimization.
Contribution
It introduces a modular, full-stack infrastructure for generating parameterizable CGRA accelerators tailored to scientific ML applications, supporting ASIC and FPGA flows.
Findings
Successfully implemented scientific edge neural networks using ASIC and FPGA flows.
Provides a comprehensive, easy-to-use toolchain including hardware, software, and verification components.
Abstract
The scientific community increasingly relies on machine learning (ML) for near-sensor processing, leveraging its strengths in tasks such as pattern recognition, anomaly detection, and real-time decision-making. These deployments demand accelerators that combine extremely high performance with programmability, ease of integration, and straightforward verification. We present cgra4ml, an open-source, modular framework that generates parameterizable CGRA accelerators in synthesizable SystemVerilog RTL, tailored to common ML compute patterns found in scientific applications. The framework supports seamless system integration through AXI-compliant interfaces and open-source DMA components, and it includes automatic firmware generation for programming the accelerator. A comprehensive verification suite and a runtime firmware stack further support deployment across diverse SoC platforms.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
