MING: An Automated CNN-to-Edge MLIR HLS framework
Jiahong Bi, Lars Sch\"utze, Jeronimo Castrillon

TL;DR
MING is an MLIR-based framework that automates FPGA design for CNNs, optimizing resource use and latency, achieving significant speedups over existing frameworks especially for resource-constrained edge devices.
Contribution
MING introduces a resource-aware, MLIR-based automation framework for FPGA CNN design, improving efficiency and latency handling compared to prior methods.
Findings
Achieves 15x speedup for 4-layer CNN kernels
Achieves 200x speedup for single-layer kernels
Effectively handles resource constraints for large input kernels
Abstract
Driven by the increasing demand for low-latency and real-time processing, machine learning applications are steadily migrating toward edge computing platforms, where Field-Programmable Gate Arrays (FPGAs) are widely adopted for their energy efficiency compared to CPUs and GPUs. To generate high-performance and low-power FPGA designs, several frameworks built upon High Level Synthesis (HLS) vendor tools have been proposed, among which MLIR-based frameworks are gaining significant traction due to their extensibility and ease of use. However, existing state-of-the-art frameworks often overlook the stringent resource constraints of edge devices. To address this limitation, we propose MING, an Multi-Level Intermediate Representation (MLIR)-based framework that abstracts and automates the HLS design process. Within this framework, we adopt a streaming architecture with carefully managed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Embedded Systems Design Techniques · Parallel Computing and Optimization Techniques
