MING: An Automated CNN-to-Edge MLIR HLS framework

Jiahong Bi; Lars Sch\"utze; Jeronimo Castrillon

arXiv:2602.11966·cs.AR·February 13, 2026

MING: An Automated CNN-to-Edge MLIR HLS framework

Jiahong Bi, Lars Sch\"utze, Jeronimo Castrillon

PDF

Open Access

TL;DR

MING is an MLIR-based framework that automates FPGA design for CNNs, optimizing resource use and latency, achieving significant speedups over existing frameworks especially for resource-constrained edge devices.

Contribution

MING introduces a resource-aware, MLIR-based automation framework for FPGA CNN design, improving efficiency and latency handling compared to prior methods.

Findings

01

Achieves 15x speedup for 4-layer CNN kernels

02

Achieves 200x speedup for single-layer kernels

03

Effectively handles resource constraints for large input kernels

Abstract

Driven by the increasing demand for low-latency and real-time processing, machine learning applications are steadily migrating toward edge computing platforms, where Field-Programmable Gate Arrays (FPGAs) are widely adopted for their energy efficiency compared to CPUs and GPUs. To generate high-performance and low-power FPGA designs, several frameworks built upon High Level Synthesis (HLS) vendor tools have been proposed, among which MLIR-based frameworks are gaining significant traction due to their extensibility and ease of use. However, existing state-of-the-art frameworks often overlook the stringent resource constraints of edge devices. To address this limitation, we propose MING, an Multi-Level Intermediate Representation (MLIR)-based framework that abstracts and automates the HLS design process. Within this framework, we adopt a streaming architecture with carefully managed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Embedded Systems Design Techniques · Parallel Computing and Optimization Techniques