A Reconfigurable Framework for AI-FPGA Agent Integration and Acceleration
Aybars Yunusoglu, Talha Coskun, Hiruna Vishwamith, Murat Isik, I. Can Dikmen

TL;DR
This paper introduces a reconfigurable AI-FPGA framework that simplifies model deployment, achieving significant latency and energy efficiency improvements for neural network inference in constrained environments.
Contribution
It proposes an agent-driven system that dynamically partitions models and manages data transfers, streamlining AI-FPGA integration and acceleration.
Findings
Over 10x latency reduction compared to CPU
2-3x higher energy efficiency than GPU
Maintains classification accuracy within 0.2% of full-precision
Abstract
Artificial intelligence (AI) is increasingly deployed in real-time and energy-constrained environments, driving demand for hardware platforms that can deliver high performance and power efficiency. While central processing units (CPUs) and graphics processing units (GPUs) have traditionally served as the primary inference engines, their general-purpose nature often leads to inefficiencies under strict latency or power budgets. Field-Programmable Gate Arrays (FPGAs) offer a promising alternative by enabling custom-tailored parallelism and hardware-level optimizations. However, mapping AI workloads to FPGAs remains challenging due to the complexity of hardware-software co-design and data orchestration. This paper presents AI FPGA Agent, an agent-driven framework that simplifies the integration and acceleration of deep neural network inference on FPGAs. The proposed system employs a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbedded Systems Design Techniques · Advanced Neural Network Applications · Advanced Memory and Neural Computing
