A Reconfigurable Framework for AI-FPGA Agent Integration and Acceleration

Aybars Yunusoglu; Talha Coskun; Hiruna Vishwamith; Murat Isik; I. Can Dikmen

arXiv:2601.19263 · cs.AR · January 28, 2026

TL;DR

This paper introduces a reconfigurable AI-FPGA framework that simplifies model deployment for neural network inference in constrained environments, reporting over 10x lower latency than a CPU and 2-3x higher energy efficiency than a GPU.

Contribution

It proposes an agent-driven system that dynamically partitions models and manages data transfers, streamlining AI-FPGA integration and acceleration.
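
The partitioning idea can be illustrated with a minimal sketch. This is not the paper's algorithm: the `Layer` fields, the `partition` function, the per-layer latency estimates, and the `transfer_ms_per_kb` cost are all hypothetical, chosen only to show how a scheduler might weigh compute time against the cost of moving activations between CPU and FPGA. The sketch uses dynamic programming over a linear chain of layers and ignores the cost of the initial input transfer.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    name: str
    cpu_ms: float   # hypothetical estimated latency on the CPU
    fpga_ms: float  # hypothetical estimated latency on the FPGA
    out_kb: float   # size of this layer's output activation


def partition(layers, transfer_ms_per_kb=0.01):
    """Return (total_ms, placement) minimizing compute plus transfer latency.

    Assumes a non-empty, strictly sequential list of layers.
    """
    devices = ("cpu", "fpga")
    first = layers[0]
    # best[d] = (cost, placement) for the prefix whose last layer runs on d
    best = {"cpu": (first.cpu_ms, ["cpu"]), "fpga": (first.fpga_ms, ["fpga"])}
    for prev, layer in zip(layers, layers[1:]):
        step = {}
        for d in devices:
            compute = layer.cpu_ms if d == "cpu" else layer.fpga_ms
            candidates = []
            for p in devices:
                cost, placement = best[p]
                # Switching devices pays for moving the previous output.
                transfer = 0.0 if p == d else prev.out_kb * transfer_ms_per_kb
                candidates.append((cost + transfer + compute, placement + [d]))
            step[d] = min(candidates, key=lambda c: c[0])
        best = step
    return min(best.values(), key=lambda c: c[0])


# Illustrative numbers only; they are not measurements from the paper.
example = [
    Layer("conv1", cpu_ms=8.0, fpga_ms=1.0, out_kb=100),
    Layer("conv2", cpu_ms=6.0, fpga_ms=0.5, out_kb=50),
    Layer("softmax", cpu_ms=0.1, fpga_ms=2.0, out_kb=1),
]
total_ms, placement = partition(example)
print(placement, total_ms)
```

With these made-up costs, the two convolution layers land on the FPGA and the small final layer falls back to the CPU, because the transfer cost is lower than the FPGA's assumed penalty on that layer.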

Findings

01 Over 10x latency reduction compared to CPU

02 2-3x higher energy efficiency than GPU

03 Maintains classification accuracy within 0.2% of full-precision

Abstract

Artificial intelligence (AI) is increasingly deployed in real-time and energy-constrained environments, driving demand for hardware platforms that can deliver high performance and power efficiency. While central processing units (CPUs) and graphics processing units (GPUs) have traditionally served as the primary inference engines, their general-purpose nature often leads to inefficiencies under strict latency or power budgets. Field-Programmable Gate Arrays (FPGAs) offer a promising alternative by enabling custom-tailored parallelism and hardware-level optimizations. However, mapping AI workloads to FPGAs remains challenging due to the complexity of hardware-software co-design and data orchestration. This paper presents AI FPGA Agent, an agent-driven framework that simplifies the integration and acceleration of deep neural network inference on FPGAs. The proposed system employs a…

Taxonomy

Topics: Embedded Systems Design Techniques · Advanced Neural Network Applications · Advanced Memory and Neural Computing