Accelerating CRONet on AMD Versal AIE-ML Engines

Kaustubh Mhatre; Vedant Tewari; Aditya Ray; Farhan Khan; Ridwan Olabiyi; Ashif Iquebal; Aman Arora

arXiv:2604.14700 · cs.AR · April 17, 2026


TL;DR

This paper presents a hardware-accelerated implementation of the CRONet neural network on AMD Versal AIE-ML engines, improving latency and energy efficiency over GPU solutions for topology optimization tasks.

Contribution

First end-to-end neural network implementation on AIE-ML that fully utilizes on-chip memory, reducing latency and energy consumption compared to GPU solutions.

Findings

01 Achieves up to a 2.49x latency improvement over GPU solutions

02 Achieves up to a 4.18x energy efficiency gain over GPU solutions

03 Demonstrates the potential of AIE-ML engines for low-latency, energy-efficient topology optimization
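As a rough illustration of how ratios like these are commonly computed, the sketch below derives a latency improvement factor and an energy efficiency gain from latency and average power. It assumes energy efficiency is compared as energy per inference (latency times average power); the paper's exact definition is not given in this summary. The function names and the numeric values are hypothetical placeholders, not measurements from the paper.

```python
def latency_improvement(baseline_latency_ms, accelerated_latency_ms):
    """Return how many times lower the accelerated latency is than the baseline."""
    return baseline_latency_ms / accelerated_latency_ms


def energy_efficiency_gain(baseline_latency_ms, baseline_power_w,
                           accelerated_latency_ms, accelerated_power_w):
    """Return the ratio of baseline to accelerated energy per inference.

    Assumption: energy per inference = latency * average power (ms * W = mJ).
    """
    baseline_energy_mj = baseline_latency_ms * baseline_power_w
    accelerated_energy_mj = accelerated_latency_ms * accelerated_power_w
    return baseline_energy_mj / accelerated_energy_mj


# Hypothetical placeholder values for illustration only (not from the paper).
gpu_latency_ms, gpu_power_w = 10.0, 60.0
aie_latency_ms, aie_power_w = 5.0, 40.0

latency_ratio = latency_improvement(gpu_latency_ms, aie_latency_ms)
energy_ratio = energy_efficiency_gain(gpu_latency_ms, gpu_power_w,
                                      aie_latency_ms, aie_power_w)
print(f"latency improvement: {latency_ratio:.2f}x")
print(f"energy efficiency gain: {energy_ratio:.2f}x")
```

Under this assumed definition, the energy gain is the latency improvement multiplied by the ratio of average powers, so an energy gain can exceed the latency gain when the accelerator also draws less power. The "up to" figures reported above may come from different configurations, so they should not be divided to infer a power ratio.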

Abstract

Topology optimization is a computational method used to determine the optimal material distribution within a prescribed design domain, aiming to minimize structural weight while satisfying load and boundary conditions. For critical infrastructure applications, such as structural health monitoring of bridges and buildings, particularly in digital twin contexts, low-latency, energy-efficient topology optimization is essential. Traditionally, topology optimization relies on finite element analysis (FEA), a computationally intensive process. Recent advances in deep neural networks (DNNs) have introduced data-driven alternatives to FEA, substantially reducing computation time while maintaining solution quality. These DNNs have complex architectures, and implementing them on inference-class GPUs results in high latency and poor energy efficiency. To address this challenge, we present a hardware…
