Accelerating CRONet on AMD Versal AIE-ML Engines
Kaustubh Mhatre, Vedant Tewari, Aditya Ray, Farhan Khan, Ridwan Olabiyi, Ashif Iquebal, Aman Arora

TL;DR
This paper presents a hardware-accelerated implementation of CRONet neural network on AMD Versal AIE-ML engines, significantly improving latency and energy efficiency for topology optimization tasks.
Contribution
First end-to-end neural network implementation on AIE-ML that fully utilizes on-chip memory, reducing latency and energy consumption compared to GPU solutions.
Findings
Achieves up to 2.49x latency improvement
Achieves up to 4.18x energy efficiency gain
Demonstrates potential for low-latency, energy-efficient topology optimization
Abstract
Topology optimization is a computational method used to determine the optimal material distribution within a prescribed design domain, aiming to minimize structural weight while satisfying load and boundary conditions. For critical infrastructure applications, such as structural health monitoring of bridges and buildings, particularly in digital twin contexts, low-latency energy-efficient topology optimization is essential. Traditionally, topology optimization relies on finite element analysis (FEA), a computationally intensive process. Recent advances in deep neural networks (DNNs) have introduced data driven alternatives to FEA, substantially reducing computation time while maintaining solution quality. These DNNs have complex architectures and implementing them on inference-class GPUs results in high latency and poor energy efficiency. To address this challenge, we present a hardware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
