Hardware-Aware Neural Network Compilation with Learned Optimization: A RISC-V Accelerator Approach
Ravindra Ganti, Steve Xu

TL;DR
This paper introduces XgenSilicon, an automated compiler that transforms high-level ML models into optimized RISC-V assembly for ASICs, significantly improving power, performance, and area metrics through innovative auto-tuning, quantization, and hardware-aware validation.
Contribution
The paper presents a comprehensive, fully automated compilation framework that unifies software and hardware cost models, enabling efficient ML model deployment on custom RISC-V ASIC accelerators with multiple innovations.
Findings
ASICs achieve 2.5-4.5x better performance
Power consumption reduced by 3-6x
Area reduced by 40-60%
Abstract
We present XgenSilicon ML Compiler, a fully automated end-to-end compilation framework that transforms high-level machine learning models into optimized RISC-V assembly code for custom ASIC accelerators. By unifying the system's cost model across software and hardware, the compiler achieves significant improvements in Power, Performance, and Area (PPA) metrics compared to standard off-the-shelf components and hand-designed chips through five key innovations: (1) a multi-algorithm auto-tuning framework with five search strategies (Bayesian Optimization, Genetic Algorithm, Simulated Annealing, Random Search, Grid Search) combined with a learned cost model, (2) an integrated quantization framework supporting extreme precisions from FP32 to Binary with full KL divergence calibration (2048-bin histogram optimization) and momentum-based QAT gradient updates, (3) hardware-aware validation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Embedded Systems Design Techniques
