RAMAN: Resource-efficient ApproxiMate Posit Processing for Algorithm-Hardware Co-desigN
Mohd Faisal Khan, and Mukul Lokhande, and Santosh Kumar Vishvakarma

TL;DR
RAMAN introduces a resource-efficient approximate posit-based MAC architecture with hardware-software co-design, significantly reducing area and power consumption while maintaining high accuracy for edge-AI applications.
Contribution
It proposes a novel approximate posit MAC architecture and a co-design framework that enhances hardware efficiency without substantially sacrificing AI model accuracy.
Findings
Achieves up to 46% LUT savings on FPGA
Reduces area by 35.66% and power by 31.28% on ASIC
Maintains 98.45% accuracy in digit recognition
Abstract
Edge-AI applications still face considerable challenges in enhancing computational efficiency in resource-constrained environments. This work presents RAMAN, a resource-efficient and approximate posit(8,2)-based Multiply-Accumulate (MAC) architecture designed to improve hardware efficiency within bandwidth limitations. The proposed REAP (Resource-Efficient Approximate Posit) MAC engine, which is at the core of RAMAN, uses approximation in the posit multiplier to achieve significant area and power reductions with an impact on accuracy. To support diverse AI workloads, this MAC unit is incorporated in a scalable Vector Execution Unit (VEU), which permits hardware reuse and parallelism among deep neural network layers. Furthermore, we propose an algorithm-hardware co-design framework incorporating approximation-aware training to evaluate the impact of hardware-level approximation on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
