A Hybrid-Domain Floating-Point Compute-in-Memory Architecture for Efficient Acceleration of High-Precision Deep Neural Networks

Zhiqiang Yi; Yiwen Liang; Weidong Cao

arXiv:2502.07212·cs.AR·February 12, 2025

Open Access

TL;DR

This paper presents a hybrid analog-digital compute-in-memory (CIM) architecture for accelerating high-precision, floating-point deep neural networks. It targets the substantial power consumption of digital-only floating-point CIM designs, and circuit-level simulations indicate notable energy efficiency with lossless accuracy.

Contribution

It introduces a hybrid-domain CIM architecture that integrates analog and digital CIM within the same memory cell to accelerate high-precision DNNs.

Findings

01. Demonstrates high energy efficiency through circuit-level simulations

02. Achieves lossless accuracy on benchmark tests

03. Develops area-efficient and energy-efficient ADC techniques

Abstract

Compute-in-memory (CIM) has shown significant potential in efficiently accelerating deep neural networks (DNNs) at the edge, particularly in speeding up quantized models for inference applications. Recently, there has been growing interest in developing floating-point-based CIM macros to improve the accuracy of high-precision DNN models, including both inference and training tasks. Yet, current implementations rely primarily on digital methods, leading to substantial power consumption. This paper introduces a hybrid domain CIM architecture that integrates analog and digital CIM within the same memory cell to efficiently accelerate high-precision DNNs. Specifically, we develop area-efficient circuits and energy-efficient analog-to-digital conversion techniques to realize this architecture. Comprehensive circuit-level simulations reveal the notable energy efficiency and lossless accuracy…
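The abstract does not describe how the floating-point work is divided between the analog and digital domains, so the following is only a generic illustration and not the authors' circuit. It assumes a common floating-point CIM pattern: mantissas in a block are aligned to a shared exponent, the integer multiply-accumulate stands in for the in-memory array, and a uniform quantizer stands in for the analog-to-digital converter. All function names and parameters (`align_mantissas`, `adc_quantize`, `hybrid_dot`, `mant_bits`, `adc_bits`) are hypothetical.

```python
import math


def align_mantissas(values, mant_bits=8):
    """Return signed integer mantissas sharing one exponent (assumed scheme)."""
    # math.frexp(v) gives (m, e) with v == m * 2**e and 0.5 <= |m| < 1.
    max_exp = max((math.frexp(v)[1] for v in values if v != 0.0), default=0)
    scale = 2.0 ** (mant_bits - max_exp)
    # Each value is approximated as mantissa * 2**(max_exp - mant_bits).
    mantissas = [int(round(v * scale)) for v in values]
    return mantissas, max_exp


def adc_quantize(x, full_scale, adc_bits):
    """Uniform ADC model: clip to +/- full_scale, round to 2**adc_bits steps."""
    step = 2.0 * full_scale / (2 ** adc_bits)
    clipped = max(-full_scale, min(full_scale, x))
    return round(clipped / step) * step


def hybrid_dot(weights, inputs, mant_bits=8, adc_bits=10):
    """Dot product with exponent alignment and an ADC-quantized accumulation."""
    w_m, w_e = align_mantissas(weights, mant_bits)
    x_m, x_e = align_mantissas(inputs, mant_bits)
    # Integer mantissa MAC: the stand-in for the in-memory accumulation.
    acc = sum(w * x for w, x in zip(w_m, x_m))
    # Worst-case magnitude of the accumulated sum sets the ADC range.
    full_scale = len(weights) * (2 ** mant_bits) ** 2
    acc_q = adc_quantize(acc, full_scale, adc_bits)
    # Undo both mantissa scalings to return to the real-valued result.
    return acc_q * 2.0 ** (w_e + x_e - 2 * mant_bits)


if __name__ == "__main__":
    w = [0.5, -1.25, 2.0, 0.75]
    x = [1.0, 0.5, -0.25, 4.0]
    exact = sum(a * b for a, b in zip(w, x))
    for bits in (6, 10, 20):
        print(bits, hybrid_dot(w, x, adc_bits=bits), exact)
```

In this toy model the error of the result shrinks as `adc_bits` grows, which is one way to see why converter resolution and energy are a central trade-off in analog or hybrid CIM designs.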

Taxonomy

Topics: Parallel Computing and Optimization Techniques · Neural Networks and Reservoir Computing · Neural Networks and Applications