CADC: Crossbar-Aware Dendritic Convolution for Efficient In-memory Computing
Shuai Dong, Junyi Yang, Ye Ke, Hongyang Shang, Arindam Basu

TL;DR
CADC introduces a dendritic convolution method that embeds nonlinear functions within crossbar computations, significantly reducing partial sums and system overhead in in-memory CNN accelerators with minimal accuracy loss.
Contribution
The paper proposes CADC, a novel crossbar-aware dendritic convolution approach that increases sparsity in partial sums, reducing overhead and improving efficiency in in-memory computing architectures.
Findings
Reduces partial sums by up to 88% in various CNN models.
Achieves 29.3% buffer/transfer and 47.9% accumulation overhead reduction.
Maintains high accuracy with less than 1% degradation across models.
Abstract
Convolutional neural networks (CNNs) are computationally intensive and often accelerated using crossbar-based in-memory computing (IMC) architectures. However, large convolutional layers must be partitioned across multiple crossbars, generating numerous partial sums (psums) that require additional buffer, transfer, and accumulation, thus introducing significant system-level overhead. Inspired by dendritic computing principles from neuroscience, we propose crossbar-aware dendritic convolution (CADC), a novel approach that dramatically increases sparsity in psums by embedding a nonlinear dendritic function (zeroing negative values) directly within crossbar computations. Experimental results demonstrate that CADC significantly reduces psums, eliminating 80% in LeNet-5 on MNIST, 54% in ResNet-18 on CIFAR-10, 66% in VGG-16 on CIFAR-100, and up to 88% in spiking neural networks (SNN) on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Parallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices
