CADC: Crossbar-Aware Dendritic Convolution for Efficient In-memory Computing

Shuai Dong; Junyi Yang; Ye Ke; Hongyang Shang; Arindam Basu

arXiv:2511.22166·cs.AR·December 1, 2025

CADC: Crossbar-Aware Dendritic Convolution for Efficient In-memory Computing

Shuai Dong, Junyi Yang, Ye Ke, Hongyang Shang, Arindam Basu

PDF

Open Access

TL;DR

CADC introduces a dendritic convolution method that embeds nonlinear functions within crossbar computations, significantly reducing partial sums and system overhead in in-memory CNN accelerators with minimal accuracy loss.

Contribution

The paper proposes CADC, a novel crossbar-aware dendritic convolution approach that increases sparsity in partial sums, reducing overhead and improving efficiency in in-memory computing architectures.

Findings

01

Reduces partial sums by up to 88% in various CNN models.

02

Achieves 29.3% buffer/transfer and 47.9% accumulation overhead reduction.

03

Maintains high accuracy with less than 1% degradation across models.

Abstract

Convolutional neural networks (CNNs) are computationally intensive and often accelerated using crossbar-based in-memory computing (IMC) architectures. However, large convolutional layers must be partitioned across multiple crossbars, generating numerous partial sums (psums) that require additional buffer, transfer, and accumulation, thus introducing significant system-level overhead. Inspired by dendritic computing principles from neuroscience, we propose crossbar-aware dendritic convolution (CADC), a novel approach that dramatically increases sparsity in psums by embedding a nonlinear dendritic function (zeroing negative values) directly within crossbar computations. Experimental results demonstrate that CADC significantly reduces psums, eliminating 80% in LeNet-5 on MNIST, 54% in ResNet-18 on CIFAR-10, 66% in VGG-16 on CIFAR-100, and up to 88% in spiking neural networks (SNN) on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Parallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices