TetrisG-SDK: Efficient Convolutional Layer Mapping with Adaptive Windows and Grouped Convolutions for Fast In-Memory Computing

Ke Dong; Kejie Huang; Tao Luo; Bo Wang

arXiv:2604.25377·cs.AR·April 29, 2026

TetrisG-SDK: Efficient Convolutional Layer Mapping with Adaptive Windows and Grouped Convolutions for Fast In-Memory Computing

Ke Dong, Kejie Huang, Tao Luo, Bo Wang

PDF

TL;DR

TetrisG-SDK is a new framework that optimizes convolutional layer mapping on compute-in-memory hardware using adaptive windows and grouped convolutions, significantly improving speed and energy efficiency.

Contribution

It introduces adaptive windows and grouped convolution techniques to enhance mapping performance and inter-macro parallelism in CIM hardware.

Findings

01

Achieves 1.2x to 1.3x speed-up on various CNN models.

02

Reduces system latency and energy consumption by up to 2.4x and 1.7x.

03

Decreases EDAP by up to 70% with macro-level parallelism.

Abstract

Shifted-and-Duplicated-Kernel (SDK) mapping has emerged as an effective strategy to accelerate convolutional layers on compute-in-memory (CIM) hardware. However, existing SDK variants (e.g., VWC-SDK) merely optimize mapping for a single CIM macro, leaving inter-macro parallelism unexplored. Moreover, their mapping methodologies are still suboptimal. To address these limitations, we present TetrisG-SDK, a novel framework that employs adaptive windows to boost mapping performance. The proposed windows accommodate more input channels, increase array utilization at marginal space, and adapt to different channel depths. More importantly, TetrisG-SDK reduces compute latency by searching for optimal window configurations across multiple CIM macros with a fixed hardware budget. Besides, it incorporates grouped convolution to further decrease computing cycles while maintaining near-lossless…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.