P2M: A Processing-in-Pixel-in-Memory Paradigm for Resource-Constrained TinyML Applications
Gourav Datta, Souvik Kundu, Zihan Yin, Ravi Teja Lakkireddy, Joe, Mathai, Ajey Jacob, Peter A. Beerel, Akhilesh R. Jaiswal

TL;DR
This paper introduces P2M, a novel in-memory processing paradigm integrated into pixel arrays to enable energy-efficient, bandwidth-reducing CNN processing directly within CMOS image sensors for TinyML applications.
Contribution
It presents a new pixel array design supporting analog multi-channel, multi-bit convolution, batch normalization, and ReLU, enabling in-pixel CNN layer processing as a drop-in replacement.
Findings
Reduces data transfer bandwidth by ~21x.
Lowers energy-delay product by ~11x for TinyML CNNs.
Maintains test accuracy comparable to standard methods.
Abstract
The demand to process vast amounts of data generated from state-of-the-art high resolution cameras has motivated novel energy-efficient on-device AI solutions. Visual data in such cameras are usually captured in the form of analog voltages by a sensor pixel array, and then converted to the digital domain for subsequent AI processing using analog-to-digital converters (ADC). Recent research has tried to take advantage of massively parallel low-power analog/digital computing in the form of near- and in-sensor processing, in which the AI computation is performed partly in the periphery of the pixel array and partly in a separate on-board CPU/accelerator. Unfortunately, high-resolution input images still need to be streamed between the camera and the AI processing unit, frame by frame, causing energy, bandwidth, and security bottlenecks. To mitigate this problem, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing · Neuroscience and Neural Engineering
MethodsBatch Normalization · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · 1x1 Convolution · Inverted Residual Block · High-resolution input · Average Pooling · Convolution
