PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors
Haley M. So, Laurie Bose, Piotr Dudek, and Gordon Wetzstein

TL;DR
PixelRNN is a novel in-sensor neural network architecture that encodes spatio-temporal features directly on the sensor using binary operations, significantly reducing data transmission for perception tasks on edge devices.
Contribution
It introduces PixelRNN, an efficient recurrent neural network design that leverages sensor programmability to perform in-sensor processing with minimal data transmission.
Findings
Reduces data transmission by 64x compared to traditional systems.
Achieves competitive accuracy in gesture recognition and lip reading.
Validated on SCAMP-5 sensor-processor platform.
Abstract
Conventional image sensors digitize high-resolution images at fast frame rates, producing a large amount of data that needs to be transmitted off the sensor for further processing. This is challenging for perception systems operating on edge devices, because communication is power inefficient and induces latency. Fueled by innovations in stacked image sensor fabrication, emerging sensor-processors offer programmability and minimal processing capabilities directly on the sensor. We exploit these capabilities by developing an efficient recurrent neural network architecture, PixelRNN, that encodes spatio-temporal features on the sensor using purely binary operations. PixelRNN reduces the amount of data to be transmitted off the sensor by a factor of 64x compared to conventional systems while offering competitive accuracy for hand gesture recognition and lip reading tasks. We experimentally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Neural Network Applications · Advanced Memory and Neural Computing
MethodsMasked Convolution · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Pixel Recurrent Neural Network
