Minimalist Vision with Freeform Pixels
Jeremy Klotz, Shree K. Nayar

TL;DR
This paper introduces minimalist vision systems with freeform pixels, designed as neural network layers, achieving comparable performance to traditional cameras with fewer pixels, enhancing privacy and enabling self-powered operation.
Contribution
The paper presents a novel approach to designing minimalist cameras with freeform pixels optimized via neural network training, demonstrating effective vision tasks with significantly fewer pixels.
Findings
Achieved comparable performance to traditional cameras with only 8 pixels.
Designed minimalist cameras for indoor monitoring, lighting measurement, and traffic estimation.
Showed that minimalist cameras can be self-powered due to low measurement requirements.
Abstract
A minimalist vision system uses the smallest number of pixels needed to solve a vision task. While traditional cameras use a large grid of square pixels, a minimalist camera uses freeform pixels that can take on arbitrary shapes to increase their information content. We show that the hardware of a minimalist camera can be modeled as the first layer of a neural network, where the subsequent layers are used for inference. Training the network for any given task yields the shapes of the camera's freeform pixels, each of which is implemented using a photodetector and an optical mask. We have designed minimalist cameras for monitoring indoor spaces (with 8 pixels), measuring room lighting (with 8 pixels), and estimating traffic flow (with 8 pixels). The performance demonstrated by these systems is on par with a traditional camera with orders of magnitude more pixels. Minimalist vision has…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
