TL;DR
This paper introduces DeepBinaryMask, a neural network that learns an optimized binary sensing mask for video compressive sensing, improving reconstruction quality over traditional masks.
Contribution
It presents an end-to-end neural network model that jointly learns the sensing mask and video reconstruction, a novel approach in video compressive sensing.
Findings
Trained sensing masks outperform random masks in reconstruction quality.
The learned masks are binary and optimized for better performance.
Insights into mask characteristics that enhance reconstruction are provided.
Abstract
In this paper, we propose a novel encoder-decoder neural network model referred to as DeepBinaryMask for video compressive sensing. In video compressive sensing one frame is acquired using a set of coded masks (sensing matrix) from which a number of video frames is reconstructed, equal to the number of coded masks. The proposed framework is an end-to-end model where the sensing matrix is trained along with the video reconstruction. The encoder learns the binary elements of the sensing matrix and the decoder is trained to recover the unknown video sequence. The reconstruction performance is found to improve when using the trained sensing mask from the network as compared to other mask designs such as random, across a wide variety of compressive sensing reconstruction algorithms. Finally, our analysis and discussion offers insights into understanding the characteristics of the trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
