CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation
Daniel Silver, Ron Kimmel

TL;DR
CoordFlow introduces a novel pixel-wise implicit neural representation for video compression that achieves state-of-the-art results among pixel-wise methods and matches top frame-wise techniques, while also enabling unsupervised segmentation and various video processing features.
Contribution
It presents a new pixel-wise INR model that separates visual layers for improved compression and includes motion compensation, segmentation, and multiple video enhancement capabilities.
Findings
Achieves state-of-the-art results among pixel-wise INRs.
Performs on par with leading frame-wise video compression methods.
Provides additional functionalities like upsampling, stabilization, inpainting, and denoising.
Abstract
In the field of video compression, the pursuit for better quality at lower bit rates remains a long-lasting goal. Recent developments have demonstrated the potential of Implicit Neural Representation (INR) as a promising alternative to traditional transform-based methodologies. Video INRs can be roughly divided into frame-wise and pixel-wise methods according to the structure the network outputs. While the pixel-based methods are better for upsampling and parallelization, frame-wise methods demonstrated better performance. We introduce CoordFlow, a novel pixel-wise INR for video compression. It yields state-of-the-art results compared to other pixel-wise INRs and on-par performance compared to leading frame-wise techniques. The method is based on the separation of the visual information into visually consistent layers, each represented by a dedicated network that compensates for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Human Pose and Action Recognition
