CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation

Daniel Silver; Ron Kimmel

arXiv:2501.00975·cs.CV·January 3, 2025

CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation

Daniel Silver, Ron Kimmel

PDF

Open Access

TL;DR

CoordFlow introduces a novel pixel-wise implicit neural representation for video compression that achieves state-of-the-art results among pixel-wise methods and matches top frame-wise techniques, while also enabling unsupervised segmentation and various video processing features.

Contribution

It presents a new pixel-wise INR model that separates visual layers for improved compression and includes motion compensation, segmentation, and multiple video enhancement capabilities.

Findings

01

Achieves state-of-the-art results among pixel-wise INRs.

02

Performs on par with leading frame-wise video compression methods.

03

Provides additional functionalities like upsampling, stabilization, inpainting, and denoising.

Abstract

In the field of video compression, the pursuit for better quality at lower bit rates remains a long-lasting goal. Recent developments have demonstrated the potential of Implicit Neural Representation (INR) as a promising alternative to traditional transform-based methodologies. Video INRs can be roughly divided into frame-wise and pixel-wise methods according to the structure the network outputs. While the pixel-based methods are better for upsampling and parallelization, frame-wise methods demonstrated better performance. We introduce CoordFlow, a novel pixel-wise INR for video compression. It yields state-of-the-art results compared to other pixel-wise INRs and on-par performance compared to leading frame-wise techniques. The method is based on the separation of the visual information into visually consistent layers, each represented by a dedicated network that compensates for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Human Pose and Action Recognition