Learned Scalable Video Coding For Humans and Machines
Hadi Hadizadeh, Ivan V. Baji\'c

TL;DR
This paper presents a novel end-to-end learnable scalable video codec that efficiently supports both machine vision tasks and human viewing, outperforming existing codecs in compression for machine analysis while maintaining quality for human perception.
Contribution
The authors introduce a scalable video coding framework that integrates machine and human viewing capabilities within a single learnable system, utilizing conditional coding for improved compression.
Findings
Outperforms state-of-the-art learned codecs in machine vision tasks.
Maintains comparable quality for human viewing as traditional codecs.
Demonstrates effectiveness across four standard video datasets.
Abstract
Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep neural networks (DNNs), encoded video is increasingly being used for automatic video analytics performed by machines. In applications such as automatic traffic monitoring, analytics such as vehicle detection, tracking and counting, would run continuously, while human viewing could be required occasionally to review potential incidents. To support such applications, a new paradigm for video coding is needed that will facilitate efficient representation and compression of video for both machine and human use in a scalable manner. In this manuscript, we introduce an end-to-end learnable video codec that supports a machine vision task in its base layer, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Data Compression Techniques · Advanced Image Processing Techniques
MethodsBalanced Selection
