End-to-End Learnable Multi-Scale Feature Compression for VCM

Yeongwoong Kim; Hyewon Jeong; Janghyun Yu; Younhee Kim; Jooyoung Lee,; Se Yoon Jeong; and Hui Yong Kim

arXiv:2306.16670·cs.CV·August 9, 2023·1 cites

End-to-End Learnable Multi-Scale Feature Compression for VCM

Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee,, Se Yoon Jeong, and Hui Yong Kim

PDF

Open Access

TL;DR

This paper introduces an end-to-end learnable multi-scale feature compression method optimized for machine vision, significantly reducing bitrate and encoding time compared to previous approaches.

Contribution

It proposes a novel integrated compression and fusion model that enables end-to-end training and lightweight encoding for multi-scale features in VCM.

Findings

01

Achieves at least 52% BD-rate reduction over previous methods.

02

Reduces encoding time by 5 to 27 times.

03

Outperforms existing approaches in feature compression efficiency.

Abstract

The proliferation of deep learning-based machine vision applications has given rise to a new type of compression, so called video coding for machine (VCM). VCM differs from traditional video coding in that it is optimized for machine vision performance instead of human visual quality. In the feature compression track of MPEG-VCM, multi-scale features extracted from images are subject to compression. Recent feature compression works have demonstrated that the versatile video coding (VVC) standard-based approach can achieve a BD-rate reduction of up to 96% against MPEG-VCM feature anchor. However, it is still sub-optimal as VVC was not designed for extracted features but for natural images. Moreover, the high encoding complexity of VVC makes it difficult to design a lightweight encoder without sacrificing performance. To address these challenges, we propose a novel multi-scale feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Data Compression Techniques · Video Surveillance and Tracking Methods