Hybrid coding of visual content and local image features
Luca Baroffio, Matteo Cesana, Alessandro Redondi, Marco Tagliasacchi,, Stefano Tubaro

TL;DR
This paper introduces Hybrid-Analyze-Then-Compress (HATC), a method that jointly encodes visual content and local features to optimize bandwidth use and task accuracy in visual analysis applications.
Contribution
It proposes a novel hybrid coding scheme that balances content reconstruction and feature transmission, improving efficiency over traditional methods.
Findings
Effective bitrate allocation improves task accuracy
Hybrid approach outperforms pure content or feature coding
Tradeoff between image quality and analysis performance
Abstract
Distributed visual analysis applications, such as mobile visual search or Visual Sensor Networks (VSNs) require the transmission of visual content on a bandwidth-limited network, from a peripheral node to a processing unit. Traditionally, a Compress-Then-Analyze approach has been pursued, in which sensing nodes acquire and encode the pixel-level representation of the visual content, that is subsequently transmitted to a sink node in order to be processed. This approach might not represent the most effective solution, since several analysis applications leverage a compact representation of the content, thus resulting in an inefficient usage of network resources. Furthermore, coding artifacts might significantly impact the accuracy of the visual task at hand. To tackle such limitations, an orthogonal approach named Analyze-Then-Compress has been proposed. According to such a paradigm,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
