Coding local and global binary visual features extracted from video sequences
Luca Baroffio, Antonio Canclini, Matteo Cesana, Alessandro Redondi,, Marco Tagliasacchi, Stefano Tubaro

TL;DR
This paper proposes a coding scheme for binary local and global visual features extracted from videos, optimizing bandwidth use in visual analysis tasks by exploiting redundancy and supporting the Analyze-Then-Compress paradigm.
Contribution
It introduces a novel coding method for binary features that enhances bandwidth efficiency and supports remote visual analysis in the ATC framework.
Findings
ATC with the proposed coding is competitive with CTA in bandwidth-limited scenarios.
The coding scheme effectively exploits spatial and temporal redundancy.
Experimental results demonstrate improved rate-efficiency in visual analysis tasks.
Abstract
Binary local features represent an effective alternative to real-valued descriptors, leading to comparable results for many visual analysis tasks, while being characterized by significantly lower computational complexity and memory requirements. When dealing with large collections, a more compact representation based on global features is often preferred, which can be obtained from local features by means of, e.g., the Bag-of-Visual-Word (BoVW) model. Several applications, including for example visual sensor networks and mobile augmented reality, require visual features to be transmitted over a bandwidth-limited network, thus calling for coding techniques that aim at reducing the required bit budget, while attaining a target level of efficiency. In this paper we investigate a coding scheme tailored to both local and global binary features, which aims at exploiting both spatial and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
