TL;DR
This paper introduces new binary image descriptors, BAD and HashSIFT, optimized for resource-limited devices, balancing accuracy, speed, and energy efficiency through novel training techniques.
Contribution
It presents BAD and HashSIFT descriptors derived from traditional features using triplet loss and hard negative mining, improving the accuracy-resource trade-off.
Findings
BAD is the fastest descriptor in literature.
HashSIFT achieves accuracy close to deep learning descriptors.
Proposed descriptors are more energy-efficient.
Abstract
The advent of a panoply of resource limited devices opens up new challenges in the design of computer vision algorithms with a clear compromise between accuracy and computational requirements. In this paper we present new binary image descriptors that emerge from the application of triplet ranking loss, hard negative mining and anchor swapping to traditional features based on pixel differences and image gradients. These descriptors, BAD (Box Average Difference) and HashSIFT, establish new operating points in the state-of-the-art's accuracy vs.\ resources trade-off curve. In our experiments we evaluate the accuracy, execution time and energy consumption of the proposed descriptors. We show that BAD bears the fastest descriptor implementation in the literature while HashSIFT approaches in accuracy that of the top deep learning-based descriptors, being computationally more efficient. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
