Near-Isometric Binary Hashing for Large-scale Datasets

Amirali Aghazadeh; Andrew Lan; Anshumali Shrivastava; Richard Baraniuk

arXiv:1603.03836 · cs.DS · March 15, 2016

Open Access

TL;DR

This paper introduces Near-Isometric Binary Hashing (NIBH), a scalable data-dependent hashing method that minimizes worst-case distortion to improve large-scale dataset indexing and retrieval performance.

Contribution

The paper proposes a hashing scheme based on worst-case ($\ell_{\infty}$-norm) distortion minimization and develops an efficient column-generation algorithm that outperforms ten state-of-the-art binary hashing schemes and scales to large datasets.

Findings

01

NIBH achieves superior distance, ranking, and near-neighbor preservation.

02

NIBH outperforms ten state-of-the-art hashing schemes.

03

The column-generation-based algorithm scales well to large datasets.

Abstract

We develop a scalable algorithm to learn binary hash codes for indexing large-scale datasets. Near-isometric binary hashing (NIBH) is a data-dependent hashing scheme that quantizes the output of a learned low-dimensional embedding to obtain a binary hash code. In contrast to conventional hashing schemes, which typically rely on an $\ell_{2}$-norm (i.e., average distortion) minimization, NIBH is based on an $\ell_{\infty}$-norm (i.e., worst-case distortion) minimization that provides several benefits, including superior distance, ranking, and near-neighbor preservation performance. We develop a practical and efficient algorithm for NIBH based on column generation that scales well to large datasets. A range of experimental evaluations demonstrates the superiority of NIBH over ten state-of-the-art binary hashing schemes.
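To make the distinction between average and worst-case distortion concrete, here is a minimal Python sketch. It is not the NIBH algorithm: the projection matrix `W` is random rather than learned, the sign quantizer, the least-squares `scale`, and all function names are assumptions made for illustration, and the distortion is taken to be the gap between scaled Hamming distance and Euclidean distance over all pairs of points. It only shows how an $\ell_{2}$-style (root-mean-square) summary and an $\ell_{\infty}$-style (maximum) summary of the same per-pair gaps can be computed.

```python
import numpy as np

def binary_codes(X, W):
    """Quantize the linear embedding X @ W to 0/1 codes by thresholding at zero."""
    return (X @ W > 0).astype(np.uint8)

def pairwise_hamming(B):
    """Hamming distance between every pair of rows of a 0/1 code matrix."""
    B = B.astype(np.int64)
    # Count positions where both codes are 1, plus positions where both are 0.
    agree = B @ B.T + (1 - B) @ (1 - B).T
    return B.shape[1] - agree

def pairwise_euclidean(X):
    """Euclidean distance between every pair of rows of X."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.sqrt(np.maximum(d2, 0.0))  # clip tiny negatives from round-off

def distortions(X, B):
    """Return (average, worst-case) distortion over all distinct pairs.

    Assumption: distortion of a pair is scale * Hamming - Euclidean, with
    `scale` chosen by least squares. The paper may define this differently.
    """
    iu = np.triu_indices(X.shape[0], k=1)
    h = pairwise_hamming(B)[iu].astype(float)
    d = pairwise_euclidean(X)[iu]
    scale = (h @ d) / (h @ h)
    gap = scale * h - d
    avg = np.sqrt(np.mean(gap ** 2))   # l2-style: root-mean-square gap
    worst = np.max(np.abs(gap))        # l_inf-style: largest gap over all pairs
    return avg, worst

rng = np.random.default_rng(0)
X = rng.standard_normal((50, 16))      # 50 synthetic points in 16 dimensions
W = rng.standard_normal((16, 32))      # random stand-in for a learned embedding
B = binary_codes(X, W)                 # 32-bit codes
avg, worst = distortions(X, B)
print(f"average distortion: {avg:.3f}, worst-case distortion: {worst:.3f}")
```

The worst-case value is never smaller than the average one, since a maximum bounds a root-mean-square from above. A scheme that minimizes the $\ell_{\infty}$ quantity therefore controls the single worst-preserved pair, whereas an $\ell_{2}$ objective can leave a few pairs badly distorted while still reporting a small average.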

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Algorithms and Data Compression