Hashing with Binary Matrix Pursuit

Fatih Cakir; Kun He; Stan Sclaroff

arXiv:1808.01990·cs.LG·August 7, 2018

TL;DR

This paper introduces a theoretically grounded and empirically validated two-stage hashing method that leverages residual learning and high-capacity hash functions like CNNs to produce highly accurate binary codes for image retrieval.

Contribution

The paper provides a theoretical analysis of binary code quality, shows that high-capacity hash functions such as CNNs simplify binary code inference, and proposes a new two-stage hashing method that outperforms previous approaches on image retrieval benchmarks.

Findings

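01 Under mild assumptions, a residual learning scheme can construct binary codes that fit any neighborhood structure with arbitrary accuracy.

02 With high-capacity hash functions such as CNNs, binary code inference is greatly simplified for many standard neighborhood definitions.

03 The proposed two-stage method outperforms previous hashing techniques on widely used image retrieval benchmarks.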

Abstract

We propose theoretical and empirical improvements for two-stage hashing methods. We first provide a theoretical analysis on the quality of the binary codes and show that, under mild assumptions, a residual learning scheme can construct binary codes that fit any neighborhood structure with arbitrary accuracy. Secondly, we show that with high-capacity hash functions such as CNNs, binary code inference can be greatly simplified for many standard neighborhood definitions, yielding smaller optimization problems and more robust codes. Incorporating our findings, we propose a novel two-stage hashing method that significantly outperforms previous hashing studies on widely used image retrieval benchmarks.
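
The residual learning idea in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: it assumes the neighborhood structure is given as a symmetric matrix `S` with entries +1 (neighbors) and -1 (non-neighbors), and it greedily fits `S` with a weighted sum of rank-one binary terms `a_k * b_k b_k^T`, each term fitted to the residual left by the previous ones. The function name `greedy_binary_fit`, the eigenvector-sign heuristic for choosing each `b_k`, and the toy labels are all assumptions made for illustration; the heuristic does not carry the paper's accuracy guarantee.

```python
import numpy as np

def greedy_binary_fit(S, num_bits):
    """Greedily approximate a symmetric matrix S by sum_k a_k * b_k b_k^T,
    where each b_k is a vector in {-1, +1}^n (one bit per item).

    Illustrative sketch only; not the procedure from the paper.
    """
    S = np.asarray(S, dtype=float)
    n = S.shape[0]
    residual = S.copy()
    codes = np.empty((n, num_bits))
    weights = np.empty(num_bits)
    errors = [np.linalg.norm(residual)]  # Frobenius norm of the residual

    for k in range(num_bits):
        # Heuristic (assumption): binarize the eigenvector whose eigenvalue
        # has the largest magnitude in the current residual.
        eigvals, eigvecs = np.linalg.eigh(residual)
        lead = eigvecs[:, np.argmax(np.abs(eigvals))]
        b = np.where(lead >= 0, 1.0, -1.0)

        # For a fixed b, the least-squares optimal weight is b^T R b / n^2,
        # so the residual norm can never increase.
        a = (b @ residual @ b) / (n * n)
        residual = residual - a * np.outer(b, b)

        codes[:, k] = b
        weights[k] = a
        errors.append(np.linalg.norm(residual))

    return codes, weights, errors

# Toy neighborhood structure: items are neighbors when they share a label.
rng = np.random.default_rng(0)
labels = rng.integers(0, 3, size=12)
S = np.where(labels[:, None] == labels[None, :], 1.0, -1.0)

codes, weights, errors = greedy_binary_fit(S, num_bits=8)
```

In this sketch, `errors` records the residual norm after each added bit, so it shows how the fit to `S` changes as the code length grows. A weighted inner product of two rows of `codes` then serves as the approximate similarity between the corresponding items.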

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications