Near-Optimal Sample Complexity Bounds for Circulant Binary Embedding

Samet Oymak

arXiv:1603.03178·cs.DS·March 15, 2016

TL;DR

This paper gives near-optimal sample complexity bounds for binary embedding with circulant matrices: $N$ points in $\mathbb{R}^{n}$ can be embedded into the Hamming cube $\{\pm 1\}^{k}$ with distortion $\delta$ using $k \sim \delta^{-3} \log N$ samples.

Contribution

The paper provides the first near-optimal sample complexity bounds for circulant binary embedding: the bound is optimal in the number of points $N$ and compares well with the optimal $\delta^{-2}$ dependence on the distortion.

Findings

01

Embedding $N$ points into the Hamming cube $\{\pm 1\}^{k}$ with distortion $\delta$ using $k \sim \delta^{-3} \log N$ samples is optimal in the number of points $N$.

02

The optimal embedding result holds when $\log N \lesssim n^{1/3}$.

03

Most points can be embedded with optimal distortion when $\log N \lesssim \sqrt{n}$.

Abstract

Binary embedding is the problem of mapping points from a high-dimensional space to a Hamming cube in lower dimension while preserving pairwise distances. An efficient way to accomplish this is to make use of fast embedding techniques involving the Fourier transform, e.g., circulant matrices. While binary embedding has been studied extensively, theoretical results on fast binary embedding are rather limited. In this work, we build upon the recent literature to obtain significantly better dependencies on the problem parameters. A set of $N$ points in $\mathbb{R}^{n}$ can be properly embedded into the Hamming cube $\{\pm 1\}^{k}$ with $\delta$ distortion, by using $k \sim \delta^{-3} \log N$ samples, which is optimal in the number of points $N$ and compares well with the optimal distortion dependency $\delta^{-2}$. Our optimal embedding result applies in the regime $\log N \lesssim n^{1/3}$. Furthermore,…
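To make the idea of a fast, Fourier-based binary embedding concrete, the following is a minimal NumPy sketch, not the paper's exact construction. It assumes a circulant matrix with a Gaussian first column, a random sign flip applied to each coordinate beforehand, and a uniformly random choice of $k$ of the $n$ output coordinates. The function names `circulant_binary_embedding` and `normalized_hamming` are hypothetical. The demo compares the normalized Hamming distance between two embedded points with their angular distance $\theta/\pi$, which is the quantity that sign-based embeddings are usually expected to preserve.

```python
import numpy as np


def circulant_binary_embedding(X, k, rng):
    """Map the rows of X (shape N x n) to {-1, +1}^k with a random circulant matrix.

    Assumptions of this sketch: Gaussian first column, random sign flips,
    and k output coordinates chosen uniformly at random.
    """
    n = X.shape[1]
    h = rng.standard_normal(n)           # first column of the circulant matrix
    d = rng.choice([-1.0, 1.0], size=n)  # random sign flip per input coordinate
    # Multiplying by a circulant matrix is a circular convolution,
    # so it costs O(n log n) per point when done with the FFT.
    spectrum = np.fft.fft(h) * np.fft.fft(X * d, axis=1)
    proj = np.fft.ifft(spectrum, axis=1).real
    rows = rng.choice(n, size=k, replace=False)  # keep k of the n coordinates
    return np.where(proj[:, rows] >= 0, 1, -1)


def normalized_hamming(a, b):
    """Fraction of coordinates in which two sign vectors differ."""
    return float(np.mean(a != b))


rng = np.random.default_rng(0)
n, k = 4096, 1024

# Two random unit vectors in R^n.
x = rng.standard_normal(n)
x /= np.linalg.norm(x)
y = rng.standard_normal(n)
y /= np.linalg.norm(y)

B = circulant_binary_embedding(np.stack([x, y]), k, rng)

angular = float(np.arccos(np.clip(x @ y, -1.0, 1.0)) / np.pi)
hamming = normalized_hamming(B[0], B[1])
print(f"angular distance: {angular:.3f}, normalized Hamming distance: {hamming:.3f}")
```

In this sketch the gap between the two printed values plays the role of the distortion $\delta$. Increasing `k` should shrink it on average, in line with the trade-off between $k$ and $\delta$ described in the abstract.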
