Near-Optimal Sample Complexity Bounds for Circulant Binary Embedding
Samet Oymak

TL;DR
This paper presents near-optimal bounds for binary embedding using circulant matrices, achieving efficient embeddings with minimal distortion and sample complexity, especially for large point sets in high-dimensional spaces.
Contribution
It provides the first near-optimal sample complexity bounds for circulant binary embedding, improving theoretical understanding and practical efficiency.
Findings
Embedding $N$ points into $\u2208 ext{cube}$ with $k\, extasciitilde\, ext{delta}^{-3}\, ext{log}\,N$ samples is optimal.
The results hold when $ ext{log}\,N\, extless extless\,n^{1/3}$.
Most points can be embedded with optimal distortion when $ ext{log}\,N extless extless\, extsqrt{n}$.
Abstract
Binary embedding is the problem of mapping points from a high-dimensional space to a Hamming cube in lower dimension while preserving pairwise distances. An efficient way to accomplish this is to make use of fast embedding techniques involving Fourier transform e.g.~circulant matrices. While binary embedding has been studied extensively, theoretical results on fast binary embedding are rather limited. In this work, we build upon the recent literature to obtain significantly better dependencies on the problem parameters. A set of points in can be properly embedded into the Hamming cube with distortion, by using samples which is optimal in the number of points and compares well with the optimal distortion dependency . Our optimal embedding result applies in the regime . Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
