On Closest Pair in Euclidean Metric: Monochromatic is as Hard as   Bichromatic

Karthik C. S.; Pasin Manurangsi

arXiv:1812.00901·cs.CG·December 4, 2018

On Closest Pair in Euclidean Metric: Monochromatic is as Hard as Bichromatic

Karthik C. S., Pasin Manurangsi

PDF

Open Access

TL;DR

This paper proves tight complexity bounds for the Closest Pair problem in high-dimensional Euclidean spaces under SETH, showing it is computationally hard to solve or approximate faster than certain thresholds.

Contribution

It establishes new hardness results for the Closest Pair problem in high dimensions, connecting geometric complexity with fine-grained computational complexity assumptions.

Findings

01

No $O(n^{2- ext{epsilon}})$ algorithm in high dimensions under SETH.

02

No $O(n^{1.5- ext{epsilon}})$ approximation algorithm in high dimensions under SETH.

03

Construction of dense bipartite graphs with low contact dimension for hardness proofs.

Abstract

Given a set of $n$ points in $R^{d}$ , the (monochromatic) Closest Pair problem asks to find a pair of distinct points in the set that are closest in the $ℓ_{p}$ -metric. Closest Pair is a fundamental problem in Computational Geometry and understanding its fine-grained complexity in the Euclidean metric when $d = ω (lo g n)$ was raised as an open question in recent works (Abboud-Rubinstein-Williams [FOCS'17], Williams [SODA'18], David-Karthik-Laekhanukit [SoCG'18]). In this paper, we show that for every $p \in R_{\geq 1} \cup {0}$ , under the Strong Exponential Time Hypothesis (SETH), for every $ε > 0$ , the following holds: $∙$ No algorithm running in time $O (n^{2 - ε})$ can solve the Closest Pair problem in $d = (lo g n)^{Ω_{ε} (1)}$ dimensions in the $ℓ_{p}$ -metric. $∙$ There exists $δ = δ (ε) > 0$ and $c =…

Equations85

∥ τ (u) - τ (v) ∥_{p}

∥ τ (u) - τ (v) ∥_{p}

∥ τ (u) - τ (v) ∥_{p}

A = {a \circ (k \cdot τ (a)) ∣ a \in A}, B = {b \circ (k \cdot τ (b)) ∣ b \in B},

A = {a \circ (k \cdot τ (a)) ∣ a \in A}, B = {b \circ (k \cdot τ (b)) ∣ b \in B},

Q = {x^{h} + p (x) ∣ p (x) \in P} .

Q = {x^{h} + p (x) ∣ p (x) \in P} .

∣ E^{*} ∣ = q^{h} \cdot (h q) \geq q^{h} \cdot \frac{q ^{h}}{h ^{h}} > \frac{n ^{2}}{( lo g n ) ^{Θ ((l o g n) / (g \cdot l o g l o g n)} )} = n^{2 - O (\nicefrac 1 g)} .

∣ E^{*} ∣ = q^{h} \cdot (h q) \geq q^{h} \cdot \frac{q ^{h}}{h ^{h}} > \frac{n ^{2}}{( lo g n ) ^{Θ ((l o g n) / (g \cdot l o g l o g n)} )} = n^{2 - O (\nicefrac 1 g)} .

S^{*} = {s^{*} + c ∣ c \in C^{*}} .

S^{*} = {s^{*} + c ∣ c \in C^{*}} .

i \in [k] \cup E_{G_{π_{i}}} = E_{K_{n, n}}

i \in [k] \cup E_{G_{π_{i}}} = E_{K_{n, n}}

ψ (v) = e_{v_{1}} \circ e_{v_{2}} \circ \dots \circ e_{v_{N}},

ψ (v) = e_{v_{1}} \circ e_{v_{2}} \circ \dots \circ e_{v_{N}},

T_{A} (0) = 11000 T_{A} (1) = 00110

T_{A} (0) = 11000 T_{A} (1) = 00110

T_{B} (0) = 10100 T_{B} (1) = 01001 \qed

T_{B} (0) = 10100 T_{B} (1) = 01001 \qed

A_{i}^{t} = {a \circ (1_{d + 1} \otimes τ_{t} (a)) ∣ a \in A_{i}}, B_{j}^{t} = {b \circ (1_{d + 1} \otimes τ_{t} (b)) ∣ b \in B_{j}}

A_{i}^{t} = {a \circ (1_{d + 1} \otimes τ_{t} (a)) ∣ a \in A_{i}}, B_{j}^{t} = {b \circ (1_{d + 1} \otimes τ_{t} (b)) ∣ b \in B_{j}}

E_{s \in C_{2} ∖ C_{1}} [∣ B (s, Δ (C_{2})) \cap C_{1} ∣]

E_{s \in C_{2} ∖ C_{1}} [∣ B (s, Δ (C_{2})) \cap C_{1} ∣]

= c \in C_{1} \sum s \in C_{2} ∖ C_{1} Pr [Δ (s - c) \leq Δ (C_{2})]

= c \in C_{1} \sum s \in C_{2} ∖ C_{1} Pr [Δ (s) \leq Δ (C_{2})]

= ∣ C_{1} ∣ \cdot \frac{∣ ( C _{2} ∖ C _{1} ) \cap B ( 0 , Δ ( C _{2} )) ∣}{∣ C _{2} ∖ C _{1} ∣} .

E_{s \in C_{2} ∖ C_{1}} [∣ B (s, Δ (C_{2})) \cap C_{1} ∣]

E_{s \in C_{2} ∖ C_{1}} [∣ B (s, Δ (C_{2})) \cap C_{1} ∣]

\frac{∣ B ( s , Δ ( C _{2} )) \cap C _{1} ∣}{∣ C _{1} ∣}

\frac{∣ B ( s , Δ ( C _{2} )) \cap C _{1} ∣}{∣ C _{1} ∣}

(By Lemma \ref l e m : m d s)

\geq \frac{( \frac{q}{K _{2} - 1} ) ^{K_{2} - 1} \cdot ( q - 1 )}{q ^{K_{2}}}

= \frac{q - 1}{q} \cdot (\frac{1}{K _{2} - 1})^{K_{2} - 1}

= \frac{q - 1}{q} \cdot \frac{1}{K _{1}^{K_{1}}}

\geq \frac{1}{2} \cdot \frac{1}{q ^{δ K_{1}}}

= Ω (∣ C_{1} ∣^{- δ}),

\frac{∣ B ( s , Δ ( C _{2} )) \cap C _{1} ∣}{∣ C _{1} ∣} \geq \frac{q - 1}{q} \cdot (\frac{1}{K _{2} - 1})^{K_{2} - 1} = \frac{q - 1}{q} \cdot \frac{1}{( 3 K _{1} ) ^{(3 K_{1})}} \geq \frac{1}{2} \cdot \frac{1}{q ^{δ K_{1}}} = Ω (∣ C_{1} ∣^{- δ}) .

\frac{∣ B ( s , Δ ( C _{2} )) \cap C _{1} ∣}{∣ C _{1} ∣} \geq \frac{q - 1}{q} \cdot (\frac{1}{K _{2} - 1})^{K_{2} - 1} = \frac{q - 1}{q} \cdot \frac{1}{( 3 K _{1} ) ^{(3 K_{1})}} \geq \frac{1}{2} \cdot \frac{1}{q ^{δ K_{1}}} = Ω (∣ C_{1} ∣^{- δ}) .

A_{N - a} (C) \geq \frac{( a N )}{( q + 1 ) ^{2 g}} .

A_{N - a} (C) \geq \frac{( a N )}{( q + 1 ) ^{2 g}} .

A_{N_{i} - a_{2}} (C_{2}) \geq \frac{( a _{2} N _{i} )}{( q + 1 ) ^{2 g_{i}}} .

A_{N_{i} - a_{2}} (C_{2}) \geq \frac{( a _{2} N _{i} )}{( q + 1 ) ^{2 g_{i}}} .

\frac{∣ B ( s , Δ ( C _{2} )) \cap C _{1} ∣}{∣ C _{1} ∣}

\frac{∣ B ( s , Δ ( C _{2} )) \cap C _{1} ∣}{∣ C _{1} ∣}

(From Lemma \ref l e m : a g - p ai r)

(Singleton Bound)

\geq \frac{( N _{i} / a _{2} ) ^{a_{2}}}{( q + 1 ) ^{2 g_{i}} \cdot q ^{a_{2} + 1}}

= \frac{q ^{0.5 (1 - δ) a_{2}}}{( q + 1 ) ^{2 g_{i}} \cdot q ^{a_{2} + 1}}

= \frac{1}{( q + 1 ) ^{2 g_{i}} \cdot q ^{(0.5 + 0.5 δ) a_{2} + 1}}

= \frac{1}{q ^{(0.5 + 0.5 δ + o (1)) a_{2}}}

= \frac{1}{q ^{(0.5 + 0.5 δ + o (1)) (a_{1} + o (1))}}

= \frac{1}{∣ C _{1} ∣ ^{(0.5 + 0.5 δ + o (1))}}

\geq Ω (∣ C_{1} ∣^{- 0.5 - 0.5 δ - o (1)})

μ \geq \frac{a _{2} - a _{1} - 1}{N _{i} - a _{2}} = Ω (1/ q) .

μ \geq \frac{a _{2} - a _{1} - 1}{N _{i} - a _{2}} = Ω (1/ q) .

O ((a _{2} N + a _{2} - 1) \cdot ∣ C_{2} ∣ \cdot poly (N_{i}))

O ((a _{2} N + a _{2} - 1) \cdot ∣ C_{2} ∣ \cdot poly (N_{i}))

\leq O ((2 e q)^{a_{2}} \cdot ∣ C_{2} ∣ \cdot poly (N_{i}))

\leq O (∣ C_{1} ∣ \cdot ∣ C_{2} ∣ \cdot poly (N_{i}))

\leq O (n_{i}^{3}),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Geometry and Mesh Generation · Complexity and Algorithms in Graphs · Advanced Graph Theory Research

Full text

On Closest Pair in Euclidean Metric:

Monochromatic is as Hard as Bichromatic

Karthik C. S.

Weizmann Institute of Science

[email protected] Supported by Irit Dinur’s ERC-CoG grant 772839 and BSF grant 2014371.

Pasin Manurangsi

University of California, Berkeley

[email protected] Supported by NSF under Grants No. CCF 1655215 and CCF 1815434.

Abstract

Given a set of $n$ points in $\mathbb{R}^{d}$ , the (monochromatic) Closest Pair problem asks to find a pair of distinct points in the set that are closest in the $\ell_{p}$ -metric. Closest Pair is a fundamental problem in Computational Geometry and understanding its fine-grained complexity in the Euclidean metric when $d=\omega(\log n)$ was raised as an open question in recent works (Abboud-Rubinstein-Williams [FOCS’17], Williams [SODA’18], David-Karthik-Laekhanukit [SoCG’18]).

In this paper, we show that for every $p\in\mathbb{R}_{\geq 1}\cup\{0\}$ , under the Strong Exponential Time Hypothesis ( $\mathsf{SETH}$ ), for every $\varepsilon>0$ , the following holds:

•

No algorithm running in time $O(n^{2-\varepsilon})$ can solve the Closest Pair problem in $d=(\log n)^{\Omega_{\varepsilon}(1)}$ dimensions in the $\ell_{p}$ -metric.

•

There exists $\delta=\delta(\varepsilon)>0$ and $c=c(\varepsilon)\geq 1$ such that no algorithm running in time $O(n^{1.5-\varepsilon})$ can approximate Closest Pair problem to a factor of $(1+\delta)$ in $d\geq c\log n$ dimensions in the $\ell_{p}$ -metric.

In particular, our first result is shown by establishing the computational equivalence of the bichromatic Closest Pair problem and the (monochromatic) Closest Pair problem (up to $n^{\varepsilon}$ factor in the running time) for $d=(\log n)^{\Omega_{\varepsilon}(1)}$ dimensions.

Additionally, under $\mathsf{SETH}$ , we rule out nearly-polynomial factor approximation algorithms running in subquadratic time for the (monochromatic) Maximum Inner Product problem where we are given a set of $n$ points in $n^{o(1)}$ -dimensional Euclidean space and are required to find a pair of distinct points in the set that maximize the inner product.

At the heart of all our proofs is the construction of a dense bipartite graph with low contact dimension, i.e., we construct a balanced bipartite graph on $n$ vertices with $n^{2-\varepsilon}$ edges whose vertices can be realized as points in a $(\log n)^{\Omega_{\varepsilon}(1)}$ -dimensional Euclidean space such that every pair of vertices which have an edge in the graph are at distance exactly 1 and every other pair of vertices are at distance greater than 1. This graph construction is inspired by the construction of locally dense codes introduced by Dumer-Miccancio-Sudan [IEEE Trans. Inf. Theory’03].

1 Introduction
1.1 Our Results
2 Proof Overview
2.1 Conditional Lower Bound on Exact Closest Pair
2.2 Abstracting the Construction via Error-Correcting Codes
2.3 Inapproximability of Closest Pair and Maximum Inner Product
2.3.1 Approximate Maximum Inner Product
2.3.2 Approximate Closest Pair
3 Preliminaries
3.1 Notations, Problems and Fine-Grained Hypotheses
3.2 Error-Correcting Codes
3.3 Miscellaneous Tools
3.4 $\mathsf{OVH}$ -hardness of Exact Bichromatic Closest Pair
3.5 Contact Dimension of a Graph
4 Lower Bound on Closest Pair under Orthogonal Vector Hypothesis
5 Gadget Constructions
5.1 Finding a Center of a Code via Another Code
5.2 Gadgets based on Reed-Solomon Codes
5.2.1 The Basic Gadget: Dense Bipartite Graphs with Low Contact Dimensions
5.2.2 A Gadget for Maximum Inner Product
5.3 Gadgets based on AG Codes
6 Inapproximability of Maximum Inner Product
7 Inapproximability of Closest Pair
8 Discussion and Open Questions
Acknowledgements
A Lower Bound on Gap Closest Pair in Edit Distance Metric
B Covering Biclique By Isomorphic Graphs: Proof of Lemma 3.11

1 Introduction

The Closest Pair of Points problem or Closest Pair problem ( $\mathsf{CP}$ ) is a fundamental problem in computational geometry: given $n$ points in a $d$ -dimensional metric space, find a pair of distinct points with the smallest distance between them. The Closest Pair problem for points in the Euclidean plane [SH75, BS76] stands at the origins of the systematic study of the computational complexity of geometric problems [PS85, Man89, KT05, CLRS09]. Since then, this problem has found abundant applications in geographic information systems [Hen06], clustering [Zah71, Alp10], and numerous matching problems (such as stable marriage [WTFX07]).

The trivial algorithm for $\mathsf{CP}$ examines every pair of points in the point-set and runs in time $O(n^{2}d)$ . Over the decades, there have been a series of developments on $\mathsf{CP}$ in low dimensional space for the Euclidean metric [Ben80, HNS88, KM95, SH75, BS76], leading to a deterministic $O(2^{O(d)}n\log n)$ -time algorithm [BS76] and a randomized $O(2^{O(d)}n)$ -time algorithm [Rab76, KM95]. For low (i.e., constant) dimensions, these algorithms are tight as a matching lower bound of $\Omega(n\log n)$ was shown by Ben-Or [Ben83] and Yao [Yao91] in the algebraic decision tree model, thus settling the complexity of $\mathsf{CP}$ in low dimensions. On other hand, for very high dimensions (i.e., $d=\Omega(n)$ ) there are subcubic algorithms [GS16, ILLP04] in the $\ell_{1},\ell_{2},$ and $\ell_{\infty}$ -metrics using fast matrix multiplication algorithms [Gal14]. However, $\mathsf{CP}$ in medium dimensions, i.e., $d=\text{polylog}(n)$ , and in various $\ell_{p}$ -metrics, have been a focus of study in machine learning and analysis of Big Data [Kle97], and it is surprising that, even with the tools and techniques that have been developed over many decades, when $d=\omega(\log n)$ , there is no known subquadratic-time (i.e., $O(2^{o(d)}n^{2-\varepsilon})$ -time) algorithm, for $\mathsf{CP}$ in any standard distance measure [Ind00, AC09, ILLP04] . The absence of such algorithms was explicitly observed as early as the late nineties by Cohen and Lewis [CL99] but there was not any explanation until recently.

David, Karthik, and Laekhanukit [DKL18] showed that for all $p>2$ , assuming the Strong Exponential Time Hypothesis ( $\mathsf{SETH}$ ), for every $\varepsilon>0$ , no algorithm running in $n^{2-\varepsilon}$ time can solve $\mathsf{CP}$ in the $\ell_{p}$ -metric, even when $d=\omega(\log n)$ . Their conditional lower bound was based on the conditional lower bound (again assuming $\mathsf{SETH}$ ) of Alman and Williams [AW15] for the Bichromatic Closest Pair problem111We remark that $\mathsf{BCP}$ is of independent interest as it’s equivalent to finding the Minimum Spanning Tree in $\ell_{p}$ -metric [AESW91, KLN99]. Moreover, understanding the fine-grained complexity of $\mathsf{BCP}$ has lead to better understanding of the query time needed for Approximate Nearest Neighbor search problem (see Razenshteyn’s thesis [Raz17] for a survey about the problem) with polynomial preprocessing time [Rub18]. ( $\mathsf{BCP}$ ) where we are given two sets of $n$ points in a $d$ -dimensional metric space, and the goal is to find a pair of points, one from each set, with the smallest distance between them. Alman and Williams showed that for all $p\in\mathbb{R}_{\geq 1}\cup\{0\}$ , assuming $\mathsf{SETH}$ , for every $\varepsilon>0$ , no algorithm running in $n^{2-\varepsilon}$ time can solve $\mathsf{BCP}$ in the $\omega(\log n)$ -dimensional $\ell_{p}$ -metric space. Given that [AW15] show their lower bound on $\mathsf{BCP}$ for all $\ell_{p}$ -metrics, the lower bound on $\mathsf{CP}$ of [DKL18] feels unsatisfactory, since the $\ell_{2}$ -metric is arguably the most interesting metric to study $\mathsf{CP}$ on. On the other hand, the answer to the complexity of $\mathsf{CP}$ in the Euclidean metric might be on the positive side, i.e., there might exist an algorithm that performs well in the $\ell_{2}$ -metric because there are more tools available, e.g., Johnson-Lindenstrauss’ dimension reduction [JL84]. Thus we have the following question:

Open Question 1.1 (Abboud-Rubinstein-Williams222Please see the erratum in [ARW17a]. [ARW17b], Williams [Wil18a], David

-Karthik-Laekhanukit [DKL18]).

Is there an algorithm running in time $n^{2-\varepsilon}$ for some $\varepsilon>0$ which can solve $\mathsf{CP}$ in the Euclidean metric when the points are in $\omega(\log n)$ dimensions?

Even if the answer to the above question is negative, this does not rule out strong approximation algorithms for $\mathsf{CP}$ in the Euclidean metric, which might suffice for all applications. Indeed, we do know of subquadratic approximation algorithms for $\mathsf{CP}$ . For example, LSH based techniques can solve $(1+\delta)$ - $\mathsf{CP}$ (i.e., $(1+\delta)$ factor approximate $\mathsf{CP}$ ) in $n^{2-\Theta\left(\delta\right)}$ time [IM98], but cannot do much better [MNP07, OWZ14]. In a recent breakthrough, Valiant [Val15] obtained an approximation algorithm for $(1+\delta)$ - $\mathsf{CP}$ with runtime of $n^{2-\Theta\left(\sqrt{\delta}\right)}$ . The state of the art is an $n^{2-\widetilde{\Theta}\left(\delta^{1/3}\right)}$ -time algorithm by Alman, Chan, and Williams [ACW16]. Can the dependence on $\delta$ be improved indefinitely? For the case of $(1+\delta)$ - $\mathsf{BCP}$ , assuming $\mathsf{SETH}$ , Rubinstein [Rub18] answered the question in the negative. Does $(1+\delta)$ - $\mathsf{CP}$ also admit the same negative answer?

Open Question 1.2.

Is there an algorithm running in time $n^{2-\varepsilon}$ for some $\varepsilon>0$ which can solve $(1+\delta)$ - $\mathsf{CP}$ in the Euclidean metric when the points are in $\omega(\log n)$ dimensions for every $\delta>0$ ?

Another important geometric problem is the Maximum Inner Product problem ( $\mathsf{MIP}$ ): given $n$ points in the $d$ -dimensional Euclidean space, find a pair of distinct points with the largest inner product. This problem along with its bichromatic variant (Bichromatic Maximum Inner Product problem, denoted $\mathsf{BMIP}$ ) is extensively studied in literature (see [ARW17b] and references therein). Abboud, Rubinstein, and Williams [ARW17b] showed that assuming $\mathsf{SETH}$ , for every $\varepsilon>0$ , no $2^{(\log n)^{1-o(1)}}$ -approximation algorithm running in $n^{2-\varepsilon}$ time can solve $\mathsf{BMIP}$ when $d=n^{o(1)}$ . It is a natural question to ask if their inapproximability result can be extended to $\mathsf{MIP}$ :

Open Question 1.3.

Is there an algorithm running in time $n^{2-\varepsilon}$ for some $\varepsilon>0$ which can solve $\gamma$ - $\mathsf{MIP}$ in $n^{o(1)}$ dimensions for even $\gamma=2^{(\log n)^{1-o(1)}}$ ?

1.1 Our Results

In this paper we address all three previously mentioned open questions. First, we almost completely resolve Open Question 1.1. In particular, we show the following.

Theorem 1.4 (Subquadratic Hardness of $\mathsf{CP}$ ; Informal, See Theorem 4.3).

Let $p\in\mathbb{R}_{\geq 1}\cup\{0\}$ . Assuming $\mathsf{SETH}$ , for every $\varepsilon>0$ , no algorithm running in $n^{2-\varepsilon}$ time can solve $\mathsf{CP}$ in the $\ell_{p}$ -metric, even when $d=\left(\log n\right)^{\Omega_{\varepsilon}(1)}$ .

In particular we would like to emphasize that the dimension for which we show the lower bound on $\mathsf{CP}$ depends on $\varepsilon$ . We would also like to remark that our lower bound holds even when the input point-set of $\mathsf{CP}$ is a subset of $\{0,1\}^{d}$ . Finally, we note that the centerpiece of the proof of the above theorem (and also the proofs of the other results that will be subsequently mentioned) is the construction of a dense bipartite graph with low contact dimension, i.e., we construct a balanced bipartite graph on $n$ vertices with $n^{2-\varepsilon}$ edges whose vertices can be realized as points in a $(\log n)^{\Omega_{\varepsilon}(1)}$ -dimensional $\ell_{p}$ -metric space such that every pair of vertices which have an edge in the graph are at distance exactly 1 and every other pair of vertices are at distance greater than 1. This graph construction is inspired by the construction of locally dense codes introduced by Dumer, Miccancio, and Sudan [DMS03] and uses special density properties of Reed Solomon codes. A detailed proof overview is given in Section 2.1.

Next, we improve our result in Theorem 1.4 in some aspects by showing $1+o(1)$ factor inapproximability of $\mathsf{CP}$ even in $O_{\varepsilon}(\log n)$ dimensions, but can only rule out algorithms running in $n^{1.5-\varepsilon}$ time (as opposed to Theorem 1.4 which rules out exact algorithms for $\mathsf{CP}$ running in $n^{2-\varepsilon}$ time). More precisely, we show the following.

Theorem 1.5 (Subquadratic Hardness of gap- $\mathsf{CP}$ ).

Let $p\in\mathbb{R}_{\geq 1}\cup\{0\}$ . Assuming $\mathsf{SETH}$ , for every $\varepsilon>0$ , there exists $\delta(\varepsilon)>0$ and $c(\varepsilon)>1$ such that no algorithm running in $n^{1.5-\varepsilon}$ time that can solve $(1+\delta)$ - $\mathsf{CP}$ in the $\ell_{p}$ -metric, even when $d=c\log n$ .

We remark that the $n^{1.5-\varepsilon}$ lower bound on approximate $\mathsf{CP}$ is an artifact of our proof strategy and that a different approach or an improvement in the state-of-the-art bound on the number of minimum weight codewords in algebraic geometric codes (which are used in our proof), will lead to the complete resolution of Open Question 1.2.

It should also be noted that the approximate version of $\mathsf{CP}$ and the dimension are closely related. Namely, using standard dimensionality reduction techniques [JL84]333In fact, since our results applies to $\{0,1\}$ -vectors, simply subsampling coordinates would also work. for $(1+\delta)$ - $\mathsf{CP}$ , one can always assume that $d=O_{\delta}(\log n)$ . In other words, hardness of $(1+\delta)$ - $\mathsf{CP}$ immediately yields logarithmic dimensionality bound as a byproduct.

Finally, we completely answer Open Question 1.3 by showing the following inapproximability result for $\mathsf{MIP}$ , matching the hardness for $\mathsf{BMIP}$ from [ARW17b].

Theorem 1.6 (Subquadratic Hardness of gap- $\mathsf{MIP}$ ).

Assuming $\mathsf{SETH}$ , for every $\varepsilon>0$ , no algorithm running in $n^{2-\varepsilon}$ time can solve $\gamma$ - $\mathsf{MIP}$ for any $\gamma\leq 2^{(\log n)^{1-o(1)}}$ , even when $d=n^{o(1)}$ .

Recently, there have been a lot of results connecting $\mathsf{BCP}$ or $(1+o(1))$ - $\mathsf{BCP}$ to other problems (see [Rub18, Che18a, Che18b, CW19]). Now such connections can be extended to $\mathsf{CP}$ as well. For example, the following conditional lower bound follows from [Rub18] for gap- $\mathsf{CP}$ in the edit distance metric and for completeness a proof is given in Appendix A.

Theorem 1.7 (Subquadratic Hardness of gap- $\mathsf{CP}$ in edit distance metric).

Assuming $\mathsf{SETH}$ , for every $\varepsilon>0$ , there exists $\delta(\varepsilon)>0$ and $c(\varepsilon)>1$ such that no algorithm running in $n^{1.5-\varepsilon}$ time can solve $(1+\delta)$ - $\mathsf{CP}$ in the edit distance metric, even when $d=c\log n\log\log n$ .

2 Proof Overview

In this section, we provide an overview of our proofs. For ease of presentation, we will sometimes be informal here; all notions and proofs are formalized in subsequent sections. Our overview is organized as follows. First, in Subsection 2.1, we outline our proof of running time lower bounds for exact $\mathsf{CP}$ (Theorem 1.4). Then, in Subsection 2.2, we abstract part of our reduction using error-correcting codes, and relate them back to the works on locally dense codes [DMS03, CW12, Mic14] that inspire our constructions. Finally, in Subsection 2.3, we briefly discuss how to modify the base construction (i.e. code properties) to give conditional lower bounds for approximate $\mathsf{CP}$ and $\mathsf{MIP}$ (Theorems 1.5 and 1.6).

2.1 Conditional Lower Bound on Exact Closest Pair

In this subsection, we provide a proof overview of a slightly weaker version of Theorem 1.4, i.e., we show that assuming $\mathsf{SETH}$ , for every $p\in\mathbb{R}_{\geq 1}\cup\{0\}$ , no subquadratic time algorithm can solve $\mathsf{CP}$ in the $\ell_{p}$ -metric when $d=(\log n)^{\omega(1)}$ . We prove such a result by reducing $\mathsf{BCP}$ in dimension $d$ to $\mathsf{CP}$ in dimension $d+(\log n)^{\omega(1)}$ , and the subquadratic hardness for $\mathsf{CP}$ follows from the subquadratic hardness of $\mathsf{BCP}$ established by [AW15]. Note that the results in this paper remain interesting even if $\mathsf{SETH}$ is false, as our reduction shows that $\mathsf{BCP}$ and $\mathsf{CP}$ are computationally equivalent444We can reduce an instance of $\mathsf{CP}$ to an instance of $\mathsf{BCP}$ by randomly partitioning the input set of $\mathsf{CP}$ instance into two, and the optimal closest pair of points will be in different sets with probability $\nicefrac{{1}}{{2}}$ (and this reduction can be made deterministic). (up to $n^{o(1)}$ factor in the running time) when $d=(\log n)^{\omega(1)}$ . The conditional lower bound on $\mathsf{CP}$ is merely a consequence of this computational equivalence. Finally, we note that a similar equivalence also holds between $\mathsf{MIP}$ and $\mathsf{BMIP}$ .

Understanding an obstacle of [DKL18].

Our proof builds on the ideas of [DKL18] who showed that assuming $\mathsf{SETH}$ , for every $p>2$ , no subquadratic time algorithm can solve $\mathsf{CP}$ in the $\ell_{p}$ -metric when $d=\omega(\log n)$ . They did so by connecting the complexity of $\mathsf{CP}$ and $\mathsf{BCP}$ via the contact dimension of the balanced complete bipartite graph (biclique), denoted by $K_{n,n}$ . We elaborate on this below.

To motivate the idea behind [DKL18], let us first consider the trivial reduction from $\mathsf{BCP}$ to $\mathsf{CP}$ : given an instance $A,B$ of $\mathsf{BCP}$ , we simply output $A\cup B$ as an instance of $\mathsf{CP}$ . This reduction fails because there is no guarantee on the distances of a pair of points both in $A$ (or both in $B$ ). That is, there could be two points $\mathbf{a},\mathbf{a}^{\prime}\in A$ such that $\|\mathbf{a}-\mathbf{a}^{\prime}\|_{p}$ is much smaller than the optimum of $\mathsf{BCP}$ on $A,B$ . If we simply solve $\mathsf{CP}$ on $A\cup B$ , we might find such $\mathbf{a},\mathbf{a}^{\prime}$ as the optimal pair but this does not give the answer to the original $\mathsf{BCP}$ problem. In order to circumvent this issue, one needs a gadget that “stretch” pairs of points both in $A$ or both in $B$ further apart while keeping the pairs of points across $A$ and $B$ close (and preserving the optimum of $\mathsf{BCP}$ on $A,B$ ). It turns out that this notion corresponds exactly to the contact dimension of the biclique, which we define below.

Definition 2.1 (Contact Dimension [Pac80]).

For any graph $G=(V,E)$ , a mapping $\tau:V\to\mathbb{R}^{d}$ is said to realize $G$ (in the $\ell_{p}$ -metric) if for some $\beta>0$ , the following holds for every distinct vertices $u,v$ :

[TABLE]

The contact dimension (in the $\ell_{p}$ -metric) of $G$ , denoted by $\mathsf{cd}_{p}(G)$ , is the minimum $d\in\mathbb{N}$ such that there exists $\tau:V\to\mathbb{R}^{d}$ realizing $G$ in the $\ell_{p}$ -metric.

In this paper, we will be mainly interested in the contact dimension of bipartite graphs. Specifically, [DKL18] only consider the contact dimension of the biclique $K_{n,n}$ . Notice that a realization of biclique ensures that vertices on the same side are far from each other while vertices on different sides are close to each other preserving the optimum of $\mathsf{BCP}$ ; these are exactly the desired properties of a gadget outlined above. Using this, [DKL18] give a reduction from $\mathsf{BCP}$ to $\mathsf{CP}$ which shows that the two are computationally equivalent whenever $d=\Omega(\mathsf{cd}_{p}(K_{n,n}))$ , as follows.

Let $A,B\subseteq\mathbb{R}^{d}$ each of cardinality $n$ be an instance of $\mathsf{BCP}$ and let $\tau:A\dot{\cup}B\to\mathbb{R}^{\mathsf{cd}_{p}(K_{n,n})}$ be a map realizing the biclique $(A\dot{\cup}B,A\times B)$ in the $\ell_{p}$ -metric; we may assume w.l.o.g. that $\beta=1$ . Let $\delta$ be the distance between any point in $A$ and any point in $B$ (i.e., $\delta$ is an upper bound on the optimum of $\mathsf{BCP}$ ). Let $\rho>0$ be such that $\|\tau(\mathbf{a})-\tau(\mathbf{b})\|_{p}>1+\rho$ for all $\mathbf{a}\in A,\mathbf{b}\in B$ (and this is guaranteed to exist by (2)). Moreover, let $k>\delta/\rho$ be any sufficiently large number. Consider the point-sets $\widetilde{A},\widetilde{B}\subseteq\mathbb{R}^{d+\mathsf{cd}_{p}(K_{n,n})}$ of cardinality $n$ each defined as

[TABLE]

where $\circ$ denotes the concatenation between two vectors and $k\cdot\mathbf{x}$ denotes the usual scalar-vector multiplication (i.e. scaling $\mathbf{x}$ up by a factor of $k$ ). For brevity, we write $\widetilde{\mathbf{a}}$ and $\widetilde{\mathbf{b}}$ to denote $\mathbf{a}\circ(k\cdot\tau(\mathbf{a}))$ and $\mathbf{b}\circ(k\cdot\tau(\mathbf{b}))$ respectively.

We now argue that, if we can find the closest pair of points in $\widetilde{A}\cup\widetilde{B}$ , then we also immediately solve $\mathsf{BCP}$ for $(A,B)$ . More precisely, we claim that $(\mathbf{a}^{*},\mathbf{b}^{*})\in A\times B$ is a bichromatic closest pair of $(A,B)$ if and only if $(\widetilde{\mathbf{a}^{*}},\widetilde{\mathbf{b}^{*}})$ is a closest pair of $\widetilde{A}\cup\widetilde{B}$ .

To see that this is the case, observe that, for cross pairs $(\widetilde{\mathbf{a}},\widetilde{\mathbf{b}})\in\widetilde{A}\times\widetilde{B}$ , (1) implies that the distance $\|\widetilde{\mathbf{a}}-\widetilde{\mathbf{b}}\|_{p}$ is exactly $(k^{p}+\|\mathbf{a}-\mathbf{b}\|^{p}_{p})^{1/p}$ ; hence, among these pairs, $(\widetilde{\mathbf{a}^{*}},\widetilde{\mathbf{b}^{*}})$ is a closest pair iff $(\mathbf{a}^{*},\mathbf{b}^{*})$ is a bichromatic closest pair in $A,B$ . Notice also that, since the bichromatic closest pair in $A,B$ is of distance at most $\delta$ , the closest pair in $\widetilde{A}\cup\widetilde{B}$ is of distance at most $(k^{p}+\delta^{p})^{1/p}\leq k+\delta$ .

On the other hand, for pairs both from $\widetilde{A}$ or both from $\widetilde{B}$ , the distance must be at least $k(1+\rho)$ , which is more than $k+\delta$ from our choice of $k$ . As a result, these pairs cannot be a closest pair in $\widetilde{A}\cup\widetilde{B}$ , and this concludes the sketch of the proof.

There are a couple of details that we have glossed over here: one is that the gap $\rho$ cannot be too small (e.g., $\rho$ cannot be as small as $\nicefrac{{1}}{{2^{n}}}$ ) and the other is that we should be able to construct $\tau$ efficiently. Nevertheless, these are typically not an issue.

[DKL18] show that $\mathsf{cd}_{p}(K_{n,n})=\Theta(\log n)$ when $p>2$ and that the realization can be constructed efficiently and with sufficiently large $\rho$ . This implies the subquadratic hardness of $\mathsf{CP}$ (by reduction from $\mathsf{BCP}$ ) in the $\ell_{p}$ -metric for all $p>2$ and $d=\omega(\log n)$ . However, it was known that $\mathsf{cd}_{2}(K_{n,n})=\Theta(n)$ [FM88]. Thus, they could not extend their conditional lower bound to $\mathsf{CP}$ in the Euclidean metric555Note that plugging in the bound on $\mathsf{cd}_{2}(K_{n,n})$ in the result of [DKL18] yields that assuming $\mathsf{SETH}$ , no subquadratic in $n$ running time algorithm can solve $\mathsf{CP}$ when $d=\Omega(n)$ . This is not a meaningful lower bound as just the input size of $\mathsf{CP}$ when $d=\Omega(n)$ is $\Omega(n^{2})$ . even when $d=o(n)$ . In fact, this is a serious obstacle as it rules out many natural approaches to reduce $\mathsf{BCP}$ to $\mathsf{CP}$ in a black-box manner. Elaborating, the lower bound on $\mathsf{cd}_{2}(K_{n,n})$ rules out local gadget reductions which would replace each point with a composition of that point and a gadget with a small increase in the number of dimensions, as such gadgets can be used to construct a realization of $K_{n,n}$ in the Euclidean metric in a low dimensional space, contradicting the lower bound on $\mathsf{cd}_{2}(K_{n,n})$ .

Overcoming the Obstacle: Beyond Biclique.

We overcome the above obstacle by considering dense bipartite graphs, instead of the biclique. More precisely, we show that there exists a balanced bipartite graph $G^{*}=(A^{*}\dot{\cup}B^{*},E^{*})$ on $2n$ vertices such that $|E^{*}|~{}\geq~{}n^{2-o(1)}$ and $\mathsf{cd}_{p}(G^{*})$ is small (i.e. $\mathsf{cd}_{p}(G^{*})\leq(\log n)^{\omega(1)}$ ). We give a construction of such a graph below but before we do so, let us briefly argue why this suffices to show that $\mathsf{BCP}$ and $\mathsf{CP}$ are computationally equivalent (up to $n^{o(1)}$ multiplicative overhead in the running time) for dimension $d=\Omega(\mathsf{cd}_{p}(G^{*}))$ .

Let us consider the same reduction which produces $\widetilde{A},\widetilde{B}$ as before, but instead of using a realization of the biclique, we use a realization $\tau$ of $G^{*}$ . This reduction is of course incorrect: if $(\mathbf{a}^{*},\mathbf{b}^{*})$ is not an edge in $G^{*}$ , then $\|\tau(\mathbf{a}^{*})-\tau(\mathbf{b}^{*})\|_{p}$ could be large and, thus the corresponding pair of points $(\widetilde{\mathbf{a}^{*}},\widetilde{\mathbf{b}^{*}})\in\widetilde{A}\times\widetilde{B}$ , may not be the closest pair. Nevertheless, we are not totally hopeless: if $(\mathbf{a}^{*},\mathbf{b}^{*})$ is an edge, then we are in good shape and the reduction is correct.

With the above observation in mind, consider picking a random permutation $\pi$ of $A\cup B$ such that $\pi(A)=A$ and $\pi(B)=B$ and then initiate the above reduction with the map $(\tau\circ\pi)$ instead of $\tau$ . Note that $\tau\circ\pi$ is simply a realization of an appropriate permutation $G^{\prime}$ of $G^{*}$ (i.e., $G^{\prime}$ is isomorphic to $G^{*}$ ). Due to this, the probability that we are “lucky” and $(\mathbf{a}^{*},\mathbf{b}^{*})$ is an edge in $G^{\prime}$ is $p:=|E|/n^{2}$ ; when this is the case, solving $\mathsf{CP}$ on the resulting instance would give the correct answer for the original $\mathsf{BCP}$ instance. If we repeat this $\log n/p=n^{o(1)}$ times, we would find the optimum of the original $\mathsf{BCP}$ instance with high probability.

To recap, even when $G^{*}$ is not a biclique, we can still use it to give a reduction from $\mathsf{BCP}$ to $\mathsf{CP}$ , except that the reduction produces multiple (i.e. $\widetilde{O}(n^{2}/|E^{*}|)$ ) instances of $\mathsf{CP}$ . We remark here that the reduction can be derandomized: we can deterministically (and efficiently) pick the permutations so that the permuted graphs covers $K_{n,n}$ (see Lemma 3.11). As a minor digression, we would like to draw a parallel here with a recent work of Abboud, Rubinstein, and Williams [ARW17b]. The obstacle raised in [DKL18] is about the impossibility of certain kinds of many-one gadget reductions. We overcame it by designing a reduction from $\mathsf{BCP}$ to $\mathsf{CP}$ which not only increased the number of dimensions but also the number of points (by creating multiple instances of $\mathsf{CP}$ ). This technique is also utilized in [ARW17b] where they showed the impossibility of Deterministic Distributed PCPs (Theorem I.2 in [ARW17b]) but then overcame that obstacle by using an advice (which is then enumerated over resulting in multiple instances) to build Non-deterministic Distributed PCPs.

Constructing a dense bipartite graph with low contact dimension.

We now proceed to construct the desired graph $G^{*}=(A^{*}\cup B^{*},E^{*})$ . Note that any construction of a dense bipartite graph with contact dimension $n^{o(1)}$ is non-trivial. This is because it is known that a random graph has contact dimension $\Omega(n)$ in the Euclidean metric with high probability [RRS89, BL05], and therefore our graph construction must be significantly better than a random graph.

Our realization $\tau^{*}$ of $G^{*}$ will map into a subset of $\{0,1\}^{(\log n)^{\omega(1)}}$ . As a result, we can fix $p=0$ , since a realization of a graph with entries in $\{0,1\}$ in the Hamming-metric also realizes the same graph in every $\ell_{p}$ -metric for any $p\neq\infty$ .

Fix $g=\omega(1)$ . We associate $[n]$ with $\mathbb{F}_{q}^{h}$ where $q=\Theta\left((\log n)^{g}\right)$ is a prime and $h~{}=~{}\Theta\left(\frac{\log n}{g\cdot\log\log n}\right)$ . Let $\mathcal{P}$ be the set of all univariate polynomials (in $x$ ) over $\mathbb{F}_{q}$ of degree at most $h-1$ . We have that $|\mathcal{P}|=q^{h}=n$ and associate $\mathcal{P}$ with $A^{*}$ . Let $\mathcal{Q}$ be the set of all univariate monic polynomials (in $x$ ) over $\mathbb{F}_{q}$ of degree $h$ , i.e.,

[TABLE]

We associate the polynomials in $\mathcal{Q}$ with the vertices in $B^{*}$ (note that $|\mathcal{Q}|=n$ ). In fact, we view the vertices in $A^{*}$ and $B^{*}$ as being uniquely labeled by polynomials in $\mathcal{P}$ and $\mathcal{Q}$ respectively. For notational clarity, we write $p_{a}$ (resp. $p_{b}$ ) to denote the polynomial in $\mathcal{P}$ (resp. $\mathcal{Q}$ ) that is associated to $a\in A^{*}$ (resp. $b\in B^{*}$ ).

For every $a\in A^{*}$ and $b\in B^{*}$ , we include $(a,b)$ as an edge in $E^{*}$ if and only if the polynomial $p_{b}-p_{a}$ (which is of degree $h$ ) has $h$ distinct roots. This completes the construction of $G^{*}$ . We have to now show the following two claims about $G^{*}$ : (i) $|E^{*}|=n^{2-O\left(\nicefrac{{1}}{{g}}\right)}=n^{2-o(1)}$ and (ii) there is $\tau:A^{*}\dot{\cup}B^{*}\to\{0,1\}^{(\log n)^{O(g)}}=\{0,1\}^{(\log n)^{\omega(1)}}$ that realizes $G^{*}$ .

To show (i), let $\mathcal{R}$ be the set of all monic polynomials of degree $h$ with $h$ distinct roots. We have that $|\mathcal{R}|=\binom{q}{h}$ . Fix a vertex $a\in A^{*}$ . Its degree in $G^{*}$ is exactly $|\mathcal{R}|=\binom{q}{h}$ . This is because, for every polynomial $r\in\mathcal{R}$ , $r+a$ belongs to $\mathcal{Q}$ , and therefore $(a,r+a)\in E^{*}$ . This implies the following bound on $|E^{*}|$ :

[TABLE]

Next, to show (ii), we construct a realization $\tau^{*}:A^{*}\dot{\cup}B^{*}\to\mathbb{F}_{q}^{q}$ of $G^{*}$ . We note that, it is simple to translate the entries to $\{0,1\}$ instead of $\mathbb{F}_{q}$ , by replacing $i\in\mathbb{F}_{q}$ with the $i$ -th standard basis $\mathbf{e}_{i}\in\{0,1\}^{q}$ . This would result in a realization $\tau^{*}:A^{*}\dot{\cup}B^{*}\to\{0,1\}^{q^{2}}$ of $G^{*}$ ; notice that the dimension of $\tau^{*}$ is $q^{2}=\Theta((\log n)^{2g})$ as claimed.

We define $\tau^{*}$ as follows.

•

For every $a\in A^{*}$ , $\tau^{*}(a)$ is simply the vector of evaluation of $p_{a}$ on every element in $\mathbb{F}_{q}$ . More precisely, for every $j\in[q]$ , the $j$ -th coordinate of $\tau^{*}(a)$ is $p_{a}(j-1)$ .

•

Similarly, for every $b\in B^{*}$ and $j\in[q]$ , the $j$ -th coordinate of $\tau^{*}(b)$ is $p_{b}(j-1)$ .

We now show that $\tau^{*}$ is indeed a realization of $G^{*}$ ; specifically, we show that $\tau^{*}$ satisfies (1) and (2) with $\beta=q-h$ .

Consider any edge $(a,b)\in E^{*}$ . Notice that $\|\tau^{*}(a)-\tau^{*}(b)\|_{0}$ is the number of $x\in\mathbb{F}_{q}$ such that $p_{b}(x)-p_{a}(x)\neq 0$ . By definition of $E^{*}$ , $p_{b}-p_{a}$ is a polynomial with $h$ distinct roots over $\mathbb{F}_{q}$ . Thus, $\|\tau^{*}(a)-\tau^{*}(b)\|_{0}=q-h=\beta$ as desired.

Next, consider a non-edge $(a,b)\in(A^{*}\times B^{*})\setminus E^{*}$ . Then, we know that $p_{b}-p_{a}$ has at most $h-1$ distinct roots over $\mathbb{F}_{q}$ . Therefore, the polynomial $p_{b}-p_{a}$ is non-zero on at least $q-h+1$ coordinates. This implies that $\|\tau^{*}(a)-\tau^{*}(b)\|_{0}\geq q-h+1>\beta$ .

Finally, for any distinct $a,a^{\prime}\in A^{*}$ , we have $\|\tau^{*}(a)-\tau^{*}(a^{\prime})\|_{0}\geq q-h+1$ because $p_{a}-p_{a^{\prime}}$ is a non-zero polynomial of degree at most $h-1$ and thus can be zero over $\mathbb{F}_{q}$ in at most $h-1$ locations. Similarly, $\|\tau^{*}(b)-\tau^{*}(b^{\prime})\|_{0}\geq q-h+1$ for any distinct $b,b^{\prime}\in B^{*}$ .

This completes the proof sketch for both the claims about $G^{*}$ and yields Theorem 1.4 for $d=(\log n)^{\omega(1)}$ . Finally we remark that in the actual proof of Theorem 1.4, we will set the parameters in the above construction more carefully and achieve the bound $\mathsf{cd}_{p}(G^{*})=(\log n)^{O_{\varepsilon}(1)}$ .

2.2 Abstracting the Construction via Error-Correcting Codes

Before we move on to discuss the proofs of Theorems 1.6 and 1.5, let us give an abstraction of the construction in the previous subsection. This will allow us to easily generalize the construction for the aforemention theorems, and also to explain where our motivation behind the construction comes from in the first place.

Dense Bipartite Graph with Low Contact Dimension from Codes.

In order to construct a balanced bipartite graph $G^{*}$ on $2n$ vertices with $n^{2-o(1)}$ edges such that $\mathsf{cd}_{p}(G^{*})\leq~{}d^{*}$ , it suffices to have a code $C^{*}$ with the following properties (for code-related definitions, see Section 3.2):

•

$C^{*}\subseteq\mathbb{F}_{q}^{\ell}$ of cardinality $n$ is a linear code with block length $\ell$ over alphabet $\mathbb{F}_{q}$ , and minimum distance $\Delta$ .

•

There exists a center $s^{*}\in\mathbb{F}_{q}^{\ell}$ and $r^{*}<\Delta$ such that $|C^{*}|^{1-o(1)}$ codewords are at Hamming distance exactly $r^{*}$ from $s^{*}$ and no codeword is at distance less than $r^{*}$ from $s^{*}$ .

•

$q\cdot\ell=d^{*}$ .

We also require that $C^{*}$ and $s^{*}$ can be constructed in ${\rm{poly}}(n)$ time but we shall ignore this requirement for the ease of exposition.

We describe below how to construct $G^{*}$ from $C^{*}$ , but first note that the construction of $G^{*}$ we saw in the previous subsubsection was just showing that Reed Solomon codes [RS60] of block length $q=\Theta((\log n)^{g})$ and message length $h=\Theta\left(\frac{\log n}{g\cdot\log\log n}\right)$ over alphabet $\mathbb{F}_{q}$ with minimum distance $q-h+1$ has the above properties. The center $s^{*}$ in that construction was the evaluation of the polynomial $x^{h}$ over $\mathbb{F}_{q}$ , and $r^{*}$ was $q-h$ .

In general, to construct $G^{*}$ from $C^{*}$ , we first define a subset $S^{*}\subseteq\mathbb{F}_{q}^{\ell}$ of cardinality $n$ as follows:

[TABLE]

We associate the vertices in $A^{*}$ with the codewords of $C^{*}$ and vertices in $B^{*}$ with the strings in $S^{*}$ . For any $(\mathbf{a},\mathbf{b})\in A^{*}\times B^{*}$ , let $(\mathbf{a},\mathbf{b})\in E^{*}$ if and only if $\|\mathbf{b}-\mathbf{a}\|_{0}=r^{*}$ . This completes the construction of $G^{*}$ . We have to now show the following claims about $G^{*}$ : (i) $|E^{*}|=n^{2-o(1)}$ and (ii) there is $\tau:A^{*}\dot{\cup}B^{*}\to\{0,1\}^{q\cdot\ell}$ that realizes $G^{*}$ .

Item (i) follows rather easily from the properties of $C^{*}$ and $s^{*}$ . Let $T^{*}$ be the subset of $C^{*}$ of all codewords which are at distance exactly equal to $r^{*}$ from $s^{*}$ . From the definition of $s^{*}$ , we have $|T^{*}|=|C^{*}|^{1-o(1)}$ . Fix $\mathbf{a}\in A^{*}$ . Its degree in $G^{*}$ is $|T^{*}|=|C^{*}|^{1-o(1)}$ . This is because for every codeword $\mathbf{t}\in T^{*}$ we have that $\mathbf{t}-\mathbf{a}$ is a codeword in $C^{*}$ (from the linearity of $C^{*}$ ) and thus $\mathbf{s}^{*}-\mathbf{t}+\mathbf{a}$ is in $S^{*}$ , and therefore $(\mathbf{a},\mathbf{s}^{*}-\mathbf{t}+\mathbf{a})\in E^{*}$ .

For item (ii), consider the identity mapping $\tau^{*}:A^{*}\dot{\cup}B^{*}\to\mathbb{F}_{q}^{\ell}$ that maps each string to itself. It is simple to check that $\tau^{*}$ realizes $G^{*}$ in the Hamming metric (with $\beta=r^{*}$ ).

Recall from the previous subsection that given $\tau^{*}:A^{*}\dot{\cup}B^{*}\to\mathbb{F}_{q}^{\ell}$ that realizes $G^{*}$ in the Hamming metric, it is easy to construct $\tau:A^{*}\dot{\cup}B^{*}\to\{0,1\}^{q\cdot\ell}$ that realizes $G^{*}$ in the Hamming metric with a $q$ multiplicative factor blow-up in the dimension. This completes the proof of both the claims about $G^{*}$ and gives a general way to prove Theorem 1.4 given the construction of $C^{*}$ and $s^{*}$ .

Finding Center from Another Code.

One thing that might not be clear so far is: where does the center $s^{*}$ come from? Here we provide a systematic way to produce such an $s^{*}$ , by looking at another code that contains $C^{*}$ . More precisely, let $C^{*}\subseteq\widetilde{C}^{*}\subseteq\mathbb{F}_{q}^{\ell}$ be two linear codes with the same block length and alphabet. Suppose that the distance of $C^{*}$ is $\Delta$ , the distance of $\widetilde{C}^{*}$ is $r^{*}$ and that $r^{*}<\Delta$ . It is easy to see that, by taking $s^{*}$ to be any element of $\widetilde{C}^{*}\setminus C^{*}$ , it holds that every codeword in $C^{*}$ is at distance at least $r^{*}$ from $s^{*}$ , simply because both $s^{*}$ and the codewords of $C^{*}$ are codewords of $\widetilde{C}^{*}$ .

Hence, we are only left to argue that there are many codewords of $C^{*}$ that is of distance exactly $r^{*}$ from $s^{*}$ . While this is not true in general, we can show by an averaging argument that this is true (for some $s^{*}\in\widetilde{C}^{*}$ ) if a large fraction (e.g. $|C^{*}|^{-o(1)}$ fraction) of codewords of $\widetilde{C}^{*}$ has Hamming weight exactly $r^{*}$ (see Lemma 5.1).

Indeed, viewing in this light, our previous choice of center for Reed-Solomon code (i.e. evaluation of $x^{h}$ ) is not coincidental: we simply take $\widetilde{C}^{*}$ to be another Reed-Solomon code with message length $h+1$ (whereas the base code $C^{*}$ is of message length $h$ ).

Comparison to Locally Dense Codes.

We end this subsection by remarking that the codes that we seek are very similar to locally dense codes [DMS03, CW12, Mic14], which is indeed our inspiration. A locally dense code is a linear code of block length $\ell$ and large minimum distance $\Delta$ , admitting a ball centered at $s$ of radius666 Clearly, for the ball to contain more than a single codeword, it must be $r\geq\Delta/2$ . Here we are interested in balls with radius not much bigger than that, say $r<\gamma\cdot\Delta$ for some constant $1/2<\gamma<1$ . $r<\Delta$ and containing a large (i.e. $\exp({\rm{poly}}(\ell))$ ) number of codewords777Strictly speaking, a locally dense code also requires an auxiliary matrix $T$ used to index these codewords. However, in previous works, finding $T$ is typically not hard given the center $s$ . Hence, we ignore $T$ in our discussion here for the ease of exposition.. Such codes are non-trivial to construct and in particular all known constructions of locally dense codes are using codes that beat the Gilbert-Varshamov (GV) bound [Gil52, Var57]; in other words we need to do better than random codes to construct them. This is because (as noted in [DMS03]), for a random code $C\subseteq\mathbb{F}_{q}^{\ell}$ (or any code that does not beat the GV bound), a random point in $\mathbb{F}_{q}^{\ell}$ acting as the center contains in expectation less than one codeword in a ball of radius $\Delta$ . Of course, this is simply an intuition and not a formal proof that a locally dense code needs to beat the GV bound, since there may be more sophisticated ways to pick a center.

Although the codes we require are similar to locally dense codes, there are differences between the two. Below we list four such differences: the first two makes it harder for us to construct our codes whereas the latter two makes it easier for us.

•

We seek a center $s^{*}$ so that no codewords in $C^{*}$ lies at distance less than $r^{*}$ , as opposed to locally dense codes which allows codewords to be close to $s^{*}$ . This is indeed where our idea of using another code $\widetilde{C}^{*}\supseteq C^{*}$ comes in, as picking $s^{*}$ from $\widetilde{C}^{*}\setminus C^{*}$ ensures us that no codeword of $C^{*}$ is too close to $s^{*}$ .

•

Another difference is that we need the number of codewords at distance $r^{*}$ from $s^{*}$ to be very large, i.e., $|C^{*}|^{1-o(1)}$ , whereas locally dense codes allow for much smaller number of codewords. Indeed, the deterministic constructions from [CW12, Mic14] only yield the bound of $2^{O(\sqrt{\log|C^{*}|})}$ . Hence, these do not directly work for us.

•

Locally dense codes requires $r$ to be at most $(1-\varepsilon)\Delta$ for some constant $\varepsilon>0$ , whereas we are fine with any $r^{*}<\Delta$ . In fact, our Reed-Solomon code based construction above only yields $r^{*}=\Delta-1$ which would not suffice for locally dense codes. Nevertheless, as we will see later for inapproximability of $\mathsf{CP}$ , we will also need the ratio $r^{*}/\Delta$ to be a constant bounded away from 1 as well and, since we need a code with these extraordinary properties, they are very hard to find. Indeed, in this case we only manage to prove a weaker lower bound on gap- $\mathsf{CP}$ .

•

Finally, we remark that locally dense codes are required to be efficiently constructed in ${\rm{poly}}(\log|C^{*}|)$ time, which is part of why it is hard to find. Specifically, while [DMS03] shows that an averaging argument works for a random center, derandomizing this is a big issue and a few subsequent works are dedicated solely to this issue [CW12, Mic14]. (We also note that it remains open whether a center can be deterministically found for a variant of locally dense codes used in hardness of parameterized version of the minimum distance problem. See [BGKM18] for more details.) On the other hand, brute force search (over all codewords in $\widetilde{C}^{*}$ ) suffices to find a center for us, as we are allowed construction time of ${\rm{poly}}(|C^{*}|)$ .

2.3 Inapproximability of Closest Pair and Maximum Inner Product

In this subsection, we sketch our inapproximability results for $\mathsf{MIP}$ and $\mathsf{CP}$ . Both these results use the same reduction that we had from $\mathsf{BCP}$ to $\mathsf{CP}$ , except that we now need stronger properties from the gadget, i.e., the previously used notions of contact dimension does not suffice anymore. Below we sketch the required strengthening of the gadget properties and explain how to achieve them.

2.3.1 Approximate Maximum Inner Product

Observe that the gadget we construct for $\mathsf{CP}$ in Subsection 2.2 can also be written in terms of inner product as follows: there exists a dense balanced bipartite graph $G^{*}=(A^{*}\dot{\cup}B^{*},E^{*})$ , a mapping $\tau:A^{*}\dot{\cup}B^{*}\to\{0,1\}^{q\cdot\ell}$ such that the following holds.

(i)

For all edges $(a,b)\in E^{*}$ , $\left<\tau(a),\tau(b)\right>=\ell-r^{*}$ . 2. (ii)

For all edges $(a,b)\in(A^{*}\times B^{*})\setminus E^{*}$ , $\left<\tau(a),\tau(b)\right><\ell-r^{*}$ . 3. (iii)

For all distinct $a,b$ both from $A^{*}$ or both from $B^{*}$ , $\left<\tau(a),\tau(b)\right>\leq\ell-\Delta$ .

Notice that we wrote the conditions above in a slightly different way than in previous subsections; previously in the contact dimension notation, (ii) and (iii) would be simply written together as: for all non-edge $(a,b)$ , $\left<\tau(a),\tau(b)\right><\ell-r^{*}$ . This change is intentional, since, to get gap in our reductions, we only need a gap between the bounds in (i) and (iii) (but not in (ii)). In particular, to get hardness of approximating $\mathsf{MIP}$ , we require $\frac{\ell-r^{*}}{\ell-\Delta}$ to be at least $(1+\varepsilon)$ for some $\varepsilon>0$ .

From our Reed-Solomon construction above, $\ell-\Delta$ and $\ell-r^{*}$ are exactly the message length of $C^{*}$ minus one and the message length of $\widetilde{C}^{*}$ minus one respectively. Previously, we selected these two to be $h$ and $h+1$ . Now to obtain the desired gap, we simply take the larger code $\widetilde{C}^{*}$ to be a Reed-Solomon code with larger (i.e. $(1+\varepsilon)h$ ) message length888This approach can in fact give not just $(1+\varepsilon)$ but arbitrarily large constant gap between the two cases. In the actual reduction, we take this gap to be 3 (Theorem 6.2), which makes some computations simpler..

Finally, we note that even with the above gadget, the reduction only gives a small (i.e. $1+o(1)$ ) factor hardness of approximating $\mathsf{MIP}$ (Theorem 6.2). To boost the gap to near polynomial, we simply tensor the vectors with themselves (see Section 6).

2.3.2 Approximate Closest Pair

Once again, recall that we have the following gadget from Subsection 2.2: there exists a dense balanced bipartite graph $G^{*}=(A^{*}\dot{\cup}B^{*},E^{*})$ , a mapping $\tau:A^{*}\dot{\cup}B^{*}\to\{0,1\}^{q\cdot\ell}$ such that the following holds.

(i)

For all edges $(a,b)\in E^{*}$ , $\|\tau(a)-\tau(b)\|_{0}=r^{*}$ . 2. (ii)

For all edges $(a,b)\in(A^{*}\times B^{*})\setminus E^{*}$ , $\|\tau(a)-\tau(b)\|_{0}>r^{*}$ . 3. (iii)

For all distinct $a,b$ both from $A^{*}$ or both from $B^{*}$ , $\|\tau(a)-\tau(b)\|_{0}\geq\Delta$ .

Once again, we need an $(1+\varepsilon)$ gap between the bounds in (iii) and (i), i.e., $\frac{\Delta}{r^{*}}$ . Unfortunately, we cannot construct such codes using any of the Reed-Solomon code families. We turn to another type of codes that beat the Gilbert-Varshamov bound: Algebraic- Geometric (AG) codes. Similar to the Reed-Solomon code based construction, we take $C^{*}$ as an AG code and $\widetilde{C}^{*}$ to be a “higher degree” AG code; getting the desired gap simply means that the distance of $C^{*}$ must be at least $(1+\varepsilon)$ times the distance of $\widetilde{C}^{*}$ .

Recall from Subsection 2.2 also that, to bound the density of $G^{*}$ , we need a lower bound on the number of minimum weight codewords of $\widetilde{C}^{*}$ . Such bounds for AG codes are non-trivial and we turn to the bounds from [ABV01, Vlă18]. Unfortunately, this only gives $G^{*}$ with density $|C^{*}|^{-1/2-o(1)}$ , instead of $|C^{*}|^{-o(1)}$ as before. This is indeed the reason that our running time lower bound for approximate $\mathsf{CP}$ is only $n^{1.5-\varepsilon}$ .

We are not aware of any result on the (asymptotic) tightness of the bounds from [ABV01, Vlă18] that we use. However, improving upon such bounds would have other consequences, such as a better bound on the kissing numbers of lattices constructed in [Vlă18]. As a result, it seems likely that more understanding of AG codes (and perhaps even new constructions) are needed in order to improve these bounds.

3 Preliminaries

In this section we define the geometric problems of interest to this paper, give an alternate proof for the conditional lower bound on bichromatic closest pair, and recall the definition of the contact dimension of a graph.

3.1 Notations, Problems and Fine-Grained Hypotheses

Distance Measures.

For any two vectors $a,b\in\mathbb{R}^{d}$ , the distance between them in the $\ell_{p}$ -metric is denoted by $||a-b||_{p}~{}=~{}\left(\sum_{i=1}^{d}|a_{i}-b_{i}|^{p}\right)^{1/p}$ . Their distance in the $\ell_{\infty}$ -metric is denoted by $||a-b||_{\infty}=\underset{{i\in[d]}}{\max}\ \{|a_{i}-b_{i}|\}$ , and in the $\ell_{0}$ -metric is denoted by $||a-b||_{0}=|\{i\in[d]:a_{i}\neq b_{i}\}|$ , i.e., the number of coordinates on which $a$ and $b$ differ. More generally, for any two vectors $a,b\in\mathbb{R}^{d}$ in the $\Delta$ -metric, we denote by $\Delta(a,b)$ its distance in that metric space. The $\ell_{p}$ -metrics that are well studied in literature are the Hamming metric ( $\ell_{0}$ -metric), the rectilinear metric ( $\ell_{1}$ -metric), the Euclidean metric ( $\ell_{2}$ -metric), and the Chebyshev metric ( $\ell_{\infty}$ -metric). We denote the inner product (associated with the Euclidean space) of $a$ and $b$ by $\langle a,b\rangle=\underset{i\in[d]}{\sum}a_{i}\cdot b_{i}$ . Finally, for every positive integer $d$ we define the edit metric over $\Sigma$ to be the space $\Sigma^{d}$ endowed with distance function $\mathsf{ed}(a,b)$ , which is defined as the minimum number of character substitutions/insertions/deletions to transform $a$ into $b$ .

Problems.

Here we give formal definitions of Orthogonal Vectors ( $\mathsf{OV}$ ), Closest Pair ( $\mathsf{CP}$ ) and Bichromatic Closest Pair ( $\mathsf{BCP}$ ) problems, and also Maximum Inner Product ( $\mathsf{MIP}$ ) and Bichromatic Maximum Inner Product ( $\mathsf{BMIP}$ ) problems.

Definition 3.1 (Orthogonal Vectors Problem, $\mathsf{OV}$ ).

In $\mathsf{OV}$ , we are given two collections of $n$ points $A,B\subseteq\mathbb{\{}0,1\}^{d}$ , and the goal is to find a pair of points $a\in A$ , $b\in B$ such that $\langle a,b\rangle=0$ .

Definition 3.2 (Closest Pair Problem, $\mathsf{CP}$ ).

In $\mathsf{CP}$ in the $\Delta$ -metric, we are given a collection of $n$ points $P\subseteq\mathbb{R}^{d}$ and a positive real $\alpha$ , and the goal is to find a pair of distinct points $a,b\in P$ such that $\Delta(a,b)\leq\alpha$ .

Definition 3.3 (Bichromatic Closest Pair Problem, $\mathsf{BCP}$ ).

In $\mathsf{BCP}$ in the $\Delta$ -metric, we are given two collections of $n$ points $A,B\subseteq\mathbb{R}^{d}$ and a positive real $\alpha$ , and the goal is to find a pair of points $a\in A$ , $b\in B$ such that $\Delta(a,b)\leq\alpha$ .

We will also use gap versions of these problems. For any $\delta\geq 0$ , we define $(1+\delta)$ - $\mathsf{CP}$ (resp. $(1+\delta)$ - $\mathsf{BCP}$ ) in the $\Delta$ -metric to be the problem of distinguishing between the case whether there exist distinct $a,b\in P$ (resp. $a\in A$ and $b\in B$ ) such that $\Delta(a,b)\leq\alpha$ and the case where for all distinct $a,b\in P$ (resp. $a\in A$ and $b\in B$ ) we have $\Delta(a,b)>(1+\delta)\cdot\alpha$ .

Definition 3.4 (Maximum Inner Product Problem, $\mathsf{MIP}$ ).

In $\mathsf{MIP}$ , we are given a collection of $n$ points $P\subseteq\mathbb{R}^{d}$ and a real $\alpha$ , and the goal is to find a pair of distinct points $a,b\in P$ such that $\langle a,b\rangle\geq\alpha$ .

Definition 3.5 (Bichromatic Maximum Inner Product Problem, $\mathsf{BMIP}$ ).

In $\mathsf{BMIP}$ , we are given two collections of $n$ points $A,B\subseteq\mathbb{R}^{d}$ and a real $\alpha$ , and the goal is to find a pair of points $a\in A$ , $b\in B$ such that $\langle a,b\rangle\geq\alpha$ .

Again we define the gap versions of these problems as follows. For any $\gamma\geq 1$ , we define $\gamma$ - $\mathsf{MIP}$ (resp. $\gamma$ - $\mathsf{BMIP}$ ) to be the problem of distinguishing between the case whether there exist distinct $a,b\in P$ (resp. $a\in A$ and $b\in B$ ) such that $\langle a,b\rangle\geq\alpha$ and the case where for all distinct $a,b\in P$ (resp. $a\in A$ and $b\in B$ ) we have $\langle a,b\rangle<\nicefrac{{\alpha}}{{\gamma}}$ .

Hypotheses.

Finally, we give formal definitions of the relevant fine-grained hypotheses (see [Wil18b] for a survey on the state-of-the-art conditional lower bounds that are known under these hypotheses).

Definition 3.6 (Strong Exponential Time Hypothesis, $\mathsf{SETH}$ [IP01, IPZ01, CIP06]).

For every $\varepsilon>0$ , there exists $k=k(\varepsilon)\in{\mathbb{N}}$ such that no algorithm can solve $k$ -SAT (i.e., satisfiability on a CNF of width $k$ ) in $O(2^{(1-\varepsilon)m})$ time where $m$ is the number of variables. Moreover, this holds even when the number of clauses is at most $c(\varepsilon)m$ where $c(\varepsilon)$ denotes a constant that depends only on $\varepsilon$ .

Definition 3.7 (Orthogonal Vector Hypothesis, $\mathsf{OVH}$ ).

For every $\varepsilon>0$ , no algorithm can solve $\mathsf{OV}$ in $O(n^{2-\varepsilon})$ time. Moreover, this holds even when the dimension $d$ is at most $c(\varepsilon)\log n$ where $c(\varepsilon)$ denotes a constant that depends only on $\varepsilon$ .

It is known that $\mathsf{SETH}$ implies $\mathsf{OVH}$ [Wil05], and therefore in the rest of the paper, we base all our conditional lower bounds on $\mathsf{OVH}$ .

3.2 Error-Correcting Codes

We recall here a few coding theoretic notations since all of our gadgets are based on error-correcting codes. As is standard in error-correcting codes, we will use $\Delta(\mathbf{a},\mathbf{b})$ to denote $\|\mathbf{a}-\mathbf{b}\|_{0}$ , the Hamming distance of $\mathbf{a}$ and $\mathbf{b}$ , for any $\mathbf{a},\mathbf{b}\in\mathbb{F}_{q}^{N}$ and we further define $\Delta(\mathbf{a},S):=\underset{\mathbf{b}\in S}{\min}\ \Delta(\mathbf{a},\mathbf{b})$ for any $\mathbf{a}\in\mathbb{F}_{q}^{N}$ and $S\subseteq\mathbb{F}_{q}^{N}$ . The weight of $\mathbf{a}\in\mathbb{F}_{q}^{N}$ , denoted by $\Delta(\mathbf{a})$ , is simply $\|\mathbf{a}\|_{0}:=|{i\in[N]:a_{i}\neq 0}|$ . For $\mathbf{a}\in\mathbb{F}_{q}^{N}$ and $d\in\mathbb{N}$ , we use $\mathcal{B}(\mathbf{a},d)$ to denote the (closed) Hamming ball of radius $d$ centered at $\mathbf{a}$ , i.e., $\mathcal{B}(\mathbf{a},d):=\{\mathbf{b}\in\mathbb{F}_{q}^{N}\mid\Delta(\mathbf{a},\mathbf{b})\leq d\}$ .

An error correcting code of block length $N$ over alphabet $\mathbb{F}_{q}$ is simply a collection of codewords $\mathcal{C}\subseteq\mathbb{F}_{q}^{N}$ . The distance of the code $\mathcal{C}$ , denoted by $\Delta(\mathcal{C})$ , is defined as $\underset{\mathbf{a}\neq\mathbf{b}\in\mathcal{C}}{\min}\ \Delta(\mathbf{a},\mathbf{b})$ . A code is said to be linear if $\mathcal{C}$ is a subspace of $\mathbb{F}_{q}^{N}$ . For a linear code $\mathcal{C}$ , its message length is defined to be the dimension of $\mathcal{C}$ , or equivalently $\log_{q}|\mathcal{C}|$ . We often use the notion $[N,K,D]_{q}$ to denote a linear code of block length $N$ , message length $K$ , and distance $D$ . The rate and relative distance of a linear $[N,K,D]_{q}$ code $\mathcal{C}$ are defined as $K/N$ and $D/N$ respectively. Note also that, for a linear code $\mathcal{C}$ , $\Delta(\mathcal{C})$ is equal to the minimum weight of a non-zero codeword of $\mathcal{C}$ . Finally, for any code $\mathcal{C}$ , we use $A_{w}(\mathcal{C}):=|\{\mathbf{c}\in\mathcal{C}\mid\Delta(\mathbf{c})=w\}|$ to denote the number of codewords of weight $w$ .

Let us also recall the Singleton bound and the definition of maximum distance separable (MDS) codes.

Theorem 3.8 (Singleton bound [Sin64]).

For any linear $[N,K,D]_{q}$ code, $K+D\leq N+1$ .

Definition 3.9 (MDS Codes).

A linear $[N,K,D]_{q}$ code is said to be a maximum distance separable (MDS) code if it matches the Singleton bound, i.e., $K+D=N+1$ .

We note here that the above bound and notation are well-defined (or can be naturally extended) also for non-linear codes, but we will only use them in context of linear codes in this paper.

3.3 Miscellaneous Tools

Covering Biclique by Isomorphic Graphs.

A useful fact we use to derandomize our reductions is that the biclique can be covered by any dense bipartite graph $G$ with only a few graphs that are isomorphic to $G$ . To state this more formally, let us first define a few notions.

Definition 3.10.

For any graph $G=(V_{G},E_{G})$ and any permutation $\pi:V_{G}\to V_{G}$ , we use $G_{\pi}$ to denote the graph $(V_{G_{\pi}},E_{G_{\pi}})$ where the vertex set $V_{G_{\pi}}$ is equal to $V_{G}$ and $E_{G_{\pi}}=\{(\pi(a),\pi(b))\mid(a,b)\in E_{G}\}$ .

For brevity, we say that a permutation $\pi:A\dot{\cup}B\to A\dot{\cup}B$ of vertices of a bipartite graph $G=(A\dot{\cup}B,E_{G})$ is side-preserving if $\pi(A)=A$ and $\pi(B)=B$ .

We can now state the result as follows. The proof, which proceeds via a simple set covering argument, is deferred to Appendix B.

Lemma 3.11.

For any bipartite graph $G(A\dot{\cup}B,E_{G})$ where $|A|=|B|=n$ and $E_{G}\neq\emptyset$ , there exist side-preserving permutations $\pi_{1},\dots,\pi_{k}:A\cup B\to A\cup B$ where $k\leq\frac{2n^{2}\ln n}{|E_{G}|}+1$ such that

[TABLE]

Moreover, such permutations can be found in time $O(n^{6}\log n)$ .

Translating Finite Fields Vectors to {0, 1}-Vectors.

Another simple fact which was already mentioned in the proof overview (Section 2) is that, we can embed Hamming metric on alphabet of size $q$ to Hamming metric on Boolean alphabet, with only $q$ multiplicative factor blow-up in the dimension:

Proposition 3.12.

For any $q,N\in\mathbb{N}$ , and alphabet $\Sigma$ such that $|\Sigma|=q$ , there exists a mapping $\psi:\Sigma^{N}\to\{0,1\}^{q\cdot N}$ such that, for all $\mathbf{v}_{1},\mathbf{v}_{2}\in\Sigma^{N}$ , we have $\|\psi(\mathbf{v}_{1})-\psi(\mathbf{v}_{2})\|_{0}=2\cdot\Delta(\mathbf{v}_{1},\mathbf{v}_{2})$ and $\left<\psi(\mathbf{v}_{1}),\psi(\mathbf{v}_{2})\right>=N-\Delta(\mathbf{v}_{1},\mathbf{v}_{2})$ .

Proof.

The mapping $\psi$ simply replaces each coordinate that is equal to $j\in\Sigma$ by the $j$ -th standard basis in the $q$ -dimensional space. More precisely, for $\mathbf{v}=(v_{1},\dots,v_{N})\in\mathbb{F}_{q}$ , we define

[TABLE]

where $\circ$ denotes concatenation of vectors and $e_{j}$ denote the $j$ -th standard basis in $\mathbb{R}^{q}$ , i.e., the vector whose $j$ -th coordinate is one and the remaining coordinates are zeroes.

It is simple to check that this satisfies the two requirements. ∎

3.4 $\mathsf{OVH}$ -hardness of Exact Bichromatic Closest Pair

Alman and Williams [AW15] showed the conditional hardness (under $\mathsf{OVH}$ ) of exact $\mathsf{BCP}$ in every $\ell_{p}$ -metric even when the point-sets are over $\{0,1\}$ via a Turing reduction from $\mathsf{OV}$ . David, Karthik, and Laekhanukit [DKL18] gave an alternate proof of the same result where point-sets were over $\mathbb{R}$ via a many-one reduction from $\mathsf{OV}$ . For independent interest, below we give another proof, which is both a many-one reduction and the point-sets are over $\{0,1\}$ .

Theorem 3.13.

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ , no algorithm running in time $n^{2-\varepsilon}$ can solve $\mathsf{BCP}$ , even when the point-sets $A,B$ are subsets of $\{0,1\}^{d}$ and $d=c_{\varepsilon}\log n$ , for some constant $c_{\varepsilon}>1$ (only depending on $\varepsilon$ ).

Proof.

Let $A,B\subseteq\{0,1\}^{d}$ where $|A|=|B|=n$ be the input to an $\mathsf{OV}$ instance. We build an instance $(A^{\prime},B^{\prime},\alpha)$ of $\mathsf{BCP}$ where $A^{\prime},B^{\prime}\subseteq\{0,1\}^{5d}$ , $|A|=|B|=n$ , and $\alpha=2d$ , using functions $T_{A}$ and $T_{B}$ guaranteed by the following claim.

Claim 3.14.

There are functions $T_{A},T_{B}:\{0,1\}\to\{0,1\}^{5}$ such that for every $x,y\in\{0,1\}$ we have:

•

$x\cdot y=0$ * implies $\|T_{A}(x)-T_{B}(y)\|_{0}=2$ .*

•

$x\cdot y=1$ * implies $\|T_{A}(x)-T_{B}(y)\|_{0}=4$ .*

For every $i\in[n]$ , the $i^{\text{th}}$ point of $A^{\prime}$ , say $a^{\prime}$ is constructed from the $i^{\text{th}}$ point of $A$ , say $a$ by simply applying $T_{A}$ pointwise on each coordinate of $a$ , i.e., $a^{\prime}=(T_{A}(a_{1}),\ldots,T_{A}(a_{d}))$ . Similarly we apply $T_{B}$ pointwise on each coordinate of points in $B$ . It is easy to see that there exists $(a_{i}^{\prime},b_{j}^{\prime})\in A^{\prime}\times B^{\prime}$ such that $\|a_{i}^{\prime}-b_{j}^{\prime}\|_{0}=2d$ if and only if $\langle a_{i},b_{j}\rangle=0$ , and otherwise every pair of points in $A^{\prime}\times B^{\prime}$ is at Hamming distance at least $2d+2$ . ∎

Proof of Claim 3.14.

We define for all $x,y\in\{0,1\}$ , $T_{A}(x)=(T_{A}(x)_{0,0},T_{A}(x)_{0,1},T_{A}(x)_{1,0},x,0)$ and $T_{B}(y)=(T_{B}(y)_{0,0},T_{B}(y)_{0,1},T_{B}(y)_{1,0},0,y)$ , where for all $i,j\in\{0,1\}$ such that $i\cdot j=0$ , we have $T_{A}(x)_{i,j}=1$ if and only if $x=i$ and $T_{B}(y)_{i,j}=1$ if and only if $y=j$ . More succinctly, $T_{A}$ and $T_{B}$ are described below as strings and the claim follows by a straight-forward calculation.

[TABLE]

3.5 Contact Dimension of a Graph

The central gadget in our reduction from $\mathsf{BCP}$ to $\mathsf{CP}$ is based on the contact dimension of a graph. Below we reproduce its definition from the proof overview (i.e. Definition 2.1) for convenience.

Definition 3.15 (Contact Dimension [Pac80]).

For any graph $G=(V,E)$ , a mapping $\tau:V\to\mathbb{R}^{d}$ is said to realize $G$ (in the $\ell_{p}$ -metric) if for some $\beta>0$ , the following holds:

(i)

For all $(u,v)\in E$ , $\|\tau(u)-\tau(v)\|_{p}=\beta$ . 2. (ii)

For all $(u,v)\notin E$ , $\|\tau(u)-\tau(v)\|_{p}>\beta$ .

The contact dimension (in the $\ell_{p}$ -metric) of $G$ , denoted by $\mathsf{cd}_{p}(G)$ , is the minimum $d\in\mathbb{N}$ such that there exists $\tau:V\to\mathbb{R}^{d}$ realizing $G$ in the $\ell_{p}$ -metric.

We may also say that $\tau$ $\beta$ -realizes $G$ if we wishes to emphasize the value of $\beta$ .

Note here that we may view points in $\tau(V)$ as centers of spheres of radius $\beta/2$ . No two spheres overlap but they may touch, and $G$ has an edge $(u,v)$ if and only if the spheres centered at $\tau(u)$ and $\tau(v)$ touches.

For a summary of the bounds on $\mathsf{cd}(G)$ for various graphs in the Euclidean metric see [Mae85, FM86, FM88, Mae91] and for a summary of the bounds on $\mathsf{cd}(K_{n,n})$ in various metrics see [DKL18]. For this paper, the following bounds are relevant.

Theorem 3.16 (Frankl-Maehara [FM88]).

$(1.286)n-1<\mathsf{cd}_{2}(K_{n,n})<(1.5)n.$ **

Theorem 3.17 (David-Karthik-Laekhanukit [DKL18]).

$\mathsf{cd}_{0}(K_{n,n})=n.$ **

In particular, the above two theorems are the obstacles of the approach of [DKL18] for the $\ell_{2}$ and Hamming metrics respectively. As discussed in the proof overview, we will overcome these barriers by constructing dense bipartite graphs with low contact dimensions in every $\ell_{p}$ metrics.

As discussed in Section 2.3.2, we need a generalization of contact dimension in order to show inapproximability for $\mathsf{CP}$ . This is formally defined below; it should be noted that the definition only makes sense for bipartite graphs, whereas the original contact dimension is well-defined for any graphs. Moreover, when $\lambda=1$ , the notion of gap contact dimension coincides with the (non-gap) contact dimension in bipartite graphs.

Definition 3.18 (Gap Contact Dimension).

For any bipartite graph $G=(A\dot{\cup}B,E)$ and $\lambda\geq 1$ , a mapping $\tau:V\to\mathbb{R}^{d}$ is said to $\lambda$ -gap-realize $G$ (in the $\ell_{p}$ -metric) if for some $\beta>0$ , the following holds:

(i)

For all $(u,v)\in E$ , $\|\tau(u)-\tau(v)\|_{p}=\beta$ . 2. (ii)

For all $(u,v)\in(A\times B)\setminus E$ , $\|\tau(u)-\tau(v)\|_{p}>\beta$ . 3. (iii)

For all distinct $u,v$ both from $A$ or both from $B$ , $\|\tau(u)-\tau(v)\|_{p}>\lambda\cdot\beta$ .

The $\lambda$ -gap contact dimension (in the $\ell_{p}$ -metric) of $G$ , denoted by $\lambda\text{-}\mathsf{cd}_{p}(G)$ , is the minimum $d\in\mathbb{N}$ such that there exists $\tau:V\to\mathbb{R}^{d}$ $\lambda$ -gap-realizing $G$ in the $\ell_{p}$ -metric.

Again, we may say that $\tau$ $(\beta,\lambda)$ -gap-realizes $G$ to emphasize the value of $\beta$ .

Finally, we define an analogous notion for inner product:

Definition 3.19 (Gap Inner Product Dimension).

For any bipartite graph $G=(A\dot{\cup}B,E)$ and $\lambda\geq 1$ , a mapping $\tau:V\to\mathbb{R}^{d}$ is said to $\lambda$ -gap- $\mathsf{IP}$ -realize $G$ if for some $\beta>0$ , the following holds:

(i)

For all $(u,v)\in E$ , $\langle\tau(u),\tau(v)\rangle=\beta$ . 2. (ii)

For all $(u,v)\in(A\times B)\setminus E$ , $\langle\tau(u),\tau(v)\rangle<\beta$ . 3. (iii)

For all distinct $u,v$ both from $A$ or both from $B$ , $\langle\tau(u),\tau(v)\rangle<\beta/\lambda$ .

The $\lambda$ -gap inner product dimension of $G$ , denoted by $\lambda\text{-}\mathsf{ipd}(G)$ , is the minimum $d\in\mathbb{N}$ such that there exists $\tau:V\to\mathbb{R}^{d}$ $\lambda$ -gap- $\mathsf{IP}$ -realizing $G$ .

We may say that $\tau$ $(\beta,\lambda)$ -gap- $\mathsf{IP}$ -realizes $G$ to emphasize the value of $\beta$ .

4 Lower Bound on Closest Pair under Orthogonal Vector Hypothesis

In this section, we prove the subquadratic hardness for $\mathsf{CP}$ (assuming $\mathsf{OVH}$ ) using the efficient construction of a realization of a dense bipartite graph. The construction will be be formally stated below and the details will be given in Section 5.2.1. First, we define the notion of a log-dense sequence of integers:

Definition 4.1.

A sequence $(n_{i})_{i\in\mathbb{N}}$ of increasing positive integers is said to be log-dense if there exists a constant $C\geq 1$ such that $\log n_{i+1}\leq C\cdot\log n_{i}$ for all $i\in\mathbb{N}$ .

As outlined in Section 2.1 , we use Reed-Solomon codes to construct a family of dense bipartite graphs with low contact dimensions. While the construction does not yield a graph for every number of vertices $n$ , it does yield a graph for a log-dense sequence of numbers of vertices, which turns out to be sufficient for the purpose of the reduction. More formally, we will prove the following in Section 5.2.1.

Theorem 4.2.

For every $0<\delta<1$ , there exists a log-dense sequence $(n_{i})_{i\in\mathbb{N}}$ such that, for every $i\in\mathbb{N}$ , there is a bipartite graph $G_{i}=(A_{i}\dot{\cup}B_{i},E_{i})$ where $|A_{i}|=|B_{i}|=n_{i}$ and $|E_{i}|\geq\Omega(n_{i}^{2-\delta})$ , such that $\mathsf{cd}(G_{i})=(\log n_{i})^{O(1/\delta)}$ . Moreover, for all $i\in\mathbb{N}$ , a realization $\tau:A_{i}\dot{\cup}B_{i}\to\{0,1\}^{(\log n_{i})^{O(1/\delta)}}$ of $G_{i}$ can be constructed in time $n_{i}^{2+o(1)}$ .

Notice that we did not specify any $\ell_{p}$ -metric in the notion of contact dimension above. This is intentional, because our point sets $\tau(A_{i}\dot{\cup}B_{i})$ have coordinate entries in $\{0,1\}$ , for which the distances in the Hamming metric are equivalent (up to power of $p$ ) to distances in any $\ell_{p}$ -metric ( $p\neq\infty$ ). We also adopt this notational convenience below. Specifically, we will prove the following theorem which states that $\mathsf{CP}$ is hard even when the points are from $\{0,1\}^{d}$ ; clearly, this also implies Theorem 1.4 due to the aforementioned equivalence to other $\ell_{p}$ -metrics.

Theorem 4.3 (Subquadratic Hardness of $\{0,1\}$ - $\mathsf{CP}$ ).

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ , there exists $s_{\varepsilon}>0$ such that no algorithm running in $O(n^{2-\varepsilon})$ time can solve $\mathsf{CP}$ in the Hamming metric even when $d=\left(\log n\right)^{s_{\varepsilon}}$ and all points have $\{0,1\}$ entries.

Proof.

For any $\varepsilon>0$ , let $C_{\text{exp}}$ be the constant such that the dimension guarantee for $\tau$ in Theorem 4.2 is at most $(\log n_{i})^{C_{\text{exp}}/\varepsilon}$ for $\delta=\varepsilon/2$ . We define $s_{\varepsilon}$ as $2\cdot C_{\text{exp}}/\varepsilon+2$ .

Assume that there exists $\varepsilon>0$ and an algorithm $\mathcal{A}$ that can solve $\mathsf{CP}$ in time $n^{2-\varepsilon}$ in the Hamming metric for any input of $n$ points in $\{0,1\}^{(\log n)^{s_{\varepsilon}}}$ . We will construct an algorithm $\mathcal{A}^{\prime}$ that solves any instance of $\mathsf{BCP}$ in time $n^{2-\varepsilon^{\prime}}$ for some constant $\varepsilon^{\prime}>0$ (to be specified below), on $n$ points in dimension $d:=c_{\varepsilon^{\prime}}\cdot\log n$ with coordinate entries in $\{0,1\}$ . Together with Theorem 3.13, this implies that $\mathsf{OVH}$ is false, arriving at a contradiction.

Let $C_{\varepsilon}$ denote the log-density constant (i.e. $\sup_{i}\frac{\log n_{i+1}}{\log n_{i}}$ ) of the sequence from Theorem 4.2 for $\delta=\varepsilon/2$ , and let $\varepsilon^{\prime}$ be $0.01\cdot\varepsilon/C_{\varepsilon}$ . The algorithm $\mathcal{A}^{\prime}$ on input $(A,B,\alpha)$ where $A,B\subseteq\{0,1\}^{d},$ with $|A|=|B|=n$ , and $\alpha\in[d]$ , works as follows:

Let $n^{\prime}$ be the largest number in the sequence from Theorem 4.2 with $\delta=\varepsilon/2$ s.t. $n^{\prime}\leq n^{0.1}$ . 2. 2.

Let $G^{\prime}=(A^{\prime}\dot{\cup}B^{\prime},E^{\prime})$ be the graph from Theorem 4.2 with $|A^{\prime}|=|B^{\prime}|=n^{\prime}$ , $|E^{\prime}|\geq\Omega((n^{\prime})^{2-\delta})$ , and $\tau:A^{\prime}\dot{\cup}B^{\prime}\to\{0,1\}^{(\log n^{\prime})^{C_{\text{exp}}/\varepsilon}}$ be a $\beta$ -realization of $G^{\prime}$ where $\beta\in\mathbb{N}$ . 3. 3.

We use the algorithm from Lemma 3.11 to find $\pi_{1},\dots,\pi_{k}$ where $k=O((n^{\prime})^{\delta}\log n^{\prime})$ such that the union of $E_{G^{\prime}_{\pi_{1}}},\ldots,E_{G^{\prime}_{\pi_{k}}}$ is $E_{K_{n^{\prime},n^{\prime}}}$ . 4. 4.

We assume w.l.o.g.999This is without loss of generality, since if $n$ is not divisible by $n^{\prime}$ , we can use brute force for the remainder points. This requires only $O(n\cdot n^{\prime}\cdot)=O(n^{1.1}\log n)$ which does not affect the overall asymptotic running time of the algorithm. that $n$ is divisible by $n^{\prime}$ . Partition $A$ and $B$ into $A_{1},\dots,A_{n/n^{\prime}}$ and $B_{1},\dots,B_{n/n^{\prime}}$ each of size $n^{\prime}$ . For each $i,j\in[n/n^{\prime}],t\in[k]$ , do the following:

(a)

Let $\tau_{t}$ be an appropriate permutation of $\tau$ that $\beta$ -realizes $G^{\prime}_{\pi_{t}}$ . Label the vertices of $G^{\prime}_{\pi_{t}}$ with the points in $A_{i}\dot{\cup}B_{j}$ . 2. (b)

Let $\alpha^{\prime}=\alpha+(d+1)\cdot\beta$ , and define $A_{i}^{t},B_{j}^{t}$ as

[TABLE]

where $\mathbf{1}_{d+1}\otimes\mathbf{v}$ simply denotes $\mathbf{v}\circ\mathbf{v}\circ\cdots\circ\mathbf{v}$ , i.e., the concatenation of $d+1$ copies of $\mathbf{v}$ . 3. (c)

Run $\mathcal{A}$ on $(A_{i}^{t}\dot{\cup}B_{j}^{t},\alpha^{\prime})$ . If $\mathcal{A}$ outputs YES, then output YES and terminate. 5. 5.

If none of the executions of $\mathcal{A}$ returns YES, then output NO.

Observe that the bottleneck in the running time of the algorithm is in the executions of $\mathcal{A}$ . The number of executions is $(n/n^{\prime})^{2}\cdot k$ and each execution takes $O((n^{\prime})^{2-\varepsilon})$ time. Hence, in total the running time of the algorithm $\mathcal{A}^{\prime}$ is $O((n/n^{\prime})^{2}\cdot k\cdot(n^{\prime})^{2-\varepsilon})\leq O(n^{2}\log n\cdot(n^{\prime})^{-\varepsilon/2})$ . Now, from the log-density of the sequence from Theorem 4.2, we have $n^{\prime}\geq n^{0.1/C_{\varepsilon}}=n^{10\varepsilon^{\prime}/\varepsilon}$ . As a result, the running time of $\mathcal{A}$ is at most $O(n^{2-5\varepsilon^{\prime}}\log n)\leq O(n^{2-\varepsilon^{\prime}})$ as desired.

To see the correctness of the algorithm, first observe that the dimensions of vectors in $A_{i}^{t},B_{j}^{t}$ are at most $d+(d+1)\cdot(\log n^{\prime})^{C_{\text{exp}}/\varepsilon}$ which is at most $(\log{n})^{s_{\varepsilon}}$ for any sufficiently large $n$ ; that is, the calls to $\mathcal{A}$ are valid. Next, observe that, if $(A,B,\alpha)$ is a YES instance of $\mathsf{BCP}$ , there must be $i,j\in[n/n^{\prime}]$ and $\mathbf{a}^{*}\in A_{i},\mathbf{b}^{*}\in B_{j}$ such that $\|\mathbf{a}^{*}-\mathbf{b}^{*}\|_{0}$ is at most $\alpha$ . Since $G^{\prime}_{\pi_{1}},\dots,G^{\prime}_{\pi_{k}}$ covers $K_{n^{\prime},n^{\prime}}$ , there must be $t\in[k]$ such that $\|\tau_{t}(\mathbf{a}^{*})-\tau_{t}(\mathbf{b}^{*})\|_{0}=\beta$ . As a result, $\|(\mathbf{a}^{*}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{a}^{*})))-(\mathbf{b}^{*}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{b}^{*})))\|_{0}\leq\alpha+(d+1)\cdot\beta=\alpha^{\prime}$ . Thus, $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ is a YES instance for $\mathsf{CP}$ and $\mathcal{A}^{\prime}$ outputs YES as desired.

Finally, assume that $(A,B,\alpha)$ is a NO instance of $\mathsf{BCP}$ . Consider any $i,j\in[n/n^{\prime}]$ and $t\in[k]$ . To argue that $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ is a NO instance for $\mathsf{CP}$ , we have to show that any two points in $A_{i}^{t}\cup B_{j}^{t}$ have distance more than $\alpha^{\prime}$ . To see this, let us consider two cases.

Both points are either from $A_{i}^{t}$ or from $B_{j}^{t}$ . Assume w.l.o.g. that the two points are from $A_{i}^{t}$ ; let them be $\mathbf{a}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{a}))$ and $\mathbf{a}^{\prime}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{a}^{\prime}))$ . Recall that, from the definition of $\beta$ -realization, $\|\tau_{t}(\mathbf{a})-\tau_{t}(\mathbf{a}^{\prime})\|_{0}>\beta$ . Since $\|\tau_{t}(\mathbf{a})-\tau_{t}(\mathbf{a}^{\prime})\|_{0}$ is an integer, we must have $\|\tau_{t}(\mathbf{a})-\tau_{t}(\mathbf{a}^{\prime})\|_{0}\geq\beta+1$ . As a result, the Hamming distance between the two points is at least $(d+1)\cdot(\beta+1)>d+(d+1)\cdot\beta=\alpha^{\prime}$ . 2. 2.

One of the point is from $A_{i}^{t}$ and the other from $B_{j}^{t}$ . Let them be $\mathbf{a}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{a}))$ and $\mathbf{b}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{b}))$ . Since $(A,B,\alpha)$ is a NO instance of $\mathsf{BCP}$ , $\|\mathbf{a}-\mathbf{b}\|_{0}>\alpha$ . Furthermore, from definition of $\beta$ -realization, we must have $\|\tau_{t}(\mathbf{a})-\tau_{t}(\mathbf{b})\|_{0}\geq\beta$ . Combining the two implies that the Hamming distance between $\mathbf{a}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{a}))$ and $\mathbf{b}\circ(\mathbf{1}_{d+1}\otimes\tau_{t}(\mathbf{b}))$ is more than $\alpha^{\prime}$ .

Hence, $(A_{i}^{t}\dot{\cup}B_{j}^{t},\alpha^{\prime})$ must be a NO instance for $\mathsf{CP}$ for every $t\in[k]$ and $i,j\in[n/n^{\prime}]$ . Thus, $\mathcal{A}^{\prime}$ outputs NO as desired. ∎

5 Gadget Constructions

In this section, we construct all the gadgets that are used in our reductions, including the basic gadget (Theorem 4.2) and more advanced gadgets used for $\mathsf{MIP}$ and approximate version of $\mathsf{CP}$ .

5.1 Finding a Center of a Code via Another Code

At the heart of all our gadgets is the task of finding a code $\mathcal{C}_{1}$ and a center $\mathbf{s}$ such that there are $|\mathcal{C}_{1}|^{1-o(1)}$ many codewords at Hamming distance exactly equal to $r$ (for some $r>0$ ) from $\mathbf{s}$ but there is no codeword in $\mathcal{C}_{1}$ at distance less than $r$ from $\mathbf{s}$ . The below lemma is useful in finding such an $\mathbf{s}$ .

Lemma 5.1.

Let $\mathcal{C}_{1}\subseteq\mathcal{C}_{2}\subseteq\mathbb{F}_{q}^{N}$ be two linear codes with the same block length $N$ and alphabet $\mathbb{F}_{q}$ such that $\Delta(\mathcal{C}_{2})<\Delta(\mathcal{C}_{1})$ . Then, there exists a center $\mathbf{s}\in\mathbb{F}_{q}^{N}$ such that (1) $\Delta(\mathbf{s},\mathcal{C}_{1})\geq\Delta(\mathcal{C}_{2})$ and (2) $|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|/|\mathcal{C}_{1}|\geq A_{\Delta(\mathcal{C}_{2})}(\mathcal{C}_{2})/|\mathcal{C}_{2}|$ . Moreover, given $\mathcal{C}_{1},\mathcal{C}_{2}$ , such an $\mathbf{s}$ can be found in $O(|\mathcal{C}_{1}|\cdot|\mathcal{C}_{2}|\cdot qN)$ time.

Proof.

We show that there exists $\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}$ such that (2) holds. Note that (1) immediately holds, because $\mathbf{s}-\mathbf{c}$ must be a non-zero codeword of $\mathcal{C}_{2}$ which implies that $\Delta(\mathbf{s},\mathbf{c})\geq\Delta(\mathcal{C}_{2})$ .

To show that there exists $\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}$ such that $|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|\geq|\mathcal{C}_{1}|\cdot A_{\Delta(\mathcal{C}_{2})}/|\mathcal{C}_{2}|$ . We will in fact show a stronger statement: for a random $\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}$ , we have $\mathbb{E}[|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|]\geq|\mathcal{C}_{1}|\cdot A_{\Delta(\mathcal{C}_{2})}/|\mathcal{C}_{2}|$ . Consider $\mathbb{E}_{\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}}[|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|]$ . Due to linearity of expectation, we have

[TABLE]

Now, since $\Delta(\mathcal{C}_{1})>\Delta(\mathcal{C}_{2})$ , we have $\mathcal{C}_{1}\cap\mathcal{B}(\mathbf{0},\Delta(\mathcal{C}_{2}))=\{\mathbf{0}\}$ . That is, $|(\mathcal{C}_{2}\setminus\mathcal{C}_{1})\cap\mathcal{B}(\mathbf{0},\Delta(\mathcal{C}_{2}))|=|(\mathcal{C}_{2}\setminus\{\mathbf{0}\})\cap\mathcal{B}(\mathbf{0},\Delta(\mathcal{C}_{2}))|=A_{\Delta(\mathcal{C}_{2})}(\mathcal{C}_{2})$ . Plugging this back into the above equality, we have

[TABLE]

Thus, there must exist a center $\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}$ that satisfies (2) (and also (1)) as desired.

Finally, note that $\mathbf{s}$ can be found by a brute force algorithm that tries every $\mathbf{s}\in\mathcal{C}_{2}$ and check whether (2) is satisfied; this algorithm takes $O(|\mathcal{C}_{1}|\cdot|\mathcal{C}_{2}|\cdot qN)$ time. ∎

5.2 Gadgets based on Reed-Solomon Codes

In this subsection, we construct gadgets based on the Reed Solomon codes, which are defined below.

Theorem 5.2 (Reed-Solomon Codes).

For every prime power $q$ , and every $K\leq N\leq q$ , there exists a $[N,K,N-K+1]_{q}$ linear code, denoted by $\mathsf{RS}_{q}[N,K]$ . The generator matrix of this code can be computed in time ${\rm{poly}}(N,K,q)$ . Moreover, for every $q\geq N\geq K_{2}>K_{1}$ , we have $\mathsf{RS}_{q}[N,K_{1}]\subseteq\mathsf{RS}_{q}[N,K_{2}]$ .

In order to find a good center $\mathbf{s}$ , we use the following (well-known) bound on the number of minimum weight codewords of Reed Solomon codes (and more generally MDS codes). For a reference of this bound, see e.g. [MS77, Ch. 11, Theorem 6].

Lemma 5.3.

Let $\mathcal{C}$ be any linear $[N,K,D]_{q}$ code that is MDS. Then, $A_{D}(\mathcal{C})=\binom{N}{K-1}\cdot(q-1)$ .

5.2.1 The Basic Gadget: Dense Bipartite Graphs with Low Contact Dimensions

Now we construct a dense bipartite graph with low contact dimension. A proof sketch of this construction was provided in Section 2.1 and was formally stated as Theorem 4.2.

Proof of Theorem 4.2.

Let $q_{i}$ be the $i$ -th prime number and let $n_{i}=(q_{i})^{(\lfloor q_{i}^{\delta}\rfloor)}$ ; it is simple to see that the sequence $(n_{i})_{i\in\mathbb{N}}$ is log-dense. For $q=q_{i}$ , consider the Reed-Solomon codes $\mathcal{C}_{1}=\mathsf{RS}_{q}[q,K_{1}]$ and $\mathcal{C}_{2}=\mathsf{RS}_{q}[q,K_{2}]$ where $K_{1}=\lfloor q^{\delta}\rfloor$ and $K_{2}=K_{1}+1$ . Applying Lemma 5.1 with $(\mathcal{C}_{1},\mathcal{C}_{2})$ implies that there exists a center $\mathbf{s}\in\mathcal{C}_{2}$ such that

[TABLE]

where the last equality follows from the fact that $|\mathcal{C}_{1}|=q^{K_{1}}$ .

We construct the graph $G_{i}=(A_{i},B_{i},E_{i})$ and a realization $\tau$ as follows. Let $A_{i}=\mathcal{C}_{1},B_{i}=\{\mathbf{s}+\mathbf{c}\mid\mathbf{c}\in\mathcal{C}_{1}\}$ and $E_{i}=\{(\mathbf{a},\mathbf{b})\in A_{i}\times B_{i}\mid\Delta(\mathbf{a},\mathbf{b})=\Delta(\mathcal{C}_{2})\}$ . $G_{i}$ can be easily realized by applying the mapping $\psi:\mathbb{F}_{q}^{q}\to\{0,1\}^{q^{2}}$ from Proposition 3.12. More precisely, let $\tau$ be the restriction of $\psi$ on $A_{i}\cup B_{i}$ . Below we argue about the density of $G_{i}$ and that $\tau$ is a $2\Delta(\mathcal{C}_{2})$ -realization of $G_{i}$ .

•

First, notice that $|E_{i}|$ is exactly $|\mathcal{C}_{1}|\cdot|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|\geq\Omega(|\mathcal{C}_{1}|^{2-\delta})=\Omega(n_{i}^{2-\delta})$ .

•

Second, notice that, for every $\mathbf{v}_{1},\mathbf{v}_{2}$ both from $A_{i}$ or both from $B_{i}$ , we have $\mathbf{v}_{1}-\mathbf{v}_{2}\in\mathcal{C}_{1}\setminus\{\mathbf{0}\}$ . This implies that $\|\tau(\mathbf{v}_{1})-\tau(\mathbf{v}_{2})\|_{0}=2\Delta(\mathbf{v}_{1},\mathbf{v}_{2})\geq 2\Delta(\mathcal{C}_{1})>2\Delta(\mathcal{C}_{2})$ .

•

Third, for every $\mathbf{a}\in A_{i}$ and $\mathbf{b}\in B_{i}$ , we have $\mathbf{a}-\mathbf{b}\in\mathcal{C}_{2}\setminus\{\mathbf{0}\}$ . Thus, $\Delta(\mathbf{a},\mathbf{b})\geq\Delta(\mathcal{C}_{2})$ . Hence, $\|\tau(\mathbf{a})-\tau(\mathbf{b})\|_{0}=2\Delta(\mathbf{a},\mathbf{b})\geq 2\Delta(\mathcal{C}_{2})$ . Moreover, the inequality is an equality if and only if $\Delta(\mathbf{a},\mathbf{b})=\Delta(\mathcal{C}_{2})$ , i.e., $(\mathbf{a},\mathbf{b})\in E_{i}$ as desired.

•

Finally, observe that the dimension is $q^{2}=(\log n_{i})^{O(1/\delta)}$ .

As for the running time of constructing $G_{i}$ and $\tau$ , observe that the bottleneck is the running time needed to find the center $\mathbf{s}$ ; according to Lemma 5.1, $\mathbf{s}$ can be computed in $O(|\mathcal{C}_{1}|\cdot|\mathcal{C}_{2}|\cdot q^{2})=O(n_{i}^{2}\cdot q^{2})$ , which is $n_{i}^{2+o(1)}$ as desired. ∎

5.2.2 A Gadget for Maximum Inner Product

Now, we build gadgets (stated below) which will be used for proving the inapproximability of $\mathsf{MIP}$ .

Theorem 5.4.

For every $0<\delta<1$ , there exists a log-dense sequence $(n_{i})_{i\in\mathbb{N}}$ such that, for every $i\in\mathbb{N}$ , there is a bipartite graph $G_{i}=(A_{i}\dot{\cup}B_{i},E_{i})$ where $|A_{i}|=|B_{i}|=n_{i}$ and $|E_{i}|\geq\Omega(n_{i}^{2-\delta})$ , such that 3- $\mathsf{ipd}(G)=(\log n_{i})^{O(1/\delta)}$ . Moreover, for all $i\in\mathbb{N}$ , a 3-gap- $\mathsf{IP}$ -realization $\tau:A_{i}\dot{\cup}B_{i}\to\{0,1\}^{(\log n_{i})^{O(1/\delta)}}$ of $G_{i}$ can be constructed in time $n_{i}^{4+o(1)}$ .

Proof.

The proof here is exactly the same as the proof of Theorem 4.2, except that we will not pick $K_{2}=K_{1}+1$ , but rather pick $K_{2}>3K_{1}$ (and $n_{i}$ accordingly).

More precisely, let $q_{i}$ be the $i$ -th prime number and let $n_{i}=(q_{i})^{(\lfloor q_{i}^{0.3\delta}/3\rfloor)}$ ; it is simple to see that the sequence $(n_{i})_{i\in\mathbb{N}}$ is log-dense. For $q=q_{i}$ , consider the Reed-Solomon codes $\mathcal{C}_{1}=\mathsf{RS}_{q}[q,K_{1}]$ and $\mathcal{C}_{2}=\mathsf{RS}_{q}[q,K_{2}]$ where $K_{1}=\lfloor q^{0.3\delta}/3\rfloor$ and $K_{2}=3K_{1}+1$ . Similar to the proof of Theorem 4.2, applying Lemma 5.1 with $(\mathcal{C}_{1},\mathcal{C}_{2})$ implies that there exists $\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}$ such that

[TABLE]

We construct the graph $G_{i}=(A_{i},B_{i},E_{i})$ and a realization $\tau$ as follows. Let $A_{i}=\mathcal{C}_{1},B_{i}=\{\mathbf{s}+\mathbf{c}\mid\mathbf{c}\in\mathcal{C}_{1}\}$ and $E_{i}=\{(\mathbf{a},\mathbf{b})\in A_{i}\times B_{i}\mid\Delta(\mathbf{a},\mathbf{b})=\Delta(\mathcal{C}_{2})\}$ . $G_{i}$ can be easily 3-gap- $\mathsf{IP}$ -realized by applying the mapping $\psi:\mathbb{F}_{q}^{q}\to\{0,1\}^{q^{2}}$ from Proposition 3.12. More precisely, let $\tau$ be the restriction of $\psi$ on $A_{i}\cup B_{i}$ . Below we argue about the density of $G_{i}$ and that $\tau$ is a $(K_{2}-1,3)$ -gap- $\mathsf{IP}$ -realization of $G_{i}$ .

•

First, notice that $|E_{i}|$ is exactly $|\mathcal{C}_{1}|\cdot|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|\geq\Omega(|\mathcal{C}_{1}|^{2-\delta})=\Omega(n_{i}^{2-\delta})$ .

•

Second, for every $\mathbf{v}_{1},\mathbf{v}_{2}$ both from $A_{i}$ or both from $B_{i}$ , we have $\mathbf{v}_{1}-\mathbf{v}_{2}\in\mathcal{C}_{1}\setminus\{\mathbf{0}\}$ . Thus, $\left<\tau(\mathbf{v}_{1}),\tau(\mathbf{v}_{2})\right>=q-\Delta(\mathbf{v}_{1},\mathbf{v}_{2})\leq q-\Delta(\mathcal{C}_{1})=K_{1}-1<(K_{2}-1)/3$ .

•

Third, for every $\mathbf{a}\in A_{i}$ and $\mathbf{b}\in B_{i}$ , we have $\mathbf{a}-\mathbf{b}\in\mathcal{C}_{2}\setminus\{\mathbf{0}\}$ . Thus, $\Delta(\mathbf{a},\mathbf{b})\geq\Delta(\mathcal{C}_{2})$ . Hence, $\left<\tau(\mathbf{a}),\tau(\mathbf{b})\right>=q-\Delta(\mathbf{a},\mathbf{b})\leq q-\Delta(\mathcal{C}_{2})=K_{2}-1$ . Moreover, the inequality is an equality if and only if $\Delta(\mathbf{a},\mathbf{b})=\Delta(\mathcal{C}_{2})$ , i.e., $(\mathbf{a},\mathbf{b})\in E_{i}$ as desired.

•

Finally, observe that the dimension is $q^{2}=(\log n_{i})^{O(1/\delta)}$ .

Once again, the running time of the construction is $O(|\mathcal{C}_{1}|\cdot|\mathcal{C}_{2}|\cdot q^{2})\leq n_{i}^{4+o(1)}$ . ∎

5.3 Gadgets based on AG Codes

In this subsection, we construct gadgets based on algebraic geometric (AG) codes. The definitions of AG Codes are well beyond the scope of this work and we refer the readers to [Sti08, VNT07] for more thorough introductions.

Once again to find a good center, we need a bound on the number of minimum weight codewords. On this front, we use the following bound101010Note that most of the proof of this bound was from [ABV01]; [Vlă18] simply makes the bound more explicit, which is more convenience for us. from [Vlă18]. Throughout this subsection, we follow the notations from [Vlă18].

Theorem 5.5 (Theorem 4.3 of [Vlă18]).

Let $q$ be a prime power, $X$ be a curve of genus $g$ over $\mathbb{F}_{q}$ , let $S\subseteq X(\mathbb{F}_{q})$ such that $|S|=N$ , and let $a\in\mathbb{N}$ with $1\leq a\leq N-1$ . Then, there exists an $\mathbb{F}_{q}$ -positive divisor $D\geq 0$ , $\deg(D)=a$ , such that the corresponding AG Code $\mathcal{C}=\mathcal{C}(X,D,S)$ has minimum distance $N-a$ and

[TABLE]

We also need the following well-known (central) fact about the parameters of AG codes.

Theorem 5.6.

Let $q$ be a prime power, $X$ be a curve of genus $g$ over $\mathbb{F}_{q}$ , let $S\subseteq X(\mathbb{F}_{q})$ such that $|S|=N$ , and let $a\in\mathbb{N}$ with $1\leq a\leq N-1$ . Then, the corresponding AG Code $C=C(X,D,S)$ is a linear code over $\mathbb{F}_{q}$ with block length $N$ , distance at least $N-a$ and message length $k\geq a-g+1$ .

Recall also the tower of functions of Garcia and Stichtenoth [GS96], whose parameters approach the TVZ bound. We note here that, it suffices for us to have the genus approaching $\Omega(N/\sqrt{q})$ and there are also other curves that satisfy this.

Theorem 5.7 ([GS96]).

For any $\zeta>0$ and any square of prime $q$ , there exists a dense sequence111111A sequence $(N_{i})_{i\in\mathbb{N}}$ of increasing positive integers is said to be dense if there exists a constant $C\geq 1$ such that $N_{i+1}\leq C\cdot N_{i}$ for all $i\in\mathbb{N}$ . $(N_{i})_{i\in\mathbb{N}}$ such that there exists a curve $X_{i}$ with genus at most $\frac{N_{i}}{\sqrt{q}-1}+\zeta$ where $|X_{i}(\mathbb{F}_{q})|\geq N_{i}$ .

Plugging the bound from [Vlă18] into the above family of curves immediately yields the following:

Lemma 5.8.

For any $\zeta>0$ and any square of prime $q$ , there exists a dense sequence $(N_{i})_{i\in\mathbb{N}}$ such that the following holds. For any $i\in\mathbb{N}$ and any $a_{1},a_{2}\in\mathbb{N}$ such that $1\leq a_{1}<a_{2}\leq N_{i}-1$ , there exists linear codes $\mathcal{C}_{1}\subseteq\mathcal{C}_{2}\subseteq\mathbb{F}_{q}^{N_{i}}$ such that the following holds, where $g_{i}=\frac{N_{i}}{\sqrt{q}-1}+\zeta$ :

•

$\mathcal{C}_{1}$ * has message length at least $a_{1}-g_{i}+1$ and distance at least $N_{i}-a_{1}$ .*

•

$\mathcal{C}_{2}$ * has message length at least $a_{2}-g_{i}+1$ and distance exactly $N_{i}-a_{2}$ and*

[TABLE]

Moreover, the generator matrices of $\mathcal{C}_{1},\mathcal{C}_{2}$ can be computed in $O\left(\binom{N+a_{2}-1}{a_{2}}\cdot|\mathcal{C}_{2}|\cdot{\rm{poly}}(N_{i})\right)$ time.

Proof.

Let $(N_{i})_{i\in\mathbb{N}}$ be a dense sequence as in Theorem 5.7. From Theorem 5.5, there exists an $\mathbb{F}_{q}$ -positive divisor $D_{2}$ of degree $a_{2}$ such that the corresponding code $\mathcal{C}_{2}=C(X_{i},D_{2},S_{i})$ (where $S\subseteq X_{i}(\mathbb{F}_{q})$ of size $N_{i}$ ) satisfies (3) and that its distance is $N_{i}-a_{2}$ ; from Theorem 5.6, its message length must also be at least $a_{2}-g_{i}+1$ . Next, let $D_{1}$ be any $\mathbb{F}_{q}$ -positive divisor of degree $a_{1}$ such that $D_{2}-D_{1}\geq 0$ . Let $\mathcal{C}_{1}=C(X_{i},D_{1},S_{i})$ be the corresponding AG code; once again, Theorem 5.6 yields the desired bounds on its message length and distance. Finally, observe that $D_{2}-D_{1}\geq 0$ implies that $\mathcal{C}_{1}\subseteq\mathcal{C}_{2}$ as desired.

The main bottleneck to algorithmically construct such codes lies in finding $D_{2}$ . Nevertheless, the total number of degree- $a_{2}$ $\mathbb{F}_{q}$ -positive divisor is only $\binom{N_{i}+a_{2}-1}{a_{2}}$ . We can use brute force to enumerate all of them and check whether the corresponding code satisfies (3), which further takes $|\mathcal{C}_{2}|$ time. This results in the claimed running time. ∎

Finally, we can now construct our gadgets, by an appropriate setting of parameters. In particular, $a_{1}$ and $a_{2}$ will be selected to be close to each other and to both be slightly larger than $N/\sqrt{q}$ . This results in the graphs whose degrees are roughly square root of the number of vertices.

Theorem 5.9.

For every $0<\delta<1$ , there exist $\mu>0$ and a log-dense sequence $(n_{i})_{i\in\mathbb{N}}$ such that, for every $i\in\mathbb{N}$ , there is a bipartite graph $G_{i}=(A_{i}\dot{\cup}B_{i},E_{i})$ where $|A_{i}|=|B_{i}|=n_{i}$ and $|E_{i}|\geq\Omega(n_{i}^{2-\delta})$ , such that $(1+\mu)$ - $\mathsf{cd}(G)=O(\log n_{i})$ . Moreover, for all $i\in\mathbb{N}$ , a $(\beta,1+\mu)$ -gap-realization $\tau:A_{i}\dot{\cup}B_{i}\to\{0,1\}^{O(\log n_{i})}$ of $G_{i}$ can be constructed in time $O(n_{i}^{3})$ for some $\beta=\Theta(\log n_{i})$ .

Proof.

Once again, the proof here is similar to those of Theorems 4.2 and 5.4, except that we use the (pairs of) AG codes from Lemma 5.8 instead of Reed-Solomon codes.

Let $q\geq 49$ be any sufficiently large square of prime and $\zeta>0$ be any sufficiently small positive real number (both to be precisely specified later).

Let $(N_{i})_{i\in\mathbb{N}}$ be the sequence guarantee by Lemma 5.8. Let $a_{1}=N_{i}\cdot\left(\frac{1}{q^{0.5(1-\delta)}}-\frac{1}{q}\right)$ and $a_{2}=\frac{N_{i}}{q^{0.5(1-\delta)}}$ . For convenience, we assume that $a_{1}$ and $a_{2}$ are integers121212Note that, for sufficiently large $N_{i}$ , one can take the ceilings (or floors) of the specified values to get integers with negligible affect to the calculations.. Let $\mathcal{C}_{1},\mathcal{C}_{2}$ be the codes given by Lemma 5.8. The sequence $(n_{i})_{i\in\mathbb{N}}$ is defined as $n_{i}=|\mathcal{C}_{1}|$ .

Applying Lemma 5.1 to $(\mathcal{C}_{1},\mathcal{C}_{2})$ implies that there exists $\mathbf{s}\in\mathcal{C}_{2}\setminus\mathcal{C}_{1}$ such that

[TABLE]

where $o(1)$ terms above denote the terms that go to zero as $q\to\infty$ and $\zeta\to 0$ . As a result, by picking $q$ sufficiently large and $\zeta$ sufficiently small, the term in (4) is at least $\Omega(|\mathcal{C}_{1}|^{-0.5-\delta})$ .

We construct the graph $G_{i}=(A_{i},B_{i},E_{i})$ and a realization $\tau$ as follows. Let $A_{i}=\mathcal{C}_{1},B_{i}=\{\mathbf{s}+\mathbf{c}\mid\mathbf{c}\in\mathcal{C}_{1}\}$ and $E_{i}=\{(\mathbf{a},\mathbf{b})\in A_{i}\times B_{i}\mid\Delta(\mathbf{a},\mathbf{b})=\Delta(\mathcal{C}_{2})\}$ . $G_{i}$ can be easily realized by applying the mapping $\psi:\mathbb{F}_{q}^{q}\to\{0,1\}^{q^{2}}$ from Proposition 3.12. More precisely, let $\tau$ be the restriction of $\psi$ on $A_{i}\cup B_{i}$ . Below we argue about the density of $G_{i}$ and that $\tau$ is a $(2\Delta(\mathcal{C}_{2}),1+\mu)$ -gap-realization of $G_{i}$ where $\mu=\frac{\Delta(\mathcal{C}_{1})-1}{\Delta(\mathcal{C}_{2})}-1$ . Note that

[TABLE]

Let us now check that $G_{i}$ and $\tau$ satisfy all the claimed properties:

•

First, notice that $|E_{i}|$ is exactly $|\mathcal{C}_{1}|\cdot|\mathcal{B}(\mathbf{s},\Delta(\mathcal{C}_{2}))\cap\mathcal{C}_{1}|\geq\Omega(|\mathcal{C}_{1}|^{1.5-\delta})=\Omega(n_{i}^{1.5-\delta})$ .

•

For any $\mathbf{v}_{1}=\psi(\mathbf{c}_{1}),\mathbf{v}_{2}=\psi(\mathbf{c}_{2})$ both from $X_{i}$ or both from $Y_{i}$ , we have $\mathbf{c}_{1}-\mathbf{c}_{2}\in\mathcal{C}_{1}\setminus\{\mathbf{0}\}$ . Hence, $\|\mathbf{v}_{1}-\mathbf{v}_{2}\|_{0}=2\cdot\Delta(\mathbf{v}_{1},\mathbf{v}_{2})\geq 2\cdot\Delta(\mathcal{C}_{1})>(1+\mu)\cdot(2\Delta(\mathcal{C}_{2}))$ .

•

Next, for every $\mathbf{a}\in A_{i}$ and $\mathbf{b}\in B_{i}$ , we have $\mathbf{a}-\mathbf{b}\in\mathcal{C}_{2}\setminus\{\mathbf{0}\}$ . Thus, $\Delta(\mathbf{a},\mathbf{b})\geq\Delta(\mathcal{C}_{2})$ . Hence, $\|\tau(\mathbf{a})-\tau(\mathbf{b})\|_{0}=2\Delta(\mathbf{a},\mathbf{b})\geq 2\Delta(\mathcal{C}_{2})$ . Moreover, the inequality is an equality if and only if $\Delta(\mathbf{a},\mathbf{b})=\Delta(\mathcal{C}_{2})$ , i.e., $(\mathbf{a},\mathbf{b})\in E_{i}$ as desired.

Given $\mathcal{C}_{1},\mathcal{C}_{2}$ , the running time of constructing $(X_{i},Y_{i})$ is $O(|\mathcal{C}_{1}|\cdot|\mathcal{C}_{2}|\cdot q^{2})=O(n_{i}^{3})$ . Moreover, the running time to construct $\mathcal{C}_{1}$ and $\mathcal{C}_{2}$ , as given by Lemma 5.8, is

[TABLE]

where the last two inequalities are true for any sufficiently large $q$ . ∎

6 Inapproximability of Maximum Inner Product

In this section, we prove the hardness of approximating $\mathsf{MIP}$ . Once again, we show a stronger version (than Theorem 1.6) where every point has Boolean coordinates, as stated below.

Theorem 6.1.

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ , there is no algorithm running in $O(n^{2-\varepsilon})$ time for $\gamma$ - $\mathsf{MIP}$ even for points in $\{0,1\}^{n^{o(1)}}$ , for any $\gamma\leq 2^{(\log n)^{1-o(1)}}$ .

The proof proceeds in two steps: first, we show hardness of approximating $\mathsf{MIP}$ in low dimension but with a small ( $1+o(1)$ ) approximation factor. Second, we use tensor product operation to amplify the gap to be almost polynomial, as stated in Theorem 6.1. More specifically, in the first step, we prove the following:

Theorem 6.2.

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ , there exists $s_{\varepsilon}>0$ such that no algorithm running in $O(n^{2-\varepsilon})$ time can solve $\left(1+\frac{1}{\log\log n}\right)$ - $\mathsf{MIP}$ even for points in $\{0,1\}^{\left(\log n\right)^{s_{\varepsilon}}}$ .

Note that the factor $\frac{1}{\log\log n}$ is not significant, and this can be replaced by any $o(1)$ factor; we use this just to make the calculations more concrete. Before we move on to the proof of Theorem 6.2, let us first show how it implies Theorem 6.1.

Proof of Theorem 6.1 from Theorem 6.2.

Let $(P,\alpha)$ be an instance of $\left(1+\frac{1}{\log\log n}\right)$ - $\mathsf{MIP}$ where $P\subseteq\{0,1\}^{\left(\log n\right)^{s_{\varepsilon}}}$ . For $t=\frac{\log n}{(\log\log n)^{2}}$ , define $P^{\prime}=\{\mathbf{x}^{\otimes t}\mid\mathbf{x}\in P\},\alpha^{\prime}=\alpha^{t}$ and $\gamma=\left(1+\frac{1}{\log\log n}\right)^{t}=2^{(\log n)^{1-o(1)}}$ . The dimension of points in $P^{\prime}$ is $(\log n)^{s_{\varepsilon}\cdot t}=n^{o(1)}$ . Moreover, it is easy to check, based on the identity $\left<\mathbf{x}^{\otimes t},\mathbf{y}^{\otimes t}\right>=\left<\mathbf{x},\mathbf{y}\right>^{t}$ , that $(P^{\prime},\alpha^{\prime})$ is a YES (resp. no) instance of $\gamma$ -MIP iff $(P,\alpha)$ is a YES (resp. NO) instance of $\left(1+\frac{1}{\log\log n}\right)$ - $\mathsf{MIP}$ .

In other words, if there is an $O(n^{2-\varepsilon})$ time algorithm for $\gamma$ - $\mathsf{MIP}$ in $n^{o(1)}$ dimension, then there also exist an $O(n^{2-\varepsilon})$ subquadratic time algorithm for $\left(1+\frac{1}{\log\log n}\right)$ - $\mathsf{MIP}$ in $(\log n)^{s_{\varepsilon}}$ dimension. Thus, Theorem 6.1 follows from Theorem 6.2. ∎

The rest of this section is devoted to proving Theorem 6.2. To do so, we consider the gap- $\mathsf{Additive\text{-}BMIP}$ problem.

Definition 6.3 ( $\gamma$ - $\mathsf{Additive\text{-}BMIP}$ problem).

Let $\gamma\geq 0$ . In the $\gamma$ - $\mathsf{Additive\text{-}BMIP}$ problem we are given two sets $A,B$ each of $n$ points in $\{0,1\}^{d}$ and an integer $\alpha\in[d]$ as input, and the goal is to distinguish between the following two cases.

•

Completeness.* There exists $(a,b)\in A\times B$ such that $\langle a,b\rangle\geq\alpha$ .*

•

Soundness.* For every $(a,b)\in A\times B$ we have $\langle a,b\rangle<\alpha-\gamma$ .*

We need the below hardness result from [Rub18]. Note that the result is stated differently in [Rub18]; for how the result in [Rub18] implies the one below, see Section 3.2 of [Che18a].

Theorem 6.4 ([Rub18]).

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ , there is no algorithm running in $O(n^{2-\varepsilon})$ time for the $\gamma$ - $\mathsf{Additive\text{-}BMIP}$ problem, for any $d=\omega(\log n)$ and $\gamma=o(d)$ .

Proof of Theorem 6.2.

For any $\varepsilon>0$ , let $C_{\text{exp}}$ be the constant such that the dimension of $\tau$ in Theorem 5.4 is at most $(\log n_{i})^{C_{\text{exp}}/\varepsilon}$ for $\delta=\varepsilon/2$ . We define $s_{\varepsilon}$ as $2\cdot C_{\text{exp}}/\varepsilon+2$ .

Suppose contrapositively that there exists $\varepsilon>0$ and an algorithm $\mathcal{A}$ that can solve $\left(1+\frac{1}{\log\log n}\right)$ - $\mathsf{MIP}$ of dimension $(\log n)^{s_{\varepsilon}}$ in time $n^{2-\varepsilon}$ . We will construct an algorithm $\mathcal{A}^{\prime}$ that solves $(\log n)$ - $\mathsf{Additive\text{-}BMIP}$ in time $n^{2-\varepsilon^{\prime}}$ for some constant $\varepsilon^{\prime}>0$ (to be specified below) for $d=(\log n\sqrt{\log\log n})$ dimensions. Together with Theorem 6.4, this implies that $\mathsf{OVH}$ is false, as desired.

Let $C_{\varepsilon}$ denote the constant of the log-dense sequence from Theorem 5.4 for $\delta=\varepsilon/2$ , and let $\varepsilon^{\prime}$ be $0.01\cdot\varepsilon/C_{\varepsilon}$ . The algorithm $\mathcal{A}^{\prime}$ on input $(A,B,\alpha)$ where $A,B\subseteq\{0,1\}^{d},\alpha\in[d]$ works as follows:

Let $n^{\prime}$ be the largest number in the sequence from Theorem 5.4 with $\delta=\varepsilon/2$ s.t. $n^{\prime}\leq n^{0.1}$ . 2. 2.

Let $G^{\prime}=(A^{\prime}\dot{\cup}B^{\prime},E^{\prime})$ be the graph from Theorem 5.4 with $|A^{\prime}|=|B^{\prime}|=n^{\prime}$ , $|E^{\prime}|\geq\Omega((n^{\prime})^{2-\delta})$ , and $\tau:A^{\prime}\dot{\cup}B^{\prime}\to\{0,1\}^{(\log n^{\prime})^{C_{\text{exp}}/\varepsilon}}$ be a $(\beta,3)$ -gap- $\mathsf{IP}$ -relization of $G^{\prime}$ where $\beta\in\mathbb{N}$ . 3. 3.

We use the algorithm from Lemma 3.11 to find $\pi_{1},\dots,\pi_{k}$ where $k=O((n^{\prime})^{\delta}\log n^{\prime})$ such that the union of $E_{G^{\prime}_{\pi_{1}}},\ldots,E_{G^{\prime}_{\pi_{k}}}$ is $E_{K_{n^{\prime},n^{\prime}}}$ 4. 4.

We assume w.l.o.g. that $n$ is divisible by $n^{\prime}$ . Partition $A$ and $B$ into $A_{1},\dots,A_{n/n^{\prime}}$ and $B_{1},\dots,B_{n/n^{\prime}}$ each of size $n^{\prime}$ . For each $i,j\in[n/n^{\prime}],t\in[k]$ , do the following:

(a)

Let $\tau_{t}$ be an appropriate permutation of $\tau$ that $(\beta,3)$ -gap- $\mathsf{IP}$ -realizes $G^{\prime}_{\pi_{t}}$ . 2. (b)

Let $\alpha^{\prime}=\beta\cdot\alpha+3d\cdot\beta$ , and define $A_{i}^{t},B_{j}^{t}$ as

[TABLE] 3. (c)

Run $\mathcal{A}$ on $(A_{i}^{t}\dot{\cup}B_{j}^{t},\alpha^{\prime})$ . If $\mathcal{A}$ outputs YES, then output YES and terminate. 5. 5.

If none of the executions of $\mathcal{A}$ returns with YES, then output NO.

Observe that the bottleneck in the running time of the algorithm is in the executions of $\mathcal{A}$ . The number of executions is $(n/n^{\prime})^{2}\cdot k$ and each execution takes $O((n^{\prime})^{2-\varepsilon})$ time. Hence, in total the running time of the algorithm $\mathcal{A}^{\prime}$ is $O((n/n^{\prime})^{2}\cdot k\cdot(n^{\prime})^{2-\varepsilon})\leq O(n^{2}\log n\cdot(n^{\prime})^{-\varepsilon/2})$ . Now, from the log-density of the sequence from Theorem 5.4, we have $n^{\prime}\geq n^{0.1/C_{\varepsilon}}=n^{10\varepsilon^{\prime}/\varepsilon}$ . As a result, the running time of $\mathcal{A}$ is at most $O(n^{2-5\varepsilon^{\prime}}\log n)\leq O(n^{2-\varepsilon^{\prime}})$ as desired.

To see the correctness of the algorithm, first observe that the dimensions of vectors in $A_{i}^{t},B_{j}^{t}$ are at most $\beta\cdot d+3d\cdot(\log n^{\prime})^{C_{\text{exp}}/\varepsilon}$ which is at most $(\log{n})^{s_{\varepsilon}}$ for any sufficiently large $n$ ; that is, the calls to $\mathcal{A}$ are valid. Next, observe that, if $(A,B,\alpha)$ is a YES instance of $\mathsf{Additive\text{-}BMIP}$ , there must be $i,j\in[n/n^{\prime}]$ and $\mathbf{a}^{*}\in A_{i},\mathbf{b}^{*}\in B_{j}$ such that $\left<\mathbf{a}^{*},\mathbf{b}^{*}\right>$ is at least $\alpha$ . Since $G^{\prime}_{\pi_{1}},\dots,G^{\prime}_{\pi_{k}}$ covers $K_{n^{\prime},n^{\prime}}$ , there must be $t\in[k]$ such that $\left<\tau_{t}(\mathbf{a}^{*}),\tau_{t}(\mathbf{b}^{*})\right>\geq\beta$ . As a result, $\left<(\mathbf{1}_{\beta}\otimes\mathbf{a}^{*})\circ(\mathbf{1}_{3d}\otimes\tau_{t}(\mathbf{a}^{*}),(\mathbf{1}_{\beta}\otimes\mathbf{b}^{*})\circ(\mathbf{1}_{3d}\otimes\tau_{t}(\mathbf{b}^{*}))\right>\geq\beta\cdot\alpha+3d\cdot\beta=\alpha^{\prime}$ . Thus, $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ is a YES instance for $\mathsf{MIP}$ and $\mathcal{A}^{\prime}$ outputs YES as desired.

Finally, let us assume that $(A,B,\alpha)$ is a NO instance of $(\log n)$ - $\mathsf{Additive\text{-}BMIP}$ . Consider any $i,j\in[n/n^{\prime}]$ and $t\in[k]$ . To argue that $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ is a NO instance for $\left(1+\frac{1}{\log\log{n^{\prime}}}\right)$ - $\mathsf{MIP}$ , we have to show that any two points in $A_{i}^{t}\cup B_{j}^{t}$ have inner product less than $\alpha^{\prime}/\left(1+\frac{1}{\log\log{n^{\prime}}}\right)$ . To see this, let us consider two cases.

The two points are either both from $A_{i}^{t}$ or both from $B_{j}^{t}$ . Assume w.l.o.g. that the two points are from $A_{i}^{t}$ ; let them be $(\mathbf{1}_{\beta}\otimes\mathbf{a})\circ(\mathbf{1}_{3d}\otimes\tau_{t}(\mathbf{a}))$ and $(\mathbf{1}_{\beta}\otimes\mathbf{a}^{\prime})\circ(\mathbf{1}_{3d}\otimes\tau_{t}(\mathbf{a}^{\prime}))$ . Recall that, from Theorem 5.4, we must have $\left<\tau_{t}(\mathbf{a}),\tau_{t}(\mathbf{a}^{\prime})\right><\beta/3$ . Moreover, since $\mathbf{a},\mathbf{a}^{\prime}\in\{0,1\}^{d}$ , we have $\left<\mathbf{a},\mathbf{a}^{\prime}\right>\leq d$ . Thus, we can conclude that

[TABLE]

which is less than $\alpha^{\prime}/\left(1+\frac{1}{\log\log{n^{\prime}}}\right)$ for any sufficiently large $n$ . 2. 2.

One of the point is from $A_{i}^{t}$ and the other from $B_{j}^{t}$ . Let them be $(\mathbf{1}_{\beta}\otimes\mathbf{a})\circ(\mathbf{1}_{3d}\otimes\tau_{t}(\mathbf{a}))$ and $(\mathbf{1}_{\beta}\otimes\mathbf{b})\circ(\mathbf{1}_{3d}\otimes\tau_{t}(\mathbf{b}))$ . Since $(A,B,\alpha)$ is a NO instance of $(\log n)$ - $\mathsf{Additive\text{-}BMIP}$ , we must have $\left<\mathbf{a},\mathbf{b}\right><\alpha-\log n$ . Furthermore, from Theorem 5.4, we must have $\left<\tau_{t}(\mathbf{a}),\tau_{t}(\mathbf{b})\right>\leq\beta$ . Combining the two implies that

[TABLE]

where the second-to-last inequality holds for any sufficiently large $n$ .

Hence, $(A_{i}^{t}\dot{\cup}B_{j}^{t},\alpha^{\prime})$ must be a NO instance for $\left(1+\frac{1}{\log\log{n^{\prime}}}\right)$ - $\mathsf{MIP}$ for every $t\in[k]$ and $i,j\in[n/n^{\prime}]$ . Thus, $\mathcal{A}^{\prime}$ outputs NO as desired. ∎

7 Inapproximability of Closest Pair

In this section, we prove the hardness of approximating $\mathsf{CP}$ (Theorem 1.5). As usual, we reduce from the bichromatic version of the problem, and the lower bound for the bichromatic version is stated below:

Theorem 7.1 (Rubinstein [Rub18]).

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ there exists $\kappa>0$ such that there is no algorithm running in $n^{2-\varepsilon}$ time for $(1+\kappa)$ - $\mathsf{BCP}$ in the Hamming metric. Moreover, this holds even for instances $(A,B,\alpha)$ of $(1+\kappa)$ - $\mathsf{BCP}$ when $d=\Theta_{\varepsilon}(\log n),\alpha=\Theta_{\varepsilon}(\log n)$ and $A,B\subseteq\{0,1\}^{d}$ .

Again, we prove below the inapproximability of the gap- $\mathsf{CP}$ problem for Boolean vectors. Clearly, this immediately implies Theorem 1.5.

Theorem 7.2.

Assuming $\mathsf{OVH}$ , for every $\varepsilon>0$ , there exists $\theta>0$ and $c>0$ such that there is no algorithm running in $n^{1.5-\varepsilon}$ time for $(1+\theta)$ - $\mathsf{CP}$ in the Hamming metric for point-set in $\{0,1\}^{c\cdot\log n}$ .

Proof.

Assume towards a contradiction that there exists an $\varepsilon>0$ and an algorithm $\mathcal{A}$ that, for every $\theta>0$ solves $(1+\theta)$ - $\mathsf{CP}$ of dimension $c\cdot\log n$ in time $O(n^{1.5-\varepsilon})$ , where $c:=c(\varepsilon)$ is a constant that will be specified later. Let $\varepsilon^{\prime}>0$ be a small constant (depending on $\varepsilon$ ) that we will specify below and let $\kappa=\kappa(\varepsilon^{\prime})$ be as in Theorem 7.1. We construct below an algorithm $\mathcal{A}^{\prime}$ that solves $(1+\kappa)$ - $\mathsf{BCP}$ in time $O(n^{2-\varepsilon^{\prime}})$ for any instance $(A,B,\alpha)$ such that $A,B\subseteq\{0,1\}^{O(\log n)}$ and $\alpha=\Theta(\log n)$ . Together with Theorem 7.1, this implies that $\mathsf{OVH}$ is false, as desired.

Let $C_{\varepsilon}$ denote the constant of the log-dense sequence from Theorem 5.9 for $\delta=\varepsilon/2$ , and let $\varepsilon^{\prime}$ be $0.01\cdot\varepsilon/C_{\varepsilon}$ . Let $\mu$ be the constant from Theorem 5.9. Select $\theta>0$ be a sufficiently small constant such that $\frac{\mu-\theta}{1+\theta}>\frac{\theta}{\kappa-\theta}$ .

The algorithm $\mathcal{A}^{\prime}$ on $(A,B,\alpha)$ where $A,B\subseteq\{0,1\}^{O(\log n)},\alpha=\Theta(\log n)$ works as follows:

Let $n^{\prime}$ be the largest number in the sequence from Theorem 5.9 with $\delta=\varepsilon/2$ s.t. $n^{\prime}\leq n^{0.1}$ . 2. 2.

Let $G^{\prime}=(A^{\prime}\dot{\cup}B^{\prime},E^{\prime})$ be the graph from Theorem 5.9 with $|A^{\prime}|=|B^{\prime}|=n^{\prime}$ , $|E^{\prime}|\geq\Omega((n^{\prime})^{1.5-\delta})$ , and $\tau:A^{\prime}\dot{\cup}B^{\prime}\to\{0,1\}^{O(\log n^{\prime})}$ be a $(\beta,1+\mu)$ -gap-relization of $G^{\prime}$ where $\beta\in\mathbb{N}$ and $\beta=\Theta(\log n^{\prime})$ . 3. 3.

We use the algorithm from Lemma 3.11 to find $\pi_{1},\dots,\pi_{k}$ where $k=O((n^{\prime})^{0.5+\delta}\log n^{\prime})$ such that the union of $E_{G^{\prime}_{\pi_{1}}},\ldots,E_{G^{\prime}_{\pi_{k}}}$ is $E_{K_{n^{\prime},n^{\prime}}}$ 4. 4.

We assume w.l.o.g. that $n$ is divisible by $n^{\prime}$ . Partition $A$ and $B$ into $A_{1},\dots,A_{n/n^{\prime}}$ and $B_{1},\dots,B_{n/n^{\prime}}$ each of size $n^{\prime}$ . For each $i,j\in[n/n^{\prime}],t\in[k]$ , do the following:

(a)

Let $\tau_{t}$ be an appropriate permutation of $\tau$ that $(\beta,1+\mu)$ -gap-realizes $G^{\prime}_{\pi_{t}}$ . 2. (b)

Pick $r_{1},r_{2}$ such that

[TABLE]

Notice that the upper and lower bounds are $\Theta(1)$ and they are also $\Theta(1)$ apart. Hence, we can pick these $r_{1},r_{2}$ so that $r_{1},r_{2}=\Theta(1)$ . 3. (c)

Let $\alpha^{\prime}=r_{1}\cdot\alpha+r_{2}\cdot\beta$ and define $A_{i}^{t},B_{j}^{t}$ as

[TABLE] 4. (d)

Run $\mathcal{A}$ on $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ . If $\mathcal{A}$ outputs YES, then output YES and terminate. 5. 5.

If none of the executions of $\mathcal{A}$ returns with YES, then output NO.

Observe that the bottleneck in the running time of the algorithm is in the executions of $\mathcal{A}$ . The number of executions is $(n/n^{\prime})^{2}\cdot k$ and each execution takes $O((n^{\prime})^{1.5-\varepsilon})$ time. Hence, in total the running time of the algorithm $\mathcal{A}^{\prime}$ is $O((n/n^{\prime})^{2}\cdot k\cdot(n^{\prime})^{1.5-\varepsilon})\leq O(n^{2}\log n\cdot(n^{\prime})^{-\varepsilon/2})$ . Now, from the log-density of the sequence from Theorem 5.9, we have $n^{\prime}\geq n^{0.1/C_{\varepsilon}}=n^{10\varepsilon^{\prime}/\varepsilon}$ . As a result, the running time of $\mathcal{A}$ is at most $O(n^{2-5\varepsilon^{\prime}}\log n)\leq O(n^{2-\varepsilon})$ as desired.

To see the correctness of the algorithm, first observe that the dimensions of vectors in $A_{i}^{t},B_{j}^{t}$ are at most $r_{1}\cdot\alpha+r_{2}\cdot\beta$ which is $O(\log{n^{\prime}})$ ; that is, the calls to $\mathcal{A}$ are valid. Next, observe that, if $(A,B,\alpha)$ is a YES instance of $\mathsf{BCP}$ , there must be $i,j\in[n/n^{\prime}]$ and $\mathbf{a}^{*}\in A_{i},\mathbf{b}^{*}\in B_{j}$ such that $\|\mathbf{a}^{*}-\mathbf{b}^{*}\|_{0}$ is at most $\alpha$ . Since $G^{\prime}_{\pi_{1}},\dots,G^{\prime}_{\pi_{k}}$ covers $K_{n^{\prime},n^{\prime}}$ , there must be $t\in[k]$ such that $\|\tau_{t}(\mathbf{a}^{*})-\tau_{t}(\mathbf{b}^{*})\|_{0}\leq\beta$ . As a result, $\|((\mathbf{1}_{r_{1}}\otimes\mathbf{a}^{*})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{a}^{*}))-((\mathbf{1}_{r_{1}}\otimes\mathbf{b}^{*})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{b}^{*})))\|_{0}\leq r_{1}\cdot\alpha+r_{2}\cdot\beta=\alpha^{\prime}$ . Thus, $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ is a YES instance for $\mathsf{CP}$ and $\mathcal{A}^{\prime}$ outputs YES as desired.

Finally, let us assume that $(A,B,\alpha)$ is a NO instance of $(1+\kappa)$ - $\mathsf{BCP}$ . Consider any $i,j\in[n/n^{\prime}]$ and $t\in[k]$ . To argue that $(A_{i}^{t}\cup B_{j}^{t},\alpha^{\prime})$ is a NO instance for $(1+\theta)$ - $\mathsf{CP}$ , we have to show that any two points in $A_{i}^{t}\cup B_{j}^{t}$ have distance more than $\alpha^{\prime}$ . To see this, let us consider two cases.

Both points are either from $A_{i}^{t}$ or from $B_{j}^{t}$ . Assume w.l.o.g. that they are from $A_{i}^{t}$ ; let them be $(\mathbf{1}_{r_{1}}\otimes\mathbf{a})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{a}))$ and $(\mathbf{1}_{r_{1}}\otimes\mathbf{a}^{\prime})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{a}^{\prime}))$ . Recall that, from the definition of $X^{\prime}_{t}$ and Theorem 5.9, we must have $\|\tau_{t}(\mathbf{a})-\tau_{t}(\mathbf{a}^{\prime})\|_{0}>(1+\mu)\cdot\beta$ . Thus, the Hamming distance between the two points is more than $r_{2}\cdot(1+\mu)\cdot\beta\geq(1+\theta)\cdot\alpha^{\prime}$ , where the inequality comes from our choice of $r_{1},r_{2}$ . 2. 2.

One of the point is from $A_{i}^{t}$ and the other from $B_{j}^{t}$ . Let them be $(\mathbf{1}_{r_{1}}\otimes\mathbf{a})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{a}))$ and $(\mathbf{1}_{r_{1}}\otimes\mathbf{b})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{b}))$ . Since $(A,B,\alpha)$ is a NO instance of $(1+\kappa)$ - $\mathsf{BCP}$ , $\|\mathbf{a}-\mathbf{b}\|_{0}>(1+\kappa)\cdot\alpha$ . Moreover, from definition of $\tau_{t}$ , we must have $\|\tau_{t}(\mathbf{a})-\tau_{t}(\mathbf{b})\|_{0}\geq\beta$ . Combining the two implies that the distance between $(\mathbf{1}_{r_{1}}\otimes\mathbf{a})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{a}))$ and $(\mathbf{1}_{r_{1}}\otimes\mathbf{b})\circ(\mathbf{1}_{r_{2}}\otimes\tau_{t}(\mathbf{b}))$ is more than $r_{1}\cdot(1+\kappa)\cdot\alpha+r_{2}\cdot\beta\geq(1+\theta)\cdot\alpha^{\prime}$ , where the inequality is once again from our choice of $r_{1},r_{2}$ .

Hence, $(A_{i}^{t}\dot{\cup}B_{j}^{t},\alpha^{\prime})$ must be a NO instance for $(1+\theta)$ - $\mathsf{CP}$ for every $t\in[k]$ and $i,j\in[n/n^{\prime}]$ . Thus, $\mathcal{A}^{\prime}$ outputs NO as desired. ∎

8 Discussion and Open Questions

It remains open to completely resolve Open Questions 1.1 and 1.2. It is still possible that our framework can be used to resolve these problems: we just need to construct gadgets with better parameters! In particular, to resolve Question 1.1, we have to improve the dimension bound in Theorem 4.2 to $O_{\delta}(\log n_{i})$ . For Question 1.2, we just have to improve the bound on the number of pairs in (3) of Theorem 5.9 to $\Omega(n_{i}^{2-\delta})$ . Following our observation from Lemma 5.1, this motivates us to ask the following purely coding theoretic question:

Open Question 8.1.

For every $0<\delta<1$ , are there linear codes $\mathcal{C}_{1}\subseteq\mathcal{C}_{2}\subseteq\mathbb{F}_{q}^{N}$ both of block length $N$ over alphabet $\mathbb{F}_{q}$ such that the following holds:

•

$\Delta(\mathcal{C}_{1})\geq(1+f(\delta))\cdot\Delta(\mathcal{C}_{2})$ , for some $f:(0,1)\to(0,1)$ .

•

$|A_{\Delta(\mathcal{C}_{2})}(\mathcal{C}_{2})|/|\mathcal{C}_{2}|\geq|\mathcal{C}_{1}|^{-\delta}$ .

Apart from the aforementioned questions, Rubinstein [Rub18] pointed out an interesting obstacle, aptly dubbed the “triangle inequality barrier”, to obtain fine-grained lower bounds against 3-approximation algorithms for $\mathsf{BCP}$ (see Open Question 3 in [Rub18]). In the case of $\mathsf{CP}$ , this barrier turns out to be against 2-approximation algorithms as noted in [DKL18]. We reiterate this below as an open problem to be resolved:

Open Question 8.2.

Can we show that assuming $\mathsf{SETH}$ , for some constant $\varepsilon>0$ , no algorithm running in time $n^{1+\varepsilon}$ can solve 2- $\mathsf{CP}$ in any metric when the points are in $\omega(\log n)$ dimensions?

Another interesting direction is to extend the hardness of $\mathsf{MIP}$ to the $k$ -vector generalization of the problem, called $k$ - $\mathsf{MIP}$ . In $k$ - $\mathsf{MIP}$ , we are given a set of $n$ points $P\subseteq\mathbb{R}^{d}$ and we would like to select $k$ distinct points $\mathbf{a}_{1},\dots,\mathbf{a}_{k}\in P$ that maximizes

[TABLE]

It is known that the $k$ -chromatic variant of $k$ - $\mathsf{MIP}$ is hard to approximate (see Appendix B of [KLM18]) but this is not known to be true for $k$ - $\mathsf{MIP}$ itself. Our approach seems quite compatible to tackling this problem as well; in particular, if we can construct a certain (natural) generalization of our gadget for $\mathsf{MIP}$ , then we would immediately arrive at the inapproximability of $k$ - $\mathsf{MIP}$ even for $\{0,1\}$ -entries vectors. The issue in constructing this gadget is that we are now concerned about agreements of more than two vectors, which does not correspond to error-correcting codes anymore and some additional tools are needed to argue for this more general case.

It should be noted that the hardness of approximating $k$ - $\mathsf{MIP}$ for $\{0,1\}$ -entry vectors is equivalent to the one-sided $k$ -biclique problem [Lin18], in which a bipartite graph is given and the goal is to select $k$ vertices on the right that maximize the number of their common neighbors. The equivalence can be easily seen by viewing the coordinates as the left-hand-side vertices and the vectors as the right-hand-side vertices. The one-sided $k$ -biclique is shown to be $\mathsf{W[1]}$ -hard to approximate by Lin [Lin18] who also showed a lower bound of $n^{\Omega(\sqrt{k})}$ for the problem assuming $\mathsf{ETH}$ . If the generalization of our gadget for $k$ - $\mathsf{MIP}$ works as intended, then this lower bound can be improved to $n^{\Omega(k)}$ under $\mathsf{ETH}$ and even $n^{k-o(1)}$ under $\mathsf{SETH}$ .

The one-sided $k$ -biclique is closely related to the (two-sided) $k$ -biclique problem, where we are given a bipartite graph and we wish to decide whether it contains $K_{k,k}$ as a subgraph. The $k$ -biclique problem was consider a major open problem in parameterized complexity (see e.g., [DF13]) until it was shown by Lin to be $\mathsf{W[1]}$ -hard [Lin18]. Nevertheless, the running time lower bound known is still not tight: currently, the best lower bound known for this problem is $n^{\Omega(\sqrt{k})}$ both for the exact version (under $\mathsf{ETH}$ ) [Lin18] and its approximate variant (under $\mathsf{Gap}$ - $\mathsf{ETH}$ ) [CCK*+*17]. It remains an interesting open question to close the gap between the above lower bounds and the trivial upper bound of $n^{O(k)}$ . Progresses on the one-sided $k$ -biclique problem could lead to improved lower bounds for $k$ -biclique problem too, although several additional steps have to be taken care of.

Acknowledgements

We are grateful to Madhu Sudan for extremely helpful and informative discussion about AG codes; in particular, Madhu pointed us to [Vlă18]. We thank Bundit Laekhanukit and Or Meir for general discussions, and the Simons Institute for their wonderful work-space. Finally, we would like to thank Lijie Chen for sharing [CW19], and Orr Paradise for useful comments on an earlier draft of this manuscript.

Appendix A Lower Bound on Gap Closest Pair in Edit Distance Metric

In this section we prove Theorem 1.7. The proof is almost identical to Rubinstein’s [Rub18] proof for the $\mathsf{OVH}$ -hardness of gap- $\mathsf{BCP}$ in the edit distance metric and uses the following technical tool established in [Rub18].

Lemma A.1 (Rubinstein [Rub18]).

For large enough $d\in\mathbb{N}$ , there is a function $\zeta:\{0,1\}^{d}\to\{0,1\}^{d^{\prime}}$ , where $d^{\prime}=O(d\log d)$ , such that for all $a,b\in\{0,1\}^{d}$ the following holds for some constant $\lambda>0$ :

[TABLE]

Moreover, for any $a\in\{0,1\}^{d}$ , $\zeta(a)$ can be computed in $2^{o(d)}$ time.

At a high level, $\zeta$ picks a random $O(\log d)$ -bit string $s_{i,x}$ uniformly and independently for every $(i,x)\in[d]\times\{0,1\}$ , and for every vector $u\in\{0,1\}^{d}$ , replaces the $i^{\text{th}}$ coordinate $u_{i}$ by $s_{i,u_{i}}$ . The claims in the lemma statement follow by the known concentration bounds on the edit distance of random strings [McD89, Lue09]. This construction is further efficiently derandomized by using $\log d$ -wise independent strings [Kop13].

Proof of Theorem 1.7.

We show that if there exists an algorithm $\mathcal{A}$ running in time $O(n^{1.5-\varepsilon})$ for some $\varepsilon>0$ that can solve $(1+\delta)$ - $\mathsf{CP}$ in the edit distance metric for some $\delta>0$ over point-sets in $\{0,1\}^{d^{\prime}}$ , then $\mathcal{A}$ can be used to solve $(1+\delta-o(1))$ - $\mathsf{CP}$ in the Hamming metric in time $O(n^{1.5-\varepsilon})$ over point-sets in $\{0,1\}^{d}$ , where $d^{\prime}=O(d\log d)$ . Together with Theorem 7.2, this implies that $\mathsf{OVH}$ is false, as desired.

Let $(P,\alpha)$ be an instance of $(1+\delta)$ - $\mathsf{CP}$ in the Hamming metric over point-sets in $\{0,1\}^{d}$ . It is clear131313In fact, one can design a $2^{\alpha}\cdot n\log n$ time algorithm for $\mathsf{CP}$ in the Hamming metric, and therefore to assume $\mathsf{OVH}$ , we require $\alpha=\Omega(d)$ . from the proofs of Theorem 7.1 and Theorem 7.2 that $\alpha=\Omega(d)$ . We now define an instance of $(P^{\prime},\alpha^{\prime}:=(1+o(1))\cdot\lambda\log d\cdot\alpha)$ of $(1+\delta-o(1))$ - $\mathsf{CP}$ in the edit distance metric as follows. Recall the function $\zeta$ from Lemma A.1 and define the set $P^{\prime}=\{\zeta(p)\mid p\in P\}$ . Notice that for every pair of distinct points $p,q\in P$ , we have $\left|\mathsf{ed}(\zeta(p),\zeta(q))=\lambda\cdot\log d\cdot\|p-q\|_{0}\right|=o(d^{\prime})$ . In other words if we had a pair of distinct points $p,q$ in $P$ such that $\|p-q\|_{0}\leq\alpha$ then, $\mathsf{ed}(\zeta(p),\zeta(q))\leq\lambda\log d\cdot\alpha+o(d^{\prime})=(1+o(1))\cdot\lambda\log d\cdot\alpha$ and suppose for all pairs of distinct points $p,q\in P$ we had $\|p-q\|_{0}>(1+\delta)\cdot\alpha$ then $\mathsf{ed}(\zeta(p),\zeta(q))>\lambda\log d\cdot(1+\delta)\cdot\alpha-o(d^{\prime})>(1+\delta-o(1))\lambda\log d\cdot\alpha$ , since $\alpha=\Omega(d)$ . This completes the analysis of the completeness and soundness cases, and we can conclude that running $\mathcal{A}$ on input $(P^{\prime},\alpha^{\prime})$ solves the instance $(P,\alpha)$ of $(1+\delta)$ - $\mathsf{CP}$ in the Hamming metric. ∎

Appendix B Covering Biclique By Isomorphic Graphs: Proof of Lemma 3.11

Below we prove Lemma 3.11. The proof strategy is similar to how the greedy approximation algorithms for the set cover problem are analyzed: we show that at each step, we can pick a graph isomorphic to $G$ that covers at least $|E_{G}|/n^{2}$ fraction of the remaining edges of the biclique. By doing so, we guarantee that the process ends in $O(\log n)\cdot n^{2}/|E_{G}|$ steps. Note however that, there are exponential number of isomorphisms and thus we cannot simply enumerate all isomorphisms to find one that covers the desired fraction of uncovered edges. Nevertheless, it is not hard to see that we can use the method of conditional expectation to find one such isomorphism in polynomial time. This is formalized below.

Lemma B.1.

For any two bipartite graphs $G=(A\dot{\cup}B,E_{G})$ and $H=(A\dot{\cup}B,E_{H})$ , there exists a side-preserving permutation $\pi:A\dot{\cup}B\to A\dot{\cup}B$ such that

[TABLE]

Moreover, such a permutation $\pi$ can be found (deterministically) in $O((|A|+|B|)^{4})$ time.

Proof.

Notice that, if we pick $\pi|_{A}$ and $\pi|_{B}$ randomly among all permutations of $A$ and $B$ respectively, then, for a fixed $(a,b)\in E_{H}$ , the probability that $(a,b)$ belongs to $E_{G_{\pi}}$ is $\frac{|E_{G}|}{|A|\cdot|B|}$ . Thus,

[TABLE]

This proves the existence part of the claim. To deterministically find such a $\pi$ , we use the method of conditional expectation. Suppose $A\dot{\cup}B=\{1,\dots,n\}$ . The algorithm works as follows:

Let $V_{\text{assigned}}\leftarrow\emptyset$ . 2. 2.

For $i=1,\dots,n$ :

(a)

If $i\in A$ , let $V_{\text{candidate}}=A\setminus V_{\text{assigned}}$ . Otherwise, if $i\in B$ , let $V_{\text{candidate}}=B\setminus V_{\text{assigned}}$ . 2. (b)

For each $k\in V_{\text{candidate}}$ , compute the conditional expectation:

[TABLE]

Let $k^{*}$ be the maximizer for the above conditional expectation. We set $\pi^{*}(i)=k^{*}$ . 3. 3.

Output $\pi^{*}$ .

It is simple to see that the conditional expectation never decreases as we fill in the permutation. As a result, we must have $|E_{H}\cap E_{G_{\pi}}|\geq\frac{|E_{G}|\cdot|E_{H}|}{|A|\cdot|B|}$ as desired. Moreover, it is easy to see that the conditional expectation can be computed in time $O(|A|\cdot|B|)$ because, for each edge $(a,b)\in E_{H}$ , we can compute the probability that $(a,b)\in E_{G_{\pi}}$ in $O(1)$ time. As a result, the overall running time of the algorithm is $O((|A|+|B|)^{4})$ . ∎

Finally using Lemma B.1, we prove Lemma 3.11 using the strategy outlined earlier in this section.

Proof of Lemma 3.11.

We describe below an algorithm for finding $\pi_{1},\dots,\pi_{k}$ . It works as follows.

Let $k\leftarrow 0$ . 2. 2.

While $E_{H}:=E_{K_{n,n}}\setminus\underset{i\in[k]}{\cup}E_{G_{\pi_{i}}}$ is non-empty, do the following:

(a)

Let $k\leftarrow k+1$ . 2. (b)

Let $H=(A\dot{\cup}B,E_{H})$ . 3. (c)

Use the algorithm from Lemma B.1 to find $\pi_{k}$ such that $|E_{H}\cap E_{G_{\pi_{k}}}|\geq|E_{H}|\cdot\frac{|E_{G}|}{n^{2}}$ . 3. 3.

Output $\pi_{1},\dots,\pi_{k}$ .

It is obvious that the permutations are all side-preserving permutations and that the union of $E_{G_{\pi_{i}}}$ over $i\in[k]$ is equal to $E_{K_{n,n}}$ . To see that $k\leq\frac{2n^{2}\ln n}{|E_{G}|}+1$ , observe that due to the guarantee of Lemma B.1, $|E_{H}|$ decreases by a multiplicative factor of (at most) $(1-|E_{G}|/n^{2})\leq e^{-|E_{G}|/n^{2}}$ for each permutation picked. Since the set $E_{H}$ remains non-empty after $k-1$ permutations are picked, we have $e^{-(k-1)\cdot|E_{G}|/n^{2}}\cdot n^{2}\geq 1$ , which implies that $k\leq 2n^{2}\ln n/|E_{G}|+1$ as desired. Finally, the bottleneck in the running time is Step 2c; we execute this step $k$ times and each execution takes $O(n^{4})$ time. Thus, the total running time is $O(nk)=O(n^{6}\log n)$ . ∎

Bibliography74

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[ABV 01] Alexei E. Ashikhmin, Alexander Barg, and Serge G. Vladut. Linear codes with exponentially many light vectors. J. Comb. Theory, Ser. A , 96(2):396–399, 2001.
2[AC 09] Nir Ailon and Bernard Chazelle. The fast johnson–lindenstrauss transform and approximate nearest neighbors. SIAM J. Comput. , 39(1):302–322, 2009. Preliminary version in STOC’06.
3[ACW 16] Josh Alman, Timothy M. Chan, and R. Ryan Williams. Polynomial representations of threshold functions and algorithmic applications. In IEEE 57th Annual Symposium on Foundations of Computer Science, FOCS 2016, 9-11 October 2016, Hyatt Regency, New Brunswick, New Jersey, USA , pages 467–476, 2016.
4[AESW 91] Pankaj K. Agarwal, Herbert Edelsbrunner, Otfried Schwarzkopf, and Emo Welzl. Euclidean minimum spanning trees and bichromatic closest pairs. Discrete & Computational Geometry , 6:407–422, 1991. Preliminary version in So CG’90.
5[Alp 10] Ethem Alpaydin. Introduction to Machine Learning . The MIT Press, 2nd edition, 2010.
6[ARW 17a] Amir Abboud, Aviad Rubinstein, and Ryan Williams. Distributed PCP theorems for hardness of approximation in P. Co RR , abs/1706.06407, 2017.
7[ARW 17b] Amir Abboud, Aviad Rubinstein, and Ryan Williams. Distributed PCP theorems for hardness of approximation in P. In FOCS , pages 25–36, 2017.
8[AW 15] Josh Alman and Ryan Williams. Probabilistic polynomials and hamming nearest neighbors. In IEEE 56th Annual Symposium on Foundations of Computer Science, FOCS 2015, Berkeley, CA, USA, 17-20 October, 2015 , pages 136–150, 2015.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

On Closest Pair in Euclidean Metric:

Abstract

Contents

1 Introduction

Open Question 1.1** **(Abboud-Rubinstein-Williams222Please see the erratum in [ARW17a]. [ARW17b], Williams [Wil18a], David

Open Question 1.2**.**

Open Question 1.3**.**

1.1 Our Results

Theorem 1.4** (Subquadratic Hardness of CP\mathsf{CP}CP; Informal, See Theorem 4.3).**

Theorem 1.5** (Subquadratic Hardness of gap-CP\mathsf{CP}CP).**

Theorem 1.6** (Subquadratic Hardness of gap-MIP\mathsf{MIP}MIP).**

Theorem 1.7** (Subquadratic Hardness of gap-CP\mathsf{CP}CP in edit distance metric).**

2 Proof Overview

2.1 Conditional Lower Bound on Exact Closest Pair

Understanding an obstacle of [DKL18].

Definition 2.1** (Contact Dimension [Pac80]).**

Overcoming the Obstacle: Beyond Biclique.

Constructing a dense bipartite graph with low contact dimension.

2.2 Abstracting the Construction via Error-Correcting Codes

Dense Bipartite Graph with Low Contact Dimension from Codes.

Finding Center from Another Code.

Comparison to Locally Dense Codes.

2.3 Inapproximability of Closest Pair and Maximum Inner Product

2.3.1 Approximate Maximum Inner Product

2.3.2 Approximate Closest Pair

3 Preliminaries

3.1 Notations, Problems and Fine-Grained Hypotheses

Distance Measures.

Problems.

Definition 3.1** (Orthogonal Vectors Problem, OV\mathsf{OV}OV).**

Definition 3.2** (Closest Pair Problem, CP\mathsf{CP}CP).**

Definition 3.3** (Bichromatic Closest Pair Problem, BCP\mathsf{BCP}BCP).**

Definition 3.4** (Maximum Inner Product Problem, MIP\mathsf{MIP}MIP).**

Definition 3.5** (Bichromatic Maximum Inner Product Problem, BMIP\mathsf{BMIP}BMIP).**

Hypotheses.

Definition 3.6** (Strong Exponential Time Hypothesis, SETH\mathsf{SETH}SETH [IP01, IPZ01, CIP06]).**

Definition 3.7** (Orthogonal Vector Hypothesis, OVH\mathsf{OVH}OVH).**

3.2 Error-Correcting Codes

Theorem 3.8** (Singleton bound [Sin64]).**

Definition 3.9** (MDS Codes).**

3.3 Miscellaneous Tools

Covering Biclique by Isomorphic Graphs.

Definition 3.10**.**

Lemma 3.11**.**

Translating Finite Fields Vectors to {0, 1}-Vectors.

Proposition 3.12**.**

Proof.

3.4 OVH\mathsf{OVH}OVH-hardness of Exact Bichromatic Closest Pair

Theorem 3.13**.**

Proof.

Claim 3.14**.**

Proof of Claim 3.14.

3.5 Contact Dimension of a Graph

Definition 3.15** (Contact Dimension [Pac80]).**

Theorem 3.16** (Frankl-Maehara [FM88]).**

Theorem 3.17** (David-Karthik-Laekhanukit [DKL18]).**

Definition 3.18** (Gap Contact Dimension).**

Definition 3.19** (Gap Inner Product Dimension).**

4 Lower Bound on Closest Pair under Orthogonal Vector Hypothesis

Definition 4.1**.**

Theorem 4.2**.**

Theorem 4.3** (Subquadratic Hardness of {0,1}\{0,1\}{0,1}-CP\mathsf{CP}CP).**

Proof.

5 Gadget Constructions

5.1 Finding a Center of a Code via Another Code

Lemma 5.1**.**

Proof.

5.2 Gadgets based on Reed-Solomon Codes

Theorem 5.2** (Reed-Solomon Codes).**

Lemma 5.3**.**

5.2.1 The Basic Gadget: Dense Bipartite Graphs with Low Contact Dimensions

Proof of Theorem 4.2.

5.2.2 A Gadget for Maximum Inner Product

Open Question 1.1 (Abboud-Rubinstein-Williams222Please see the erratum in [ARW17a]. [ARW17b], Williams [Wil18a], David

Open Question 1.2.

Open Question 1.3.

Theorem 1.4 (Subquadratic Hardness of $\mathsf{CP}$ ; Informal, See Theorem 4.3).

Theorem 1.5 (Subquadratic Hardness of gap- $\mathsf{CP}$ ).

Theorem 1.6 (Subquadratic Hardness of gap- $\mathsf{MIP}$ ).

Theorem 1.7 (Subquadratic Hardness of gap- $\mathsf{CP}$ in edit distance metric).

Definition 2.1 (Contact Dimension [Pac80]).

Definition 3.1 (Orthogonal Vectors Problem, $\mathsf{OV}$ ).

Definition 3.2 (Closest Pair Problem, $\mathsf{CP}$ ).

Definition 3.3 (Bichromatic Closest Pair Problem, $\mathsf{BCP}$ ).

Definition 3.4 (Maximum Inner Product Problem, $\mathsf{MIP}$ ).

Definition 3.5 (Bichromatic Maximum Inner Product Problem, $\mathsf{BMIP}$ ).

Definition 3.6 (Strong Exponential Time Hypothesis, $\mathsf{SETH}$ [IP01, IPZ01, CIP06]).

Definition 3.7 (Orthogonal Vector Hypothesis, $\mathsf{OVH}$ ).

Theorem 3.8 (Singleton bound [Sin64]).

Definition 3.9 (MDS Codes).

Definition 3.10.

Lemma 3.11.

Proposition 3.12.

3.4 $\mathsf{OVH}$ -hardness of Exact Bichromatic Closest Pair

Theorem 3.13.

Claim 3.14.

Definition 3.15 (Contact Dimension [Pac80]).

Theorem 3.16 (Frankl-Maehara [FM88]).

Theorem 3.17 (David-Karthik-Laekhanukit [DKL18]).

Definition 3.18 (Gap Contact Dimension).

Definition 3.19 (Gap Inner Product Dimension).

Definition 4.1.

Theorem 4.2.

Theorem 4.3 (Subquadratic Hardness of $\{0,1\}$ - $\mathsf{CP}$ ).

Lemma 5.1.

Theorem 5.2 (Reed-Solomon Codes).

Lemma 5.3.

Theorem 5.4.

Theorem 5.5 (Theorem 4.3 of [Vlă18]).

Theorem 5.6.

Theorem 5.7 ([GS96]).

Lemma 5.8.

Theorem 5.9.

Theorem 6.1.

Theorem 6.2.

Definition 6.3 ( $\gamma$ - $\mathsf{Additive\text{-}BMIP}$ problem).

Theorem 6.4 ([Rub18]).

Theorem 7.1 (Rubinstein [Rub18]).

Theorem 7.2.

Open Question 8.1.

Open Question 8.2.

Lemma A.1 (Rubinstein [Rub18]).

Lemma B.1.