Rounding semidefinite programs for large-domain problems via Brownian   motion

Kevin L. Chang; Alantha Newman

arXiv:1812.03572·cs.DS·December 11, 2018

Rounding semidefinite programs for large-domain problems via Brownian motion

Kevin L. Chang, Alantha Newman

PDF

Open Access

TL;DR

This paper introduces a novel rounding method for semidefinite programming relaxations in large-domain problems, leveraging Brownian motion to improve approximate solutions for angular synchronization.

Contribution

It proposes a simple, Brownian motion-based rounding scheme for SDP relaxations, specifically applied to angular synchronization problems, with conjectured near-optimal guarantees.

Findings

01

The rounding scheme is feasible and effective based on computational evidence.

02

It achieves approximation guarantees close to the best possible under the Unique-Games Conjecture.

03

The method simplifies the rounding process for large-domain SDP problems.

Abstract

We present a new simple method for rounding a semidefinite programming relaxation of a constraint satisfaction problem. We apply it to the problem of approximate angular synchronization. Specifically, we are given directed distances on a circle (i.e., directed angles) between pairs of elements and our goal is to assign the elements to positions on a circle so as to preserve these distances as much as possible. The feasibility of our rounding scheme is based on properties of the well-known stochastic process called Brownian motion. Based on computational and other evidence, we conjecture that this rounding scheme yields an approximation guarantee that is very close to the best-possible guarantee (assuming the Unique-Games Conjecture).

Equations119

max ij \in E \sum (1 - \frac{2 \cdot min {( x _{j} - x _{i} - d _{ij} ) mod p , - ( x _{j} - x _{i} - d _{ij} ) mod p }}{p}) .

max ij \in E \sum (1 - \frac{2 \cdot min {( x _{j} - x _{i} - d _{ij} ) mod p , - ( x _{j} - x _{i} - d _{ij} ) mod p }}{p}) .

max

max

v_{i}^{a} \cdot v_{i}^{b}

v_{i}^{a} \cdot v_{j}^{b}

v_{i}^{a} \cdot v_{i}^{a}

v_{i}^{k}

max

max

v_{i}^{a} \cdot v_{i}^{b}

v_{i}^{a} \cdot v_{j}^{b}

v_{i}^{a} \cdot v_{i}^{a}

v_{i}^{k}

w_{k} = i = 0 \sum k \frac{2}{p} e_{i} .

w_{k} = i = 0 \sum k \frac{2}{p} e_{i} .

(\frac{( v _{i}^{k} - v _{i}^{k - 1} )}{2}) \cdot (\frac{( v _{i}^{k} - v _{i}^{k - 1} )}{2})

(\frac{( v _{i}^{k} - v _{i}^{k - 1} )}{2}) \cdot (\frac{( v _{i}^{k} - v _{i}^{k - 1} )}{2})

(v_{i}^{a} - v_{i}^{b}) \cdot (v_{i}^{c} - v_{i}^{d})

(v_{i}^{a} - v_{i}^{b}) \cdot (v_{i}^{c} - v_{i}^{d})

max ij \in E \sum k \in P \sum

max ij \in E \sum k \in P \sum

u_{ih} \cdot u_{j k}

u_{ih} \cdot u_{ik}

u_{ih} \cdot u_{j k}

u_{ih} \cdot u_{ih}

∣ h \in P \sum u_{ih} - k \in P \sum u_{j k} ∣^{2}

u_{ih}

v_{i}^{k}

v_{i}^{k}

Pr [τ_{b} \in d t] = \frac{b}{2 π t ^{3/2}} exp (- \frac{b ^{2}}{2 t}) d t .

Pr [τ_{b} \in d t] = \frac{b}{2 π t ^{3/2}} exp (- \frac{b ^{2}}{2 t}) d t .

W^{\prime}_{t}=\left\{\begin{array}[]{ll}W_{t}&\mbox{for $0\leq t\leq\tau_{b}$}\\ 2b-W_{t}&\mbox{for $t\geq\tau_{b}$}\end{array}\right.

W^{\prime}_{t}=\left\{\begin{array}[]{ll}W_{t}&\mbox{for $0\leq t\leq\tau_{b}$}\\ 2b-W_{t}&\mbox{for $t\geq\tau_{b}$}\end{array}\right.

a^{'} - 2 w_{k}^{i^{'}} = v^{k^{'}} \cdot r, w_{k}^{i^{'}} = \frac{a ^{'} - v ^{k^{'}} \cdot r}{2} .

a^{'} - 2 w_{k}^{i^{'}} = v^{k^{'}} \cdot r, w_{k}^{i^{'}} = \frac{a ^{'} - v ^{k^{'}} \cdot r}{2} .

a - 2 w_{k}^{i} = v^{k} \cdot r, w_{k}^{i} = \frac{a - v ^{k} \cdot r}{2} .

a - 2 w_{k}^{i} = v^{k} \cdot r, w_{k}^{i} = \frac{a - v ^{k} \cdot r}{2} .

Pr [A ∣ X = x] = \frac{Pr [ A \mbox an d X \in d x ]}{Pr [ X \in d x ]} .

Pr [A ∣ X = x] = \frac{Pr [ A \mbox an d X \in d x ]}{Pr [ X \in d x ]} .

Pr [H^{+} \leq 1 \mbox or H^{-} \leq 1] = \int_{- \infty}^{\infty} Pr [H^{+} \leq 1 \mbox or H^{-} \leq 1 ∣ W_{1} = a] ϕ (a) d a .

Pr [H^{+} \leq 1 \mbox or H^{-} \leq 1] = \int_{- \infty}^{\infty} Pr [H^{+} \leq 1 \mbox or H^{-} \leq 1 ∣ W_{1} = a] ϕ (a) d a .

\int_{1}^{\infty} Pr [H^{+} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{1}^{\infty} ϕ (a) d a \geq .158655.

\int_{1}^{\infty} Pr [H^{+} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{1}^{\infty} ϕ (a) d a \geq .158655.

Pr [H^{+} \leq 1 \mbox or H^{-} \leq 1 ∣ W_{1} = a]

Pr [H^{+} \leq 1 \mbox or H^{-} \leq 1 ∣ W_{1} = a]

Pr [H^{+} \leq 1 ∣ W_{1} = a] = \frac{Pr [ W _{1}^{'} \in d x ( 1 )]}{Pr [ W _{1} \in d a ]} = \frac{ϕ ( 1 )}{ϕ ( a )} .

Pr [H^{+} \leq 1 ∣ W_{1} = a] = \frac{Pr [ W _{1}^{'} \in d x ( 1 )]}{Pr [ W _{1} \in d a ]} = \frac{ϕ ( 1 )}{ϕ ( a )} .

\int_{- 1}^{1} Pr [H^{+} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} Pr [H^{-} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} ϕ (1) d a \geq .483941.

\int_{- 1}^{1} Pr [H^{+} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} Pr [H^{-} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} ϕ (1) d a \geq .483941.

Pr [H^{+} \leq 1 \mbox an d H^{-} \leq 1 ∣ W_{1} = a]

Pr [H^{+} \leq 1 \mbox an d H^{-} \leq 1 ∣ W_{1} = a]

Pr [H_{2}^{+} \leq 1 ∣ W_{1} = a]

Pr [H_{2}^{+} \leq 1 ∣ W_{1} = a]

Pr [H_{2}^{+} \leq 1 ∣ W_{1} = a]

Pr [H_{2}^{+} \leq 1 ∣ W_{1} = a]

\int_{- 1}^{1} Pr [H_{2}^{+} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} ϕ (2 + a) d a \leq .157305.

\int_{- 1}^{1} Pr [H_{2}^{+} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} ϕ (2 + a) d a \leq .157305.

\int_{- 1}^{1} Pr [H_{2}^{-} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} ϕ (2 + a) d a \leq .157305.

\int_{- 1}^{1} Pr [H_{2}^{-} \leq 1 ∣ W_{1} = a] ϕ (a) d a = \int_{- 1}^{1} ϕ (2 + a) d a \leq .157305.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Complexity and Algorithms in Graphs · Constraint Satisfaction and Optimization

Full text

Rounding semidefinite programs for large-domain problems

via Brownian motion

Kevin L. Chang Research done at Max-Planck-Institut für Informatik, Saarbrücken, Germany. [email protected].

Alantha Newman Research done at Max-Planck-Institut für Informatik, Saarbrücken, Germany. CNRS-Université Grenoble Alpes. [email protected].

Abstract

We present a new simple method for rounding a semidefinite programming relaxation of a constraint satisfaction problem. We apply it to the problem of approximate angular synchronization studied in [MN11]. Specifically, we are given directed distances on a circle (i.e., directed angles) between pairs of elements and our goal is to assign the elements to positions on a circle so as to preserve these distances as much as possible. The feasibility of our rounding scheme is based on properties of the well-known stochastic process called Brownian motion. Based on computational and other evidence, we conjecture that this rounding scheme yields an approximation guarantee that is very close to the best-possible guarantee (assuming the Unique-Games Conjecture).

1 Introduction

We present an alternate approach to the Relaxed Linear Equations $\bmod~{}p$ (Rel-Lin-Eq) problem studied in [MN11]. Our new approach is also based on rounding a semidefinite programming relaxation but uses a different rounding technique. Based on computational evidence and other justification, we believe this approach has essentially the same approximation guarantee of $.854$ for Rel-Lin-Eq as proven for a different algorithm presented in [MN11].

We are given a set $E$ of equations in the form of $x_{j}-x_{i}\equiv d_{ij}(\bmod~{}p)$ . Let ${\cal X}=\{x_{i}\}$ be the set of elements and let $n=|{\cal X}|$ . We assign each element in ${\cal X}$ a (integral) value from the set $[0,p)$ . For a fixed assignment, an equation has value $x_{j}-x_{i}\equiv d_{ij}\pm y_{ij}(\bmod~{}p)$ , for $y_{ij}\leq p/2$ . Since $y_{ij}$ can have a nonnegative value of at most $p/2$ , we divide $y_{ij}$ by $p/2$ in order to obtain a normalized value between 0 and 1. Our goal is to find an assignment that maximizes the sum $\sum_{ij\in E}(1-2y_{ij}/s)$ . More precisely, we formulate our objective function as follows.

[TABLE]

This problem generalizes the Max-Cut problem: For any edge $ij$ in a graph, we write the constraint $x_{j}-x_{i}\equiv p/2(\bmod~{}p)$ . Then an $\alpha$ -approximation to Rel-Lin-Eq yields an $\alpha$ -approximation for Max-Cut. It can be viewed as an approximate version of the angular synchronization problem studied by Singer [Sin11]. Originally, our motivation was to develop rounding methods for constraint satisfaction problems whose solutions to assignment-constraint based SDPs can have the property that no pair of assignment vectors have a high dot product despite the solution having a high objective value. (e.g., A problem with this property is Maximum Acyclic Subgraph, which has since been proven to be Unique-Games hard to approximate to within a factor greater than $\frac{1}{2}$ [GHM*+*11]. On the other hand, Unique Games does not have this property.)

Our rounding scheme is based on properties of the well-known stochastic process called Brownian motion. Theoretically, this procedure could be applied to other problems that can be modeled using the standard assignment-constraint based SDP framework (i.e., SDP formulation $(P^{+})$ in Section 2.1). However, it does seem tailor-made for our particular objective function. It is also reminiscent of “sticky” random walks used in constructive approaches to discrepancy minimization [Ban10], although these results focus on assigning each element a binary value and one of our main motivations was to study how to approximate large-domain problems.

1.1 Organization

After presenting our quadratic formulations and relaxations in Section 2, we present our rounding procedure in Section 3. In Section 4, we discuss Brownian motion and how it relates to our rounding procedure. Then we prove that our rounding procedure is feasible most of the time; precisely, at least $.96$ of the time, it assigns a (non-random) position to a variable. First, we prove this for a continuous process (Section 5) and then for a discrete process (Section 6). Finally, in Section 7, we state a conjecture regarding the correlation of two random walks, which is supported by extensive computational experiments. A positive resolution to this conjecture would be one way to prove that this rounding procedure has a guarantee close to the best-known guarantee of $.854$ [MN11] (and close to the best-possible guarantee of $.878$ under the Unique-Games Conjecture).

2 Quadratic Programs

For each variable $x_{i}$ , we have a set of $p$ unit vectors, $v_{i}^{0},v_{i}^{1},v_{i}^{2},\dots,v_{i}^{p-1}$ , for a total of $pn$ vectors. For $b>a$ , let $d(a,b)=\min\{b-a,p-(b-a)\}$ . Note that $d(a,b)=d(b,a)$ . Let ${\cal F}_{p}$ denote a particular (fixed) set of $p$ vectors with the property that for $v_{a},v_{b}\in{\cal F}_{p}$ , $v_{a}\cdot v_{b}=1-\frac{4d(a,b)}{p}$ . For example, if $p=8$ , then the set ${\cal F}_{p}$ can be the following eight vectors.

[TABLE]

This formula for creating such a set ${\cal F}_{p}$ of vectors can be generalized for any even value of $p$ (where the absolute value of each coordinate of each vector is $\frac{\sqrt{2}}{\sqrt{p}}$ ). We obtain the following quadratic program for Rel-Lin-Eq. Let $P$ denote the set of integers in $[0,p)$ .

A Quadratic Program ( $\mathit{Q}$ ):

$\displaystyle\max$ $\displaystyle\sum_{ij\in E}\frac{1+v_{i}^{0}\cdot v_{j}^{d_{ij}}}{2}$

$\displaystyle v_{i}^{a}\cdot v_{i}^{b}$ $\displaystyle=1-\frac{4\cdot d(a,b)}{p},$ $\displaystyle\forall x_{i}\in{\cal X},~{}a,b\in P,$

(1)

$\displaystyle v_{i}^{a}\cdot v_{j}^{b}$ $\displaystyle=v_{i}^{k+a}\cdot v_{j}^{k+b},$ $\displaystyle\forall x_{i},x_{j}\in{\cal X},~{}a,b,k\in P,$

(2)

$\displaystyle v_{i}^{a}\cdot v_{i}^{a}$ $\displaystyle=1,$ $\displaystyle\forall x_{i}\in{\cal X},a\in P,$

(3)

$\displaystyle v_{i}^{k}$ $\displaystyle\in{\cal F}_{p},$ $\displaystyle\forall x_{i}\in{\cal X},k\in P.$

(4)

For each variable $x_{i}\in{\cal X}$ , the corresponding set of $p$ vectors has the same configuration up to rotation, reflection and translation. This is enforced by Constraints (1) and (3). In an integral solution, the set of $p$ vectors corresponding to variable $x_{i}$ is identical to the set of $p$ vectors corresponding to the variable $x_{j}$ , for all variables $x_{i},x_{j}\in{\cal X}$ . This follows from the fact that all vectors belong to ${\cal F}_{p}$ . The only difference is that vectors in the two sets may have different labels (i.e., one set of vectors can be viewed as a rotation of the other set). Then the relative values or positions of two variables only depends on the rotations of the labels. In other words, if for variables $x_{i}$ and $x_{j}$ , the same vectors have the same labels, then the two variables will be assigned to the same position. If $v_{i}^{k}=v_{j}^{k+3}$ and $x_{i}$ is in position $k$ , $x_{j}$ should be in position $k+3$ . Given an integral solution, we can determine the position of each variable by picking a vector, and assigning each variable to the label to which that vector corresponds for that variable.

2.1 Semidefinite Relaxations

To obtain a semidefinite relaxation of ( $Q$ ), we remove Constraint (4) and require only that each $v_{i}^{k}\in{\mathbbm{R}}^{pn}$ . Note that even in the semidefinite relaxation, the set of $p$ vectors corresponding to a particular variable $x_{i}$ have the same configuration for each variable up to rotation, reflection and translation. We refer to the set of vectors corresponding to a variable $x_{i}$ as a constellation $C_{i}$ . We show that certain properties hold for each constellation.

A Semidefinite Program ( $\mathit{P}$ ):

$\displaystyle\max$ $\displaystyle\sum_{ij\in E}\frac{1+v_{i}^{0}\cdot v_{j}^{d_{ij}}}{2}$

$\displaystyle v_{i}^{a}\cdot v_{i}^{b}$ $\displaystyle=1-\frac{4\cdot d(a,b)}{p},$ $\displaystyle\forall x_{i}\in{\cal X},~{}a,b\in P,$

(5)

$\displaystyle v_{i}^{a}\cdot v_{j}^{b}$ $\displaystyle=v_{i}^{k+a}\cdot v_{j}^{k+b},$ $\displaystyle\forall x_{i},x_{j}\in{\cal X},~{}a,b,k\in P,$

(6)

$\displaystyle v_{i}^{a}\cdot v_{i}^{a}$ $\displaystyle=1,$ $\displaystyle\forall x_{i}\in{\cal X},a\in P,$

(7)

$\displaystyle v_{i}^{k}$ $\displaystyle\in{\mathbbm{R}}^{pn},$ $\displaystyle\forall x_{i}\in{\cal X},k\in P.$

(8)

Let ${\bf v_{p}}$ be a vector in ${\mathbbm{R}}^{\frac{p}{2}}$ in which each entry is $\frac{\sqrt{2}}{\sqrt{p}}$ . Let ${\bf e}_{i}\in\mathbbm{R}^{\frac{p}{2}}$ be the indicator vector which has a $1$ in the $i^{th}$ position and 0 elsewhere. For $k$ such that $0\leq k\leq p/2$ , we define $w_{k}$ as follows:

[TABLE]

Definition 1.

Let the constellation $C_{0}$ be the set of $s$ unit vectors $\{v_{0}^{0},v_{0}^{1},v_{0}^{2},\dots v_{0}^{p-1}\}$ defined as follows. For $k$ such that $0\leq k\leq p/2$ , define $v_{0}^{k}=-2\cdot w_{k}+{\bf v_{p}}$ . For $k$ such that $p/2<k<p$ , let $v_{0}^{k}=-v_{0}^{k-p/2}$ .

Lemma 1.

For any $x_{i}\in{\cal X}$ , the constellation $C_{i}=\{v_{i}^{0},v_{i}^{1},\dots,v_{i}^{p-1}\}$ is equivalent to the constellation $C_{0}$ up to rotation, reflection and translation.

Proof.

Let $v_{ik}=\frac{(v_{i}^{k}-v_{i}^{k-1})}{2}$ . Without loss of generality, let us assume that $v_{ik}=\frac{\sqrt{2}}{\sqrt{p}}{\bf e}_{k}$ for $1\leq k\leq p/2$ . We can assume this since for all $k\in P$ , we have (i) $||v_{ik}||=\frac{\sqrt{2}}{\sqrt{p}}$ (Lemma 2) and (ii) $v_{ij}\cdot v_{ik}=0$ for all $j,k\in P$ such that $j\neq k$ (Lemma 3).

Note that $v_{i}^{k}=v_{i}^{k-1}+2\cdot v_{ik}$ . This implies that $v_{i}^{p/2}=-\sum_{k=1}^{p/2}v_{ik}$ and that $v_{i}^{0}=\sum_{k=1}^{p/2}v_{ik}$ . Thus, there is some rotation of the vectors in $C_{i}$ such that the resulting set of vectors is equivalent to $C_{0}$ .∎

Lemma 2.

For all $x_{i}\in{\cal X}$ and $k\in P$ , $||\frac{(v_{i}^{k}-v_{i}^{k-1})}{2}||=\frac{\sqrt{2}}{\sqrt{p}}$ .

Proof.

[TABLE]

∎

Lemma 3.

For $x_{i}\in{\cal X}$ and for $a,b,c,d\in[0,p/2]$ , the vectors $(v_{i}^{a}-v_{i}^{b})\cdot(v_{i}^{c}-v_{i}^{d})=0$ if $[a,b]$ and $[c,d]$ are non-overlapping intervals.

Proof.

[TABLE]

This equals 0 if the intervals $[a,b]$ and $[c,d]$ are non-overlapping, since then $d(a,c)+d(b,d)=d(a,d)+d(b,c)$ .∎

Another way to write a semidefinite program is to use a standard formulation based on assignment constraints (e.g., see Quadratic Program $(Q_{2})$ in [MN11]).

A Semidefinite Program ( $\mathit{P^{+}}$ ):

$\displaystyle\max\sum_{ij\in E}\sum_{k\in P}$ $\displaystyle\left(p-2d(k,d_{ij})\right)u_{i0}\cdot u_{jk}$

$\displaystyle u_{ih}\cdot u_{jk}$ $\displaystyle\geq 0,$ $\displaystyle x_{i},x_{j}\in{\cal X},h,k\in P,$

(9)

$\displaystyle u_{ih}\cdot u_{ik}$ $\displaystyle=0,$ $\displaystyle x_{i},x_{j}\in{\cal X},h,k\in P,$

(10)

$\displaystyle u_{ih}\cdot u_{jk}$ $\displaystyle=u_{ih+a}\cdot u_{jk+a},$ $\displaystyle x_{i},x_{j}\in{\cal X},h,k,a\in P,$

(11)

$\displaystyle u_{ih}\cdot u_{ih}$ $\displaystyle=\frac{1}{p},$ $\displaystyle\forall x_{i}\in{\cal X},$

(12)

$\displaystyle|\sum_{h\in P}u_{ih}-\sum_{k\in P}u_{jk}|^{2}$ $\displaystyle=0,$ $\displaystyle\forall x_{i},x_{j}\in{\cal X},$

(13)

$\displaystyle u_{ih}$ $\displaystyle\in\mathbbm{R}^{pn},$ $\displaystyle\forall x_{i}\in{\cal X},h\in P.$

(14)

Given a solution for $(P^{+})$ , we can construct a solution for $(P)$ as follows.

[TABLE]

It is not difficult to see that the transformation in (15) preserves the objective value. In our computational experiments, we used solutions for $(P^{+})$ , which are more constrained than solutions for $(P)$ (e.g., Constraint (9) is not implied by the constraints in $(P)$ ). However, we feel it is somewhat clearer to present the rounding algorithm in the next section based on a solution for $(P)$ .

2.2 Relaxation on an Arbitrarily Large Domain

Note that we can replace $p$ with an arbitrarily large constant $s$ . Suppose $s$ is a multiple of $p$ (i.e., $s=\ell p$ ). Then we can scale each constraint so that $x_{j}-x_{i}\equiv d_{ij}(\bmod~{}p)$ becomes $x_{j}-x_{i}\equiv\ell d_{ij}(\bmod~{}s)$ . The optimal objective value of the original and the scaled problem are the same. Moreover, given a solution for $(P)$ on the domain of size $p$ , we can create a solution for the scaled problem on the domain of size $s=\ell p$ with the same objective value without resolving the relaxation $(P)$ . Thus, we can assume that $s$ is an extremely large constant. We assume this since our rounding algorithms work best on a large domain.

3 Rounding the Relaxation

Our algorithm for Rel-Lin-Eq is based on rounding a solution for the semidefinite relaxation ( $P$ ) presented in Section 2.1. The first issue is, how do we use the constellation of vectors $C_{i}$ to determine the position or value of variable $x_{i}$ ? We will consider the following random process with $s$ steps. Let $r\in{\mathbbm{R}}^{sn}$ be a random vector in which each coordinate is chosen according to the normal distribution ${\cal N}(0,1)$ . We can view the $s$ values $r\cdot v_{i}^{0},r\cdot v_{i}^{1},\dots,r\cdot v_{i}^{s-1}$ as a discrete random process in which the expected correlation of $r\cdot v_{i}^{a}$ and $r\cdot v_{i}^{b}$ is given by the dot product $v_{i}^{a}\cdot v_{i}^{b}$ .

Let us view these $s$ values as a discrete random process on the interval $[0,s]$ . For a subinterval $[t,t^{\prime}]$ , we say time step $q$ is in the interval $[t,t^{\prime}]$ if $d(s\cdot t/2,s\cdot q/2)\leq d(s\cdot t/2,s\cdot t^{\prime}/2)$ and $d(s\cdot t^{\prime}/2,s\cdot q/2)\leq d(s\cdot t/2,s\cdot t^{\prime}/2)$ .

Given such a random process, we say that there is an extreme sign change with threshold $\alpha$ between times $t$ and $t^{\prime}$ if $v^{i}_{t}\cdot r\leq-\alpha$ , $v^{i}_{t^{\prime}}\cdot r\geq\alpha$ and $v^{i}_{q}\cdot r<\alpha$ for all $q\in[t,t^{\prime}]$ . Our algorithm is based on the observation that in this random process, it is very likely that there is exactly one extreme sign change for the threshold $\alpha=1$ (i.e., there do not exist two disjoint intervals that both contain extreme sign changes). This is stated in Theorem 1. If this random process has exactly one extreme sign change, then we say that the process first reaches a threshold $\alpha$ at time $t$ if there is an interval $[t^{\prime},t]$ such that $v^{i}_{t}\cdot r\geq\alpha$ , $v^{i}_{t^{\prime}}\cdot r\leq-\alpha$ , and $v^{i}_{q}\cdot r<\alpha$ for all $q\in[t^{\prime},t]$ . Note that these intervals are taken modulo $s$ (i.e., they are intervals on a circle).

Definition 2.

An extreme sign change with threshold $\alpha$ in the sequence $\{v_{i}^{0}\cdot r,v_{i}^{1}\cdot r,\dots,v_{i}^{s-1}\cdot r\}$ occurs when $v^{i}_{t_{1}}\cdot r\leq-\alpha$ and $v^{i}_{t_{2}}\cdot r\geq\alpha$ and for no value of $t:t_{1}<t<t_{2}$ is $v^{i}_{t}\cdot r\geq\alpha$ .

Theorem 1.

If $s$ is a sufficiently large constant, then with probability at least $.96$ , the random process $\{v_{i}^{k}\cdot r\}$ with $s$ steps has exactly one extreme sign change with threshold 1.

3.1 Rounding Algorithm

Given Theorem 1, we present the following rounding algorithm.

(i)

Solve the semidefinite relaxation ( $P$ ).

(ii)

Choose a random vector $r\in{\mathbbm{R}}^{sn}$ where each coordinate $r_{i}\sim{\cal N}(0,1)$ for $i\in\{1,\dots,sn\}$ .

(iii)

For each variable $x_{i}\in X$ , consider the sequence $\{v_{i}^{k}\cdot r\}$ for all $k\in S$ .

(a)

If there is one extreme sign change:

Place $x_{i}$ at the $k$ where $\{v_{i}^{k}\cdot r\}$ first reaches threshold 1.

(b)

If there are no extreme sign changes:

Assign $x_{i}$ a random position in $[0,s-1]$ .

For each $x_{i}\in X$ , we associate the random walk $w^{i}$ . Each walk $w^{i}$ has the same expected behaviour. This follows from the fact that for each $i$ , there is a rotation matrix such that the set of vectors $\{v_{i}^{k}\}$ is equal to a canonical set of vectors, as stated in Lemma 1 (i.e., for a fixed vertex $i$ , the pairwise (setwise) relationships of the vectors is exactly prescribed by constraints (2) and (4) of the SDP). We prove Theorem 1 in Section 5. First, we briefly discuss Brownian motion, which we will use in the proof of Theorem 1.

To measure the quality of the solution produced by this rounding algorithm, one must analyze the correlation of two random walks. In Section 7, we state a conjecture regarding this correlation, which is supported by extensive computational experiments.

4 Brownian Motion

In order to analyze the randomized rounding scheme presented above, we will interpret the sequence of vectors corresponding to any fixed variable, $v_{1}\cdot r,v_{2}\cdot r,\ldots,v_{s}\cdot r$ as a random walk. We will show that this random walk is a discrete sampling of a fundamental continuous stochastic process called Brownian motion. We will then use properties of Brownian motion to prove properties of our discrete walk. For background in Brownian motion, we refer the reader to the textbook [KS88].

A stochastic process $W_{t}$ for $0\leq t<\infty$ is a Brownian motion if it satisfies the following properties:

For times $t_{1}<t_{2}$ , $W_{t_{2}}-W_{t_{1}}\sim{\cal N}(0,t_{2}-t_{1})$ . 2. 2.

For all choices of times $0\leq t_{0}<t_{1}<\ldots<t_{n}<\infty$ , $W_{t_{i+1}}-W_{t_{i}}$ is independent of $W_{t_{j+1}}-W_{t_{j}}$ for all choices of $i,j,n$ . 3. 3.

$W_{0}=0$ . 4. 4.

$W_{t}$ has continuous sample paths with probability 1.

Our proofs will rely on two basic properties of Brownian motion: the distributions of hitting times and the Reflection Principle.

The first hitting time for level $b$ , denoted by $\tau_{b}$ , is defined to be the first time at which $W_{t}$ takes the value $b$ : $\tau_{b}=\inf\left\{t:W_{t}=b\right\}$ . $\tau_{b}$ is a random variable whose distribution has the density function:

[TABLE]

Roughly stated, the Reflection Principle is the intuitive property that once a Brownian motion hits a level $b$ , it is equally likely to be above and below the level $b$ in the future. More precisely, it states that if $W_{t}$ is a Brownian motion and $\tau_{b}$ its hitting time to $b$ , then the process $W^{\prime}_{t}$ defined by

[TABLE]

(which is the formula for $W_{t}$ “reflected” about the horizontal line $y=b$ after it hits $b$ ) is also a Brownian motion. We will use the reflection principle many times in our calculations.

4.1 Mapping Our Process to Brownian Motion

Given a constellation of vectors $C_{i}$ , we can assume by Lemma 1 that the vectors have the following configuration. Each vector in this configuration has dimension $s/2$ . Note that in order to make each vector a unit vector, we can multiply each entry by $\frac{\sqrt{2}}{\sqrt{s}}$ .

[TABLE]

Suppose that $r\in{\cal N}(0,1)^{s/2}$ is a vector such that $r=(r_{1},r_{2},\dots r_{s/2})$ . Let $a^{\prime}=\sum_{i=1}^{s/2}r_{i}=v_{i}^{0^{\prime}}\cdot r$ . Define the process $w_{k}^{i^{\prime}}$ as $w_{k}^{i^{\prime}}=\sum_{i=0}^{k}r_{k}$ . Thus, we have that:

[TABLE]

Let the vector $v^{k}$ represent the vector $v^{k^{\prime}}$ with each entry multiplied by $\frac{\sqrt{2}}{\sqrt{s}}$ , so that $\{v^{k}\}$ is a set of unit vectors. Let $a=\sum_{i=1}^{s/2}r_{i}\cdot\frac{\sqrt{2}}{\sqrt{s}}=v^{0}\cdot r=(v^{0^{\prime}}\cdot r)\cdot\frac{\sqrt{2}}{\sqrt{s}}$ , and define the process $w_{k}^{i}$ as $w_{k}^{i}=\sum_{i=0}^{k}r_{k}\cdot\frac{\sqrt{2}}{\sqrt{s}}$ . Thus, we have that:

[TABLE]

Note that when $v^{k}\cdot r=1$ or $-1$ , it is the case that $w_{k}^{i}=\frac{a-1}{2}$ and $\frac{a+1}{2}$ , respectively. If we define $W_{t}$ to be a Brownian motion on the continuous interval $[0,1]$ , then $w_{t}$ is a discretization of this continuous process. In the remainder of the paper, when we refer to a particular process $w_{k}^{i}$ , we will drop the superscript $i$ when it is clear from the context, or when we are just referring to a single process.

5 Probability of Exactly One Extreme Sign Change

In this section, we we prove Theorem 1. We adopt standard statistical notation and denote density function of a continuous random variable $X$ by $\Pr[X\in dx]$ . For example, if $X\sim{\cal N}(0,1)$ , then $\Pr[X\in dx]=\frac{1}{\sqrt{2\pi}}e^{-x^{2}/2}dx$ . (Heuristically, $dx$ denotes a very small region around the value $x$ .) We will furthermore write $\Pr[X\in dx(1)]$ to denote the density function of $X$ when $x$ takes the specific value of 1.

We compute the probability of an event $A$ conditioned on a random variable $X$ taking value $X=x$ by applying the formula:

[TABLE]

5.1 Probability of at Least One Sign

Change

Suppose Brownian motion $W_{t}$ begins at $W_{0}=0$ and at $t=1$ satisfies $W_{1}=a$ . Let $H^{+}$ be the minimum time that $W$ reaches the threshold $a/2+1/2$ , and $H^{-}$ be the minimum time that $W$ reaches $a/2-1/2$ . Note that $H^{+}=\tau_{W_{1}/2+1/2}$ and that $H^{-}=\tau_{W_{1}/2-1/2}$ . These hitting times depend on the value of $W_{t}$ at time $1$ , and are therefore are not stopping times. In order to calculate their distributions, we will first fix $a$ and then calculate the distributions of $H^{+}$ and $H^{-}$ , conditioned on the value of $W_{1}=a$ .

For our proof, we will find it helpful to generalize our definition of $H^{-}$ and $H^{+}$ as follows. Suppose a Brownian motion satisfies $W_{1}=a$ . Define $H^{+}_{i}$ to be the first time that $W_{t}$ finishes hitting the following sequence of barriers: $a/2+1/2$ , then $a/2-1/2$ , then $a/2+1/2$ and so forth until it has crossed $i$ barriers, alternating between the upper and lower barriers. As an example, $H^{+}_{3}$ would be the first time that the path hits $a/2+1/2$ after having first hit $a/2+1/2$ , and then $a/2-1/2$ . Similarly, define $H^{-}_{i}$ to be the first time that $W_{t}$ finishes hitting the sequence of barriers: below $a/2-1/2$ , above $a/2+1/2$ , and so forth until it has crossed $i$ barriers.

Lemma 4.

$\Pr[H^{+}\leq 1\mbox{ or }H^{-}\leq 1]\geq.985612$ .

Our approach will be to consider the decomposition of the total probability into probabilities conditioned on $W_{1}=a$ :

[TABLE]

and calculate the integral on the right-hand-side.

We break the domain of the integral into three parts.

5.1.1 Case (i): $a\geq 1$

In this case, the upper barrier is $a/2+1/2\leq a$ . The condition $W_{1}=a$ implies that $W_{t}$ crosses this barrier with probability 1 (i.e. $H^{+}\leq 1$ ). Therefore,

[TABLE]

5.1.2 Case (ii): $a\leq-1$

Analogous to Case (i).

5.1.3 Case (iii): $-1<a<1$

From an application of the inclusion-exclusion principle, note that:

[TABLE]

5.1.4 An example of applying the reflection

principle

We now sketch the reasoning behind a standard calculation involving the reflection principle of Brownian motion and apply it to calculating $\Pr[H^{+}\leq 1~{}|~{}W_{1}=a]$ for the case $-1<a<1$ . We will use this sort of calculation many times in our proofs.

Fix a value of $a$ . Suppose a Brownian motion $W_{t}$ (but not restricted to satisfy $W_{1}=a$ ) hits the value $b=a/2+1/2$ at time $\tau_{b}$ (i.e., $W_{\tau_{b}}=b$ ). Define $W^{\prime}_{t}$ to be the process $W^{\prime}_{t}=W_{t}$ for $0\leq t\leq\tau_{b}$ and $W^{\prime}_{t}=2b-W_{t}$ for $t\geq\tau_{b}$ . By the reflection principle, the random process $W^{\prime}_{t}$ is also a Brownian motion.

Now consider the subset of Brownian motions $\left\{W_{t}|H^{+}\leq 1,W_{1}=a\right\}$ (i.e., they satisfy $\tau_{a/1+1/2}\leq 1,W_{1}=a$ ). Note that these processes correspond exactly to reflected processes $W^{\prime}$ that satisfy $W^{\prime}_{1}=2b-W_{1}=1$ . Thus, the elements in the set $\left\{W_{t}|H^{+}\leq 1,W_{1}=a\right\}$ correspond exactly to elements in the set $\left\{W^{\prime}_{t}|W^{\prime}_{1}=1\right\}$ . Then

[TABLE]

For a more rigorous justification of these calculations, see [Cha01].

Similarly, one can show that: $\Pr[H^{-}\leq 1~{}|~{}W_{1}=a]=\phi(1)/\phi(a)$ . Therefore

[TABLE]

5.1.5 An example applying the reflection principle twice

Note that the event $\left\{W_{t}|H^{+}\leq 1\mbox{ and }H^{-}\leq 1,W_{1}=a\right\}$ corresponds to processes that either cross above the barrier $a/2+1/2$ then below $a/2-1/2$ or vice versa, i.e. processes that satisfy $H^{+}_{2}\leq 1$ or $H^{-}_{2}\leq 1$ . Therefore, from the inclusion-exclusion principle we have:

[TABLE]

In order to calculate $\Pr[H^{+}_{2}\leq 1~{}|~{}W_{1}=a]$ , we apply the reflection principle twice. First, we reflect $W_{t}$ about the line $a/2+1/2$ when it first hits $a/2+1/2$ . Call this reflected process $W^{\prime}_{t}$ . A process $W_{t}$ that hits $a/2+1/2$ , then hits $a/2-1/2$ , then satisfies $W_{1}=a$ will correspond exactly to a reflected process $W^{\prime}_{t}$ that first hits $a/2+1/2$ then hits $a/2+3/2$ then achieves $W^{\prime}_{1}=1$ .

Next, we reflect the process $W^{\prime}_{t}$ the first time it hits $a/2+3/2$ about the line $a/2+3/2$ ; call this new process $W^{\prime\prime}_{t}$ . It is easy to verify that a process $W_{t}$ (prior to reflection) that hits $a/2+1/2$ , then $a/2-1/2$ , then achieves $W_{1}=a$ will correspond exactly to a reflected process $W^{\prime\prime}_{t}$ that satisfies $W^{\prime\prime}_{1}=2+a$ . Therefore,

[TABLE]

[I]n order to calculate $\Pr[H^{+}_{2}\leq 1~{}|~{}W_{1}=a]$ , we apply the reflection principle twice. After the process $W_{t}$ $W_{t}$ first hits $a/2+1/2$ and then hits $a/2-1/2$ , we reflect the process $W_{t}$ about the line $a/2-1/2$ . Call this reflected process $W^{\prime}_{t}$ . A process $W_{t}$ that hits $a/2+1/2$ , then hits $a/2-1/2$ , then achieves $W_{1}=a$ will be reflected to a process $W^{\prime}_{t}$ that first hits $a/2+1/2$ and then achieves $W^{\prime}_{1}=-1$ . Next, we reflect the process $W^{\prime}_{t}$ the first time it hits $a/2+1/2$ about the line $a/2+1/2$ ; call this new process $W^{\prime\prime}_{t}$ . It is easy to verify that a process $W_{t}$ (prior to reflection) that hits $a/2+1/2$ , then $a/2-1/2$ , then achieves $W_{1}=a$ will correspond exactly to a twice reflected process $W^{\prime\prime}_{t}$ that achieves $W^{\prime\prime}_{1}=2+a$ . Therefore,

[TABLE]

Therefore,

[TABLE]

Similarly, one can show

[TABLE]

Note that the event $[H^{+}_{2}\leq 1\mbox{ and }H^{-}_{2}\leq 1~{}|~{}W_{1}=a]$ corresponds to the event $[H^{+}_{3}\leq 1\mbox{ or }H^{+}_{3}\leq 1]$ . From the calcuations in Section 5.6, the following bound can be easily derived:

[TABLE]

Combining these calculations, we arrive at:

[TABLE]

5.2 Totals

Combining the results of the three cases, we obtain:

[TABLE]

5.3 Probability of Three or More Sign Changes

In this section, we prove the following Lemma:

Lemma 5.

$\Pr[H_{3}^{+}\leq 1\mbox{ or }H_{3}^{-}\leq 1]\leq.0178.$ **

Since the barriers in $H^{+}_{3}$ and $H^{-}_{3}$ depend on the value of $W_{1}$ , as in the previous section, it will be necessary to decompose the total probability into probabilities conditioned on $a=W_{1}$ :

[TABLE]

We partition the domain of the integral into three cases, and calculate the probabilities in each case using the reflection principle.

[A] Brownian motion, $W$ , begins at 0 and after $t$ time steps achieves the value $W_{1}=a$ . Then let $H^{+}$ be the minimum time that $W$ finishes reaching the thresholds $a/2+1/2,~{}a/2-1/2,~{}a/2+1/2$ in that order. (Let ${H^{-}}$ be the min time that $W$ reaches the thresholds $a/2-1/2,~{}a/2+1/2,~{}a/2-1/2$ in that order.) We define another process $B_{t}$ , which is a reflection of the process $W_{t}$ over certain thresholds (depending on the case). There are three cases.

5.4 Case (i): $a>1$

In this case, we only need to calculate the probability that $H^{-}_{3}\leq 1$ occurs, since if $H^{+}_{3}$ occurs, then $H^{-}_{3}$ must also occur.

To obtain $W^{\prime}_{t}$ , the process $W_{t}$ is reflected the first time it hits $a/2+1/2$ , then the first time this reflected process hits $a/2+3/2$ . Using reasoning similar to Section 5.1.5, it can be shown that if the process $W_{t}$ (prior to reflection) hits $a/2+1/2$ , then $a/2-1/2$ , then $a/2+1/2$ , then satisfies $W_{1}=a$ (i.e., it satisfies $H^{+}_{3}\leq 1$ for fixed $a$ ), then it will correspond exactly to a process $W^{\prime}_{t}$ that achieves $W^{\prime}_{1}=a+2$ . Therefore,

[TABLE]

Thus, we have:

[TABLE]

5.5 Case (ii): $a\leq-1$

Analogous to Case (i).

5.6 Case (iii): $-1<a<1$

By the inclusion-exclusion principle, we have:

[TABLE]

First, we calculate $\Pr[H^{+}_{3}\leq 1~{}|~{}W_{1}=a]$ . The process $W^{\prime}_{t}$ is obtained by reflecting $W_{t}$ the first time it hits $a/2+1/2$ , then the first the reflected process hits $a/2+3/2$ , then the first time the twice reflected process hits $a/2+5/2$ . If $W_{t}$ (prior to reflection) hits $a/2+1/2$ , then $a/2-1/2$ , then $a/2+1/2$ , then achieves $W_{1}=a$ , then it will correspond exactly to a thrice reflected process $W^{\prime}_{t}$ that satisfies $W^{\prime}_{1}=3$ .

We want to calculate:

[TABLE]

We have:

[TABLE]

Thus, we have:

[TABLE]

Now we calculate $\Pr[H^{-}_{3}\leq 1~{}|~{}W_{1}=a]$ . In this case, the process $W^{\prime}_{t}$ is obtained by reflecting $W_{t}$ the first time it hits $a/2-1/2$ , then the first time the reflected process hits $a/2-3/2$ , then the first time the twice reflected process hits $a/2-5/2$ . A process $W_{t}$ that hits $a/2-1/2$ , then $a/2+1/2$ , then $a/2-1/2$ , then satisfies $W_{1}=a$ (i.e., it satisfies $H^{-}_{3}\leq 1$ for fixed $a$ ) will correspond exactly to a thrice reflected process $W^{\prime}_{t}$ that satisfies $W^{\prime}_{1}=-3$ . We want to compute:

[TABLE]

We have:

[TABLE]

Thus, we have:

[TABLE]

Thus, a naive bound on the probability of three sign changes would be to add expressions (17) and (18):

[TABLE]

The above bound is an overestimate of the probability, because the event $[H^{-}_{3}\leq 1\mbox{ and }H^{+}_{3}\leq 1~{}|~{}W_{1}=a]$ is contained in both (17) and (18).

We now calculate $\Pr[H^{-}_{3}\leq 1\mbox{ and }H^{+}_{3}\leq 1~{}|~{}W_{1}=a]$ . Note that the event $\left\{W|H^{-}_{3}\leq 1\mbox{ and }H^{+}_{3}\leq 1\right\}$ occurs when there are at least four sign changes; either $H^{+}_{4}\leq 1$ or $H^{-}_{4}\leq 1$ occurs, or possibly both. Using the same argument as we did for $H^{+}_{3}$ , $H^{-}_{3}$ , it can be shown that:

[TABLE]

and that

[TABLE]

Again applying the inclusion-exclusion principle, we have:

[TABLE]

Therefore:

[TABLE]

5.7 Totals

Combining the results of the three cases, we arrive at:

[TABLE]

6 From Brownian Motion to Discrete Random Walks

The randomized rounding procedure for our algorithm involves a discrete random walk; we have proven Lemmas 4 and 5 for the continuous process, Brownian motion. We show in this section that the discretized random walk of the rounding procedure will also satisfy Lemmas 4 and 5.

Suppose $W_{t}$ is a Brownian motion. As we showed earlier, the discretized random walk of $s$ steps, $w_{1},\ldots,w_{s}$ , can be modeled as the sequence: $w_{1}=W_{1},w_{2}=W_{2/s},w_{3}=W_{3/s},\ldots,w_{s}=W_{1}$ .

First, consider the question of whether Lemma 5 implies that $w_{1},\ldots,w_{s}$ also does not touch the sequence of barriers $\frac{w_{s}}{2}+\frac{1}{2},\frac{w_{s}}{2}-\frac{1}{2},\frac{w_{s}}{2}+\frac{1}{2}$ before time $t=1$ . Certainly, if $W_{t}$ does not hit this sequence of barriers, then its discretized version also does not hit these three barriers, since $w_{s}=W_{1}$ . Therefore Lemma 5 holds for the discrete random walk as well.

Now consider the question of whether Lemma 4 implies that $w_{1},\ldots,w_{s}$ hits either of the barriers $\frac{w_{s}}{2}+\frac{1}{2}$ or $\frac{w_{s}}{2}-\frac{1}{2}$ . Note that if $W_{t}$ hits the barrier $b$ at time $\tau_{b}$ , it is not necessarily true that there exists an $i$ such that $w_{i}\geq b$ , since $W_{t}$ could have hit $b$ at some time between the steps of the discretized walk. Therefore, Lemma 4 cannot be immediately adapted to proving properties of the discretized walk. We now prove that the Lemma is true for the discretized walk when the number of steps is a sufficiently large constant.

Recall that random variables $H^{+}$ and $H^{-}$ were defined as the first times that the Brownian motion $W_{t}$ hits the barrier defined by $\frac{W_{1}}{2}+\frac{1}{2}$ and $\frac{W_{1}}{2}-\frac{1}{2}$ , respectively. We slightly strengthen these conditions by defining random variables $\tilde{H}^{+}$ and $\tilde{H}^{-}$ to be the first times that $W_{t}$ hits the barriers $\frac{W_{1}}{2}+\frac{1}{2}+\eta$ and $\frac{W_{1}}{2}-\frac{1}{2}-\eta$ , respectively, for some very small constant $\eta$ .

Since $\eta$ will be chosen to be very small, it will not have a large impact on the distributions of $\tilde{H}^{+}$ and $\tilde{H}^{-}$ relative to $H^{+}$ and $H^{-}$ . The proof of the following lemma involves the same calculations as in the proof of Lemma 4.

Lemma 6.

For $\eta>0$ chosen sufficiently small,

[TABLE]

We use the above Lemma to prove that if the continuous process $W_{t}$ hits the barrier $\frac{a}{2}+\frac{1}{2}+\eta$ , then the discrete random walk will hit the barrier $\frac{a}{2}+\frac{1}{2}$ with high probability. The case for the barrier $\frac{a}{2}-\frac{1}{2}$ is similar.

Lemma 7.

If the number of steps $s$ of the discretized random walk satisfies $s\geq\frac{c}{\eta^{2}}$ for some constant $c$ , then:

[TABLE]

where $W_{1}=a$ and $\tau_{\frac{a}{2}+\frac{1}{2}}$ is the time the continuous process hits the barrier $\frac{a}{2}+\frac{1}{2}$ .

Proof.

Let $b=\frac{a}{2}+\frac{1}{2}$ be the barrier of interest. Since $b$ depends on value of $W_{1}=a$ , as in Sections 5.1 and 5.3, we will work with probabilities conditioned on the event $\left\{W|W_{1}=a,\tau_{b}\leq 1\right\}$ .

Note that (a) $a=W_{1}\sim N(0,1)$ ; therefore, with probability at least $.999$ , $|a|\leq 10$ and $b+\eta<6$ . Also, (b) the probability that $\tau_{b+\eta}\leq 1-c$ , for some constant $c>0$ , conditioned on $\tau_{b+\eta}\leq 1$ , is at least 0.999. This is because the density function of $\tau_{b+\eta}$ conditioned on $W_{1}=a$ is given by:

[TABLE]

Therefore,

[TABLE]

for appropriately chosen $c$ . In particular $c\approx 10^{-9}$ is sufficiently small.

If $\tau_{b+\eta}$ is the time that the process $W_{t}$ hits the barrier $b+\eta$ , let the index $\lceil s\cdot\tau_{b+\eta}\rceil$ denote the step in the discretized random walk that immediately follows $\tau_{b+\eta}$ . The value of this step is $w_{\lceil s\cdot\tau_{b+\eta}\rceil}=W_{\lceil s\cdot\tau_{b+\eta}\rceil/s}$ . Intuitively, this value should be very close to $b+\eta$ if the number of steps is sufficiently large. Indeed, we will prove the lemma by showing that if the number of steps in the discrete random walk satisfies $s\geq\frac{20}{c\eta^{2}}$ , then

[TABLE]

Suppose that $W$ is conditioned on reaching the barrier $b+\eta$ at time $T$ and that $W$ is restricted to satisfying $W_{1}=a$ . We use basic properties of the distribution of the increments of a Brownian Bridge (see [Cha01] for details) to show that the value of a Brownian motion at time $T+t<1$ , under the condition that $W_{T}=b+\eta$ and $W_{1}=a$ , has the following distribution:

[TABLE]

Note that $\lceil s\cdot\tau_{b+\eta}\rceil$ is the index of the closest step in the discretization to $\tau_{b+\eta}$ and that $\lceil s\cdot\tau_{b+\eta}\rceil/s-\tau_{b+\eta}\leq 1/s\leq c\eta^{2}/20$ . If $a\leq 10,T<(1-c)$ and $s\geq 20/(c\eta^{2})$ , then Equation (24) implies that $w_{\lceil s\cdot\tau_{b+\eta}\rceil}=W_{\lceil s\cdot\tau_{b+\eta}\rceil/s}$ is distributed with mean at least $b+\eta/2$ and variance at most $\eta^{2}/20$ . Thus, if $s\geq 20/(c\eta^{2})$ ,

[TABLE]

The Lemma follows. ∎

Lemma 7 can thus be applied to prove Theorem 1.

7 Correlated walks

To prove an approximation ratio of our rounding algorithm, we need to show that the positions of $x_{i}$ and $x_{j}$ (corresponding to the constraint $x_{j}-x_{i}\equiv d_{ij}(\bmod~{}s)$ ), determined by the random walks $w^{i}$ and $w^{j}$ , are close to the required distance if the vectors $v_{i}^{0}$ and $v_{j}^{d_{ij}})$ are close. In other words, without loss of generality, let us assume that for a fixed constraint, we have $d_{ij}=0$ . Then our goal is to show that the distance between the two positions assigned by our rounding procedure to $x_{i}$ and $x_{j}$ are close if the vectors $v_{i}^{0}$ and $v_{j}^{0}$ are close. After extensive computational investigation (on solutions obeying the constraints of $(P^{+})$ ), we believe the following conjecture holds.

Conjecture 1.

In our rounding scheme, the expected distance between $x_{i}$ and $x_{j}$ is bounded above by $\frac{\theta}{2\pi}$ if both $w^{i}$ and $w^{j}$ each have exactly one extreme sign change.

Proving the above conjecture would lead to an approximation guarantee slightly below $\alpha_{GW}=.87856$ , because we do not have an extreme sign change with probability 1.

We can show that if $v_{i}^{0}$ and $v_{j}^{0}$ have a small angle, then the two walks are (globally) close to each other in the sense that the area between the two walks is small. However, this does not immediately lead to a proof that the positions of their extreme sign changes are close.

Lemma 8.

Given two unit vectors $x$ and $y$ with angle $\theta$ , and a vector $r\in\mathbbm{R}^{n}$ with each coordinate drawn from ${\cal N}(0,1)$ , then,

[TABLE]

Proof.

Let $x=(\cos{\frac{\theta}{2}},~{}\sin{\frac{\theta}{2}})$ and $y=(\cos{\frac{\theta}{2}},-\sin{\frac{\theta}{2}})$ . Let $r=(r_{1},r_{2})$ .

[TABLE]

The expected value of $r_{2}$ given that it is non-negative is $\frac{\sqrt{2}}{\sqrt{\pi}}$ . Since $\sin{\frac{\theta}{2}}$ is always non-negative for $\theta$ from 0 to $\pi$ , the above statement follows by linearity of expectation. ∎

If we consider the random walks on the interval $[0,1]$ (i.e. we map the interval $[0,2]$ to the smaller interval $[0,1]$ ), then the expected area between the two walks is $\frac{2\sqrt{2}}{\sqrt{\pi}}\sin{\frac{\theta}{2}}$ . Thus, as the contribution to the objective function increases, the two walks converge and the positions assigned to them by the rounding procedure should converge to one another.

Acknowledgements

We would like to thank Martin Becker and Larry Shepp for helpful discussions about Brownian motion. Most of this work was done in 2007 at the Max-Planck-Institut für Informatik in Saarbrücken, Germany.

Bibliography6

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Ban 10] Nikhil Bansal. Constructive algorithms for discrepancy minimization. In Proceedings of 51st Annual IEEE Symposium on Foundations of Computer Science (FOCS) , pages 3–10, 2010.
2[Cha 01] Joe Chang. Brownian motion. Lecture notes for Statistics 251/551, Yale University , 2001.
3[GHM + 11] Venkatesan Guruswami, Johan Håstad, Rajsekar Manokaran, Prasad Raghavendra, and Moses Charikar. Beating the random ordering is hard: Every ordering CSP is approximation resistant. SIAM Journal on Computing , 40(3):878–914, 2011.
4[KS 88] Ioannis Karatzas and Steven E. Shreve. Brownian Motion and Stochastic Calculus . Springer-Verlag, New York, 1988.
5[MN 11] Konstantin Makarychev and Alantha Newman. Complex semidefinite programming revisited and the assembly of circular genomes. In Innovations in Computer Science (ICS) , pages 444–459, 2011.
6[Sin 11] Amit Singer. Angular synchronization by eigenvectors and semidefinite programming. Applied and computational harmonic analysis , 30(1):20, 2011.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Rounding semidefinite programs for large-domain problems

Abstract

1 Introduction

1.1 Organization

2 Quadratic Programs

2.1 Semidefinite Relaxations

Definition 1**.**

Lemma 1**.**

Proof.

Lemma 2**.**

Proof.

Lemma 3**.**

Proof.

2.2 Relaxation on an Arbitrarily Large Domain

3 Rounding the Relaxation

Definition 2**.**

Theorem 1**.**

3.1 Rounding Algorithm

4 Brownian Motion

4.1 Mapping Our Process to Brownian Motion

5 Probability of Exactly One Extreme Sign Change

5.1 Probability of at Least One Sign

Lemma 4**.**

5.1.1 Case (i): a≥1a\geq 1a≥1

5.1.2 Case (ii): a≤−1a\leq-1a≤−1

5.1.3 Case (iii): −1<a<1-1<a<1−1<a<1

5.1.4 An example of applying the reflection

5.1.5 An example applying the reflection principle twice

5.2 Totals

5.3 Probability of Three or More Sign Changes

Lemma 5**.**

5.4 Case (i): a>1a>1a>1

5.5 Case (ii): a≤−1a\leq-1a≤−1

5.6 Case (iii): −1<a<1-1<a<1−1<a<1

5.7 Totals

6 From Brownian Motion to Discrete Random Walks

Lemma 6**.**

Lemma 7**.**

Proof.

7 Correlated walks

Conjecture 1**.**

Lemma 8**.**

Proof.

Acknowledgements

Definition 1.

Lemma 1.

Lemma 2.

Lemma 3.

Definition 2.

Theorem 1.

Lemma 4.

5.1.1 Case (i): $a\geq 1$

5.1.2 Case (ii): $a\leq-1$

5.1.3 Case (iii): $-1<a<1$

Lemma 5.

5.4 Case (i): $a>1$

5.5 Case (ii): $a\leq-1$

5.6 Case (iii): $-1<a<1$

Lemma 6.

Lemma 7.

Conjecture 1.

Lemma 8.