Complex phase retrieval from subgaussian measurements
Felix Krahmer, Dominik St\"oger

TL;DR
This paper demonstrates that the PhaseLift method can reliably reconstruct signals from subgaussian measurements in complex phase retrieval, even without small-ball probability assumptions, extending previous real-valued results.
Contribution
It extends phase retrieval guarantees for the PhaseLift method to complex signals with subgaussian measurements, without requiring small-ball probability conditions.
Findings
PhaseLift successfully reconstructs signals from subgaussian measurements.
The proof introduces new techniques applicable to various measurement scenarios.
Reconstruction is possible under minimal assumptions, extending prior real-valued results.
Abstract
Phase retrieval refers to the problem of reconstructing an unknown vector or from measurements of the form , where are known measurement vectors. While Gaussian measurements allow for recovery of arbitrary signals provided the number of measurements scales at least linearly in the number of dimensions, it has been shown that ambiguities may arise for certain other classes of measurements such as Bernoulli measurements or Fourier measurements. In this paper, we will prove that even when a subgaussian vector does not fulfill a small-ball probability assumption, the PhaseLift method is still able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Complex phase retrieval from subgaussian measurements
Felix Krahmer
Dominik Stöger
Abstract
Phase retrieval refers to the problem of reconstructing an unknown vector or from measurements of the form y_{i}=\big{|}\langle\xi^{\left(i\right)},x_{0}\rangle\big{|}^{2}, where are known measurement vectors. While Gaussian measurements allow for recovery of arbitrary signals provided the number of measurements scales at least linearly in the number of dimensions, it has been shown that ambiguities may arise for certain other classes of measurements such as Bernoulli measurements or Fourier measurements. In this paper, we will prove that even when a subgaussian vector does not fulfill a small-ball probability assumption, the PhaseLift method is still able to reconstruct a large class of signals from the measurements. This extends recent work by Krahmer and Liu from the real-valued to the complex-valued case. However, our proof strategy is quite different and we expect some of the new proof ideas to be useful in several other measurement scenarios as well. We then extend our results up to an additional assumption which, as we show, is necessary.
1 Introduction
Phase retrieval refers to the problem of reconstructing an unknown vector from measurements of the form
[TABLE]
where the are known measurement vectors and represents additive noise. Such problems are ubiquituous in many areas of science and engineering such as X-ray crystallography [23, 32], astronomical imaging [18], ptychography [35], and quantum tomography [28].
The foundational papers [7, 13, 4] proposed to reconstruct via the PhaseLift method, a convex relaxation of the original problem. These papers have triggered many follow-up works since they were the first to establish rigorous recovery guarantees under the assumption that the measurement vectors are sampled uniformly at random from the sphere. Since then several papers have analyzed scenarios where the measurement vectors possess a significantly reduced amount of randomness, in particular spherical designs [21] and coded diffraction patterns [5, 22]. However, the theoretical results for coded diffraction patterns rely on the assumption that the modulus of the illumination patterns is varying. Indeed, it was shown in [17] that for certain illumination patterns with constant modulus ambiguities can arise, i.e., it is not possible to determine uniquely from the measurements . In fact, such ambiguities can already arise in much simpler settings, where the measurement vectors are i.i.d. subgaussian. For example, consider the case that , where the are i.i.d. Rademacher random variables. That is, they only take the values and each with probability . In this case the vector can never be distinguished from the vector . Note that in this scenario it holds that \mathbb{E}\left[\big{|}\xi^{\left(i\right)}_{j}\big{|}^{4}\right]=\mathbb{E}\left[\big{|}\xi^{\left(i\right)}_{j}\big{|}^{2}\right] and, hence, the vector does not fulfill a small-ball probability assumption, which means that there is no constant such that for all and for all vectors it holds that
[TABLE]
When the signals are complex even additional classes of ambiguities can arise. For example, when the measurement vectors are real, any signal and its complex-conjugate signal will result in identical observations.
For these reasons, previous works on phase-retrieval from subgaussian measurements (see, e.g., [11]) work with real signals and require that all entries of the vector fulfill
[TABLE]
for all or make even stronger assumptions.
The only exception is [26] which shows for the real-valued case ( and ) PhaseLift recovers a large class of signals from subgaussian measurements even if estimates of the type (1.3) are not satisfied. More precisely, one obtains that all signals whose peak-to-average power ratio satisfies a mild bound of the form
[TABLE]
for some absolute constant , can be recovered with high probability as long as . However, as the approach in [26] is intrinsically based on arguments in [16] it cannot be generalized to the complex case in a straightforward manner. This paper provides an analysis both for real-valued and complex-valued signals. We believe that this understanding will be of importance for the subsequent study of structured scenarios such as coded diffraction patterns, which are also intrinsically complex in nature.
While the proofs in previous papers [5, 21, 22, 26] relied on the construction of a so-called dual certificate, our paper will employ a more geometric approach based on Mendelson’s small ball method [25, 31]. This is motivated by recent work [28, 24, 27], which showed that a geometric analysis based on the descent cone of the trace norm can often yield additional insights compared to an approach based on dual certificates.
For the problem studied in this paper, however, the small-ball method cannot be applied directly to the entire descent cone or the entire cone of directions in which positive semidefiniteness is preserved. Rather we divide the latter cone into two parts: One that contains all the problematic cases, but is small, and one that is larger, but easier to analyze. Then we control one of these cones using a restricted isometry property and one via the small-ball method.
We think that this novel viewpoint and also some of the techniques developed in this paper will be useful for the analysis of other interesting measurement scenarios, such as the case of heavy-tailed measurement vectors or the case that has only entries [math] and .
2 Background and main results
2.1 Notation
denotes the vector space of all Hermitian matrices in . By we will denote the set of all positive definite Hermitian matrices. For the Hilbert-Schmidt inner product is defined by . The corresponding norm will be denote by . For a matrix we will denote their eigenvalues by , which are assumed to be arranged in decreasing order, i.e., . If no confusion can arise, we will suppress the dependence on and write instead of . By we will denote the Schatten- norm of , i.e. . By we denote the matrix, which we obtain by setting all off-diagonal entries of equal to zero. We will write or if there is a universal constant such that .
2.2 PhaseLift
The PhaseLift method was first introduced in [7]. In this paper we focus on a variant [4, 13] based on the observation that the measurements can be rewritten in the form
[TABLE]
where is a rank- matrix encoding the signal to be recovered up to the true inherent phase ambiguity. From this observation, PhaseLift relaxes the constraint that is of rank to obtain the optimization problem
[TABLE]
In order to simplify notation we introduce the linear operator as
[TABLE]
Hence, setting , (2.2) can be rewritten as
[TABLE]
We note that while understanding the relaxation (2.4) is an important benchmark approach and can be solved in polynomial time, it is typically not practical for applications, as lifting increases the number of optimization variables. For this reason, a very active line of research study recovery guarantees for algorithms that operate in the natural parameter domain such as alternating minimization (see, e.g., [34, 43]), gradient-descent based formulations (see, e.g., [6, 9, 38, 39, 10]), and anchored regression [2, 1, 3, 20]. However, most of these guarantees have been shown under the assumption that the measurement vectors are sampled i.i.d. from the unit sphere, so it will be a natural follow-up of this work to study to which extent our results generalize to the more practical nonconvex algorithms. In particular, most reconstruction guarantees for these non-convex approaches require an appropriate initialization. For this reason, one needs to study which initialization schemes work for the measurements considered in this paper. A natural approach will be to try spectral initializations and recent generalizations that have been shown to be feasible for a basically minimal number of measurements [29, 33, 15, 30]. We expect that the analysis provided in this paper will prove useful for this endeavour as the spectral initialization is somewhat connected to trace-norm minimization.
2.3 Subgaussian measurements
We consider random measurement vectors given as independent copies of a random vector , whose entries are assumed to be i.i.d. subgaussian random variables with parameter , expectation , and variance . Recall that a random variable is subgaussian with parameter , if and only if
[TABLE]
It is well known (see, e.g., [42]) that from this definition it follows for any (measurable) random variable that
[TABLE]
Since inequality (2.6) immediately implies that
[TABLE]
Moreover, it is well known (see, e.g., [42]) that for all the random variable is subgaussian with parameter .
2.4 Previous work
A number of previous works have studied phase retrieval with subgaussian measurements in the real-valued setting , i.e., and . For measurements fulfilling \mathbb{E}\left[\big{|}\xi_{j}\big{|}^{4}\right]>\mathbb{E}\left[\big{|}\xi_{j}\big{|}^{2}\right], [11] showed that PhaseLift admits order optimal uniform recovery guarantees.111In [11] instead of (2.4) the original PhaseLift approach as in [7] is analysed. Without the assumption \mathbb{E}\left[\big{|}\xi_{j}\big{|}^{4}\right]>\mathbb{E}\left[\big{|}\xi_{j}\big{|}^{2}\right], in [26] the following result was proven, again for the real-valued case.
Theorem 1**.**
[26, Theorem V.1]** Let be a random vector with i.i.d. subgaussian entries. Then there exist constants , , , and , which depend only on the distribution of , such that whenever
[TABLE]
the following statement holds with probability at least : For all signals with and all noise vectors any minimizer of (2.4) fulfills
[TABLE]
3 Main results
3.1 Complex signals and complex measurement vectors
In Theorem 1 both the signal and the measurement vectors are assumed to be real. While for the measurement vectors this is often too restrictive, the signal is indeed typically real-valued in applications. This important special case will be discussed in Section 3.2 below. Nevertheless, we find it still interesting from a mathematical point of view under which assumptions recovery is possible for complex-valued signals. Our first result deals with this case.
As we have explained in Section 1, there are subgaussian distributions for which we cannot achieve uniform recovery of all signals . For this reason, we define for all the set of all signals of mildly bounded peak-to-average power ratio
[TABLE]
Indeed, this restriction is very mild as will not depend on the dimension, whereas for a Gaussian random signal the ratio would scale like . Now we are prepared to state the following theorem, which is our first main result.
Theorem 2**.**
Let the observation vector be given as in (1.1), where the random measurement vectors are defined as in Section 2.3. Assume that for some and that
[TABLE]
Then for some probability parameter the following two statements hold.
With probability at least one has that for all vectors and any noise vector any solution of (2.4) satisfies
[TABLE] 2. 2.
If, in addition, , then with probability at least inequality (3.3) holds for all .
Here , , and are universal constants.
The first case of Theorem 2, where one makes no assumption on the fourth moment of , can be applied also to certain scenarios, where unique recovery is not possible without this assumption. One important example is that the entries are drawn from uniformly at random. Note that these measurements will always yield the same observations for the two signals
[TABLE]
Such very sparse signals are exactly prevented by Condition 1, so there is no contradiction to the theorem’s conclusion that unique recovery can be achieved via (2.4) for all signals such that .
Note that in the second scenario, where assumptions on the fourth moment of are available, we obtain a uniform recovery result over all . In the real-valued case a similar result has been shown in [11].
Remark 1**.**
An assumption of the form cannot be avoided as the following argument shows. Indeed, if the assumption implies that almost surely, where is fixed and is a real random variable. We observe that
[TABLE]
Consequently, and its complex-conjugate will always lead to the same measurements.
3.2 Real signals and complex measurement vectors
We have seen in Remark 1 that the assumption is necessary to distinguish between a signal and . However, if, as in many practical applications, it is known a priori that the signal is real-valued then this ambiguity cannot arise and we can uniquely recover without additional assumptions via the following natural variant of the PhaseLift method, where we restrict the search space to real-valued matrices.
[TABLE]
The following theorem shows that in this scenario the assumption is indeed not necessary.
Theorem 3**.**
Let the observation vector be given as in (1.1), where the random measurement vectors are as defined in Section 2.3. Then the following two statements hold.
Assume that
[TABLE]
Then, with probability at least one has that for all vectors and any noise vector any solution of (3.10) satisfies
[TABLE] 2. 2.
If, in addition, it holds that for some , then, under the refined assumption
[TABLE]
one has a more general bound. Namely it holds that with probability at least and for all vectors , again for arbitrary noise vectors , any solution of (3.10) satisfies
[TABLE]
Here , , and are universal constants.
Remark 2**.**
In comparison to Theorem 1 the probability bound in Theorem 2 and Theorem 3 is slightly better, as it improves from to . Moreover, note that in contrast to Theorem 1 the dependence on the subgaussian distribution of is not hidden in the constants. Also note that in our result the dependence on is stated explicitly. However, we do not know whether these bounds are optimal with respect to and .
4 Proof of main results
4.1 Proof of Theorem 2
Our goal is to show that with high probability the matrix is close to the minimizer of the expression over all . A common proof strategy that we will also follow is to establish that all with
[TABLE]
are sufficiently close to the true solution in -norm. More precisely, a sufficient condition for inequality (3.3) is that every fulfilling condition (4.1) satisfies
[TABLE]
Setting , equation (4.1) reads
[TABLE]
By the triangle inequality this implies that
[TABLE]
Hence, the upper bound (4.2) that we aim to establish directly follows from an appropriate lower bound for . Here ranges over those matrices for which is positive semidefinite. This set is convex, so it is locally well-approximated by a convex cone. To establish a uniform recovery result over all , we need to study the union of the corresponding cones as given by
[TABLE]
We will refer to this set as the cone of admissible directions.
With this notation, our proof strategy can be summarized as establishing a lower bound for
[TABLE]
which in the literature is commonly referred to as the minimum conic singular value (see, e.g., [40, 27]). Except for the precise nature of the cone under consideration, this strategy is exactly analogous to a number of works in the recent literature on linear inverse problems [8, 28]. In particular, the following lemma, which summarizes our motivating considerations above, can be seen as a variant of [8, Proposition 2.2].
Lemma 1**.**
Let be the operator defined in (2.3). Assume that . Then the minimizer of (2.4) satisfies
[TABLE]
In the following, our goal will be to derive an appropriate lower bound for . One difficulty in the analysis is that not all matrices belonging to are positive semidefinite. Indeed, in this scenario one could use that for positive semidefinite matrices an approximate isometry holds (see, e.g. [7, Section 3]). While not all matrices in are positive semidefinite the following lemma states that each matrix belonging to possesses at most one negative eigenvector.
Lemma 2**.**
Suppose that . Then has at most one strictly negative eigenvalue.
Proof.
Let . By definition of we can find and such that
[TABLE]
Suppose now by contradiction that has two (strictly) negative eigenvalues with corresponding eigenvectors . Then we can find a vector such that . This implies that for any we have that
[TABLE]
which is a contradiction to (4.8). ∎
Recall that for a matrix we denoted its eigenvalues by in decreasing order. By the previous lemma it holds that for all and all . For the proof we will partition into two sets. Namely, for we define
[TABLE]
The two sets can be interpreted in the following way. If we would suppose that it would follow that for all matrices . In particular, this implies that there is such that is in the descent cone of the function at the point . Hence, for we can interpret as a slightly enlarged union of descent cones. In order to bound from below we will rely on the following lemma, which is proven in Section 6.
Lemma 3**.**
Assume that one of following two conditions is satisfied for :
It holds that . In this case we set . 2. 2.
In addition to \big{|}\mathbb{E}\left[\xi_{1}^{2}\right]\big{|}^{2}\leq 1-\beta, the inequality , is fulfilled. In this case we set .
Moreover, assume that
[TABLE]
Then with probability at least it holds that
[TABLE]
where . Here , , and are universal constants.
The proof of Lemma 3 makes use of the fact that the set has low complexity in the sense that the matrices in are approximately low-rank.
In contrast, the set has rather high complexity. For example, note that . Nevertheless, the quantity can be bounded from below, because the measurement matrices are positive semidefinite and the matrices in also have a dominant positive semidefinite component. This is achieved by the following lemma, whose proof can be found in Section 5.
Lemma 4**.**
Let , , and . Assume that
[TABLE]
Then with probability at least for all it holds that
[TABLE]
Here and are absolute constants.
We remark that Lemma 4 would no longer hold if the measurement matrices would be replaced by symmetric matrices with i.i.d. Gaussian entries (see [37, Proposition 1]).
Having gathered all the necessary ingredients we can prove the main result of this manuscript.
Proof of Theorem 2.
Set . The proof of the two statements is analogous, except that for the first statement we set whereas for the second statement we set . By Lemma 4 and assumption (3.2) it follows that with probability at least
[TABLE]
Furthermore, by Lemma 3 we have with probability at least that
[TABLE]
holds.
Set . Note that by definition we have that is an admissible direction, i.e., . It follows by (4.16), (4.17), and that
[TABLE]
where in the last inequality we used (2.7) and . It follows by Lemma 1 that
[TABLE]
which finishes the proof. ∎
4.2 Proof of Theorem 3
The proof of Theorem 3 is in large parts analogous to the proof of Theorem 2. For this reason, we will only highlight the main differences. Replacing by and by we can argue analogously to Section 4.1 with the only difference that Lemma 3 has to be replaced by the following variant.
Lemma 5**.**
Assume that one of following two conditions is satisfied for :
It holds that and . 2. 2.
It holds that and .
Moreover, assume that
[TABLE]
Then with probability at least it holds that
[TABLE]
where . Here , , and are universal constants.
In order to prove Lemma 5 we can proceed similarly as in the proof of Lemma 3, see Section 6, where in the proof of Lemma 3 we have highlighted the necessary modifications.
5 Proof of Lemma 4
Proof.
Note that for any we have that . Let be the matrix whose rows are given by . It follows that . It follows from [42, Theorem 4.6.1] that due to our assumption on with probability at least for all it holds that
[TABLE]
Due to the observation above this is equivalent to
[TABLE]
for all . We will assume in the following that (5.2) holds for all .
Let with corresponding eigenvalue decomposition . We observe that
[TABLE]
By Lemma 2 we know that has at most one negative eigenvalue. If all eigenvalues are positive, this inequality chain and inequality (5.2) imply that
[TABLE]
which shows (4.15). Now suppose that . By (5.2) and , which is due to , we obtain that
[TABLE]
Again using the relation we can also observe that
[TABLE]
Combining (5.9) and (5.10) shows (4.15), which finishes the proof. ∎
6 Proof of Lemma 3 and Lemma 5
In order to prove Lemma 3 and Lemma 5 we will use the following version of Mendelson’s small ball method [25, 31], a tool for deriving a lower bound for nonnegative empirical process.
Lemma 6**.**
[14, Lemma 1]** Let and let be i.i.d. random vectors. Let and and define
[TABLE]
Then, with probability at least , it holds that
[TABLE]
where are independent, symmetric, -valued random variables that are independent of .
Our goal is to apply Lemma 6 to . The following key lemma shows that matrices in have two favorable properties: They are approximately low-rank and their mass with respect to the Frobenius norm is not concentrated on the diagonal for is small. The first property follows directly from the fact that the negative eigenvalue is rather small, the second property requires the spectral flatness of , i.e., that is bounded.
Lemma 7**.**
Let and . Assume that . Then it holds that
[TABLE] 2. 2.
[TABLE]
Proof.
Let . By definition of we have that , which implies that
[TABLE]
This proves inequality (6.4).
In order to prove the second inequality note that by definiton of we can choose such that there exists with positive semidefinite. For this choice of we can decompose uniquely into
[TABLE]
where , , and . We observe that
[TABLE]
We will bound the two summands separately. We begin with and observe that
[TABLE]
In the first inequality we used the triangle inequality and in the third line we used that due to . In the fourth line we used that and , which follows from the fact that the summands of are orthogonal to each other. In the last line we again used that as is decomposed orthogonally into .
In order to bound we note first that is positive semidefinite. Indeed, suppose by contradiction that is not positive semidefinite. Then there would exist a vector such that and . In particular, this would imply that for all , which is a contradiction to our choice of .
Now let be the normalized (i.e., ) eigenvector corresponding to the eigenvalue . Then we obtain that
[TABLE]
where the first inequality follows from the fact that is positive semidefinite. Using this observation we obtain that
[TABLE]
where in the fourth line we used that , which is a consequence of the first inequality of (6.6). Combining this estimate with (6.8) and (6.9) shows part (2), which finishes the proof.
∎
In analogy to [25] we bound using the following lemma, whose proof is based on the Paley-Zygmund inequality. A key difference is that we use the Hanson-Wright inequality to control the fourth moment appropriately.
Lemma 8**.**
Let and let be a random vector with independent and identically distributed entries taking values in such that , , and . Then we have that
[TABLE]
Here is an absolute constant.
Proof.
Note that by the Paley-Zygmund inequality (see, e.g., [12]) we have that for all
[TABLE]
In particular, setting yields that
[TABLE]
To estimate from above we note that the triangle inequality yields that
[TABLE]
In order to estimate the first summand we will use that \big{|}\xi^{*}A\xi-\mathbb{E}\left[\xi^{*}A\xi\right]\big{|} has a mixed subgaussian/subexponential tail. We can bound the tail probability using the Hanson-Wright inequality (in the version of [36]), which states that there is a numerical constant such that for all it holds that
[TABLE]
This yields that
[TABLE]
where the third line follows from a change of variables. Combining this inequality chain with (6.15) we obtain that
[TABLE]
Inserting this into (6.14) finishes the proof. ∎
In order to apply Lemma 8 we need a lower bound for . The next lemma computes this quantity.
Lemma 9**.**
Let be a random vector with independent and identically distributed entries taking values in such that and . Then for all matrices it holds that
[TABLE]
Proof.
First, we observe that
[TABLE]
where in the third line we used that and that the entries of are independent, which implies that there are no summands where one index appears exactly three times. The first summand can be computed by
[TABLE]
where we have used that for all and . The second summand can be computed by
[TABLE]
For equation we used the observation that
[TABLE]
By summing up and we obtain equality (6.22).
∎
The lemmas above would allows us to find a lower bound for in Lemma 6. We still need an upper bound for the Rademacher complexity \mathbb{E}\left[\underset{Z\in\mathcal{Z}}{\sup}\Big{|}\sum_{i=1}^{m}\varepsilon_{i}\langle\xi_{i}\xi^{*}_{i},Z\rangle_{HS}\Big{|}\right]. The next lemma provides such a bound. In [28] a version of this lemma has already been presented. Nevertheless, we include a proof for completeness.
Lemma 10**.**
Assume that . Let , and set . Then we have that
[TABLE]
* and are absolute constants.*
Proof.
First, we note that by Hoelder’s inequality and Lemma 7 we obtain that
[TABLE]
To bound \mathbb{E}\left[\Big{\|}\sum^{m}_{i=1}\varepsilon_{i}\xi^{\left(i\right)}\left(\xi^{\left(i\right)}\right)^{*}\Big{\|}\right] let be an -covering of the unit sphere with respect to the Euclidean norm such that
[TABLE]
By [41, Lemma 5.4] we have that
[TABLE]
Fix and observe that
[TABLE]
where we have set . We observe that and, moreover,
[TABLE]
where the first equality follows directly from the definition of the -norm. 222For the definition of the -norm see, e.g., [41, Section 5.2.4]. The first inequality can be seen using [42, Lemma 2.7.6] and the second one using [42, Lemma 3.4.2]. By the Bernstein inequality (see, e.g., [42, Theorem 2.8.1]) we obtain that
[TABLE]
where is some numerical constant. It follows from (6.40), (6.41), (6.43), and a union bound that
[TABLE]
where . Then, whenever , we obtain that
[TABLE]
In order to finish we need to estimate the two integrals. By a change of variables and [19, Lemma C.7] we obtain that
[TABLE]
Inserting this in the inequality chain above yields that
[TABLE]
Combined with inequality (6.39) this finishes the proof. ∎
Now we have gathered all the ingredients to complete the proof.
Proof of Lemma 3 and Lemma 5.
We will start by showing that for all in the case of Lemma 3, or all in the case of Lemma 5, respectively.
We first consider the second case and assume that the condition is satisfied for some . By Lemma 9 we obtain that for all under the conditions of Lemma 3
[TABLE]
Under the assumptions of Lemma 5 we observe that and . Hence, a similar argument as before also leads to
[TABLE]
Under the first assumption we obtain by Lemma 9 that for all
[TABLE]
Similarly, under the assumptions of Lemma 5 we can again use that and to obtain by an analogous argument that
[TABLE]
The remainder of the proof will be the same for Lemma 3 and Lemma 5. By Lemma 7 we have that
[TABLE]
By the triangle inequality it follows that
[TABLE]
for . Inserting into (6.57) one obtains that
[TABLE]
Hence, we have shown in all cases that .
By Lemma 8 it follows that for all
[TABLE]
Note that for all
[TABLE]
where in the last inequality we also used that . This shows that for all it holds that
[TABLE]
where we used that due to (2.7). Now recall that . Thus we have shown that
[TABLE]
where is defined in (6.1). We used that and is a constant chosen large enough. From Lemma 10 it follows that
[TABLE]
Combining this inequality with our choice of and choosing the constant in assumption (4.12) large enough it follows that
[TABLE]
Applying Lemma 6 yields that with probability at least
[TABLE]
Setting it follows that with probability at least it holds that
[TABLE]
Hence, by the definition of and it follows that
[TABLE]
Due to and Lemma 7 we have that for all . Combined with (6.75) this shows (4.13). ∎
Acknowledgements
This work has been supported by the German Science Foundation (DFG) in the context of the joint project Bilinear Compressed Sensing (KR 4512/2-1) as part of the Priority Program 1798 as well as the Emmy Noether Junior Research Group Randomized Sensing of Signals and Images (KR 4512/1-1). Furthermore, the authors want to thank Peter Jung for inspiring discussions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Bahmani and J. Romberg. A flexible convex relaxation for phase retrieval. Electron. J. Stat. , 11(2):5254–5281, 2017.
- 2[2] S. Bahmani and J. Romberg. Phase retrieval meets statistical learning theory: A flexible convex relaxation. In Artificial Intelligence and Statistics , pages 252–260, 2017.
- 3[3] S. Bahmani and J. Romberg. Anchored regression: Solving random convex equations via convex programming. Found. Comput. Math. , to appear.
- 4[4] E. J. Candès and X. Li. Solving quadratic equations via phaselift when there are about as many equations as unknowns. Found. Comput. Math. , 14(5):1017–1026, 2014.
- 5[5] E. J. Candes, X. Li, and M. Soltanolkotabi. Phase retrieval from coded diffraction patterns. Appl. Comput. Harmon. Anal. , 39(2):277–299, 2015.
- 6[6] E. J. Candès, X. Li, and M. Soltanolkotabi. Phase retrieval via Wirtinger flow: theory and algorithms. IEEE Trans. Inf. Theory , 61(4):1985–2007, 2015.
- 7[7] E. J. Candes, T. Strohmer, and V. Voroninski. Phaselift: Exact and stable signal recovery from magnitude measurements via convex programming. Comm. Pure Appl. Math. , 66(8):1241–1274, 2013.
- 8[8] V. Chandrasekaran, B. Recht, P. A. Parrilo, and A. S. Willsky. The convex geometry of linear inverse problems. Found. Comput. Math. , 12(6):805–849, 2012.
