On the Phase Transition of Corrupted Sensing
Huan Zhang, Yulong Liu, and Hong Lei

TL;DR
This paper provides a theoretical explanation for the sharp phase transition observed in corrupted sensing problems, identifying the threshold where convex recovery methods succeed or fail, supported by numerical validation.
Contribution
It establishes the precise threshold for successful recovery in corrupted sensing, linking it to the Gaussian widths of tangent cones, thus explaining the phase transition phenomenon.
Findings
Sharp phase transition occurs around the sum of Gaussian widths squared.
Theoretical thresholds match numerical experiments.
Convex procedures fail or succeed based on this threshold.
Abstract
In \cite{FOY2014}, a sharp phase transition has been numerically observed when a constrained convex procedure is used to solve the corrupted sensing problem. In this paper, we present a theoretical analysis for this phenomenon. Specifically, we establish the threshold below which this convex procedure fails to recover signal and corruption with high probability. Together with the work in \cite{FOY2014}, we prove that a sharp phase transition occurs around the sum of the squares of spherical Gaussian widths of two tangent cones. Numerical experiments are provided to demonstrate the correctness and sharpness of our results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Numerical methods in inverse problems · Electrical and Bioimpedance Tomography
On the Phase Transition of Corrupted Sensing
Huan Zhang12, Yulong Liu3, and Hong Lei1 This work was supported by the National Natural Science Foundation of China under Grant 61301188. 1Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
2University of Chinese Academy of Sciences, Beijing 100049, China
3School of Physics, Beijing Institute of Technology, Beijing 100081, China
Abstract
In [1], a sharp phase transition has been numerically observed when a constrained convex procedure is used to solve the corrupted sensing problem. In this paper, we present a theoretical analysis for this phenomenon. Specifically, we establish the threshold below which this convex procedure fails to recover signal and corruption with high probability. Together with the work in [1], we prove that a sharp phase transition occurs around the sum of the squares of spherical Gaussian widths of two tangent cones. Numerical experiments are provided to demonstrate the correctness and sharpness of our results.
Index Terms:
Corrupted sensing, phase transition, Gaussian width, compressed sensing, signal separation.
I Introduction
Corrupted sensing aims to recover a structured signal from a small number of corrupted measurements
[TABLE]
where is the sensing measurement matrix which is assumed to have i.i.d. standard Gaussian entries in this paper, is the unknown signal, and is an unknown corruption. The goal is to estimate and from and .
This problem is encountered in many practical applications, such as face recognition [2], subspace clustering[3], network data analysis [4], and so on. Theoretical guarantees for this problem include sparse signal recovery from sparse corruption [5, 6, 7, 8, 9, 10, 11] and structured signal recovery from structured corruption [1, 12, 13].
To make the recovery possible, we will assume that both and have some structures which are promoted by the convex functions and respectively. When prior information about or is available, it is natural to consider the following program to recover the signal and corruption:
[TABLE]
or
[TABLE]
In [1], Foygel and Mackey provided conditions under which convex program (2) or (3) succeeds with high probability. Numerical experiments in [1] also suggested that there is a sharp phase transition when (2) or (3) is used to solve the corrupted sensing problem. However, little work has devoted to determining the threshold below which (2) or (3) fails with high probability. Therefore, theoretical understanding of the phase transition for program (2) and (3) is far from satisfactory.
In this paper, we present a theoretical analysis for the phase transition of (2) or (3). In particular, we figure out the exact position of phase transition, and demonstrate that the phase transition occurs in a relatively narrow region.
II Preliminaries
In this section, we present some preliminaries which will be used in our analysis.
Our result involves two important concepts: the Gaussian width and the tangent cone. Given a subset in , the Gaussian width is defined by
[TABLE]
We also define two tangent cones corresponding to signal and corruption respectively. The tangent cone of at the true signal is defined as
[TABLE]
Similarly, the tangent cone of at the true corruption is given by
[TABLE]
III Main results
In this section, we state our main results with some discussions.
Theorem 1** (Failure of convex program (2) or (3)).**
Consider convex program (2) or (3). Assume that both tangent cones and are closed. For any , if the measurement number satisfies
[TABLE]
*then the constrained convex program (2) or (3) fails with probability at least , where and are the unit sphere of and respectively. *
Proof.
See Appendix A. ∎
Remark 1** (Phase transition of corrupted sensing).**
Recall Theorem and Remark in [1], which stated that 111The authors believe that the small additive constants are artifacts of the proof technique. 222The original result is stated in terms of Gaussian complexity , difined as \gamma^{2}(\mathcal{D}_{s}\cap B^{n})=\mathbb{E}\big{(}\sup_{t\in\mathcal{D}_{s}\cap B^{n}}\left<g,t\right>\big{)}^{2}, where denotes the unit ball in . However, as the author stated, the Gaussian complexity is only very slightly larger than . when
[TABLE]
the constrained convex program (2) or (3) succeeds with probability at least . This, together with our result Theorem 1, demonstrate that the phase transition of corrupted sensing occurs around
[TABLE]
and the width of phase transition area is about
[TABLE]
where is an absolute constant.
Remark 2**.**
Our result also agrees with the result of Amelunxen el al. [14]. Indeed, by Proposition 10.2 and Proposition 3.1 (9) in [14], we have
[TABLE]
where denotes the statistical dimension of a convex cone .
Remark 3**.**
In [14], Amelunxen et al. considered the phase transition of the following demixing problem:
[TABLE]
where , are unknown signals and is a random orthogonal matrix. This model is different from ours since we have random Gaussian measurement matrix with .
Remark 4**.**
In [15], Oymak and Tropp considered the phase transition of the following demixing model:
[TABLE]
where , are two signals and , are some random transformation matrices. This model is also different from ours since is a deterministic matrix in our case. This makes the problem more difficult to analyze.
IV Simulation Results
In this section, we employ a numerical experiment to verify our theoretical guarantees (Theorem 1). In the experiment, both signal and corruption are designed to be sparse vectors. We use CVX [16] [17] to solve the convex program (2) or (3).
In the experiment, we assume that the prior information of is known exactly, and solve program (3). The experiment settings are as follows: the ambient dimension is set to , the measurement number , the sparsity level of signal changes from to with step , and the same for corruption. For every sparsity level of signal and corruption, we run and solve (3) times. We declare success if the solution to (3), denoted by , satisfies . Then we get the empirical probability of successful recovery. At last, we plot the theoretical curve predicted by Theorem 1.
Our numerical experiment result is shown in Fig. 1. We can see that the theoretical threshold given by Theorem 1 is closely matched with the empirical phase transition. It means that our theory can give a reliable prediction of the phase transition curve.
V Conclusion
This paper studied the problem of phase transition when we use convex program to solve corrupted sensing problem. Our results, together with previous work [1], gave the exact location of phase transition and the size of transition region. Simulations were provided to verify the correctness of our results. Our ongoing work is to establish a general framework to analyze the phase transition of various convex programs with noise-free or noisy data.
Appendix A Proof of Main Results
In this section, we present proof for our main result (Theorem 1). First, we will establish a sufficient condition under which convex program (2) or (3) fails, then some necessary tools are introduced, and at last, we give the proof for Theorem 1.
A-A Sufficient Condition for failure
In this subsection, we establish an easy-to-handle sufficient condition under which program (2) or (3) fails.
Lemma 1**.**
Let and denote the signal and the corruption tangent cones defined in (4) and (5) respectively. Then a sufficient condition under which constrained convex program (2) or (3) fails is
[TABLE]
In other words, the subset intersects the null space of matrix .
Proof.
Lemma 1 is a generalization of Proposition 2.1 of [18]. The proof is similar, and hence is omitted. ∎
Although Lemma 1 gives a sufficient condition for failure, it is difficult to check when (6) holds. The following lemma can overcome this drawback.
Lemma 2** (Sufficient condition for failure, Proposition 3.8, [15]).**
Under the condition of Lemma 1, if both and are closed, a sufficient condition for (6) to hold is
[TABLE]
where denotes the polar cone of , , and denotes the identity matrix.
Remark 5**.**
One can easily check that
[TABLE]
Thus, the sufficient condition under which convex program (2) or (3) fails can be rewritten as
[TABLE]
In the following parts, we will prove that (8) holds with high probability when the condition of Theorem 1 is satisfied. Before this, let’s state some tools that will be used in our proof.
A-B Other Useful Tools
Lemma 3** (Gordon’s inequality, Theorem 3.16, [19]).**
Let and be two Gaussian processes indexed by pairs of points in a product set . Assume that
[TABLE]
[TABLE]
Then we have
[TABLE]
Lemma 4** (Concentration of measure, Theorem 5.6, [20]).**
Let be a vector of independent standard normal random variables. Let : denotes an L-Lipschitz function. Then, for all ,
[TABLE]
Lemma 5** (Lemma 3.7, [18]).**
Let be a non-empty closed, convex cone. Then we have that
[TABLE]
Lemma 6**.**
Let and be subsets of and respectively. Then the function
[TABLE]
is a 1-Lipschitz function, where is the same as in (1).
Proof.
See Appendix B. ∎
A-C Proof of Main Results
According to Remark 5, we only need to prove that when
[TABLE]
the following event
[TABLE]
holds with probability at least . Moreover, a simple calculation verifies that this inequality is equivalent to
[TABLE]
Now, we will consider two cases for :
Case I: . In this case, when we minimize over , the second term \big{\|}\bm{s}_{2}-\bm{r}\big{\|}_{2}^{2} will be zero. Thus, the above inequality (A-C) is equivalent to
[TABLE]
For our purpose, we need to lower bound the left side of (A-C). Note that for any fixed , we have
[TABLE]
The first equality is due to the definition of -norm. The first inequality is because of the minimax inequality. The second equality comes from the linear property of inner product. The third equality uses the fact that when , otherwise it equals . The last equality can be derived by a simple transformation. As the above inequality holds for any , we have
[TABLE]
It remains to bound the right side. To this end, we will first use Gordon’s inequality (Lemma 3) to derive a lower bound for the expectation, and then concentration of measure (Lemma 4) to obtain the desired result. Let and be two Gaussian processes, where and are independent standard Gaussian random vectors. It can be easily checked that the increments satisfy
[TABLE]
[TABLE]
Therefore, Gordon’s inequality (Lemma 3) gives us:
[TABLE]
Since is a symmetric random vector, we have
[TABLE]
Substituting this into (A-C), we get
[TABLE]
As is a closed convex cone, by Lemma 5, we know that
[TABLE]
which implies
[TABLE]
Substituting this into (13), we get the following result:
[TABLE]
In the last inequality, we have used the assumption that .
Next, Lemma 6 confirms that the following function
[TABLE]
is a -Lipschitz function. Thus, concentration of measure (Lemma 4) gives us that for any ,
[TABLE]
Putting the above inequality and (A-C), (11), (A-C), (A-C) together, we eventually get that when
[TABLE]
we have
[TABLE]
Case II: . In this case, it is clear that no matter what and takes value, it is always holds that
[TABLE]
Thus,
[TABLE]
which, by (A-C) and (A-C), implies that
[TABLE]
Union bound. Combining case I and case II and taking a union bound, we have
[TABLE]
provided
[TABLE]
By Lemma 1 and Lemma 2, it means that when
[TABLE]
the convex program (2) or (3) fails with probability at least . This completes the proof.
Appendix B Proof of Lemma 6
To prove Lemma 6, we only need to show that for any
[TABLE]
For any fixed , let
[TABLE]
And we have
[TABLE]
Then, let
[TABLE]
and we have
[TABLE]
Similarly,
[TABLE]
Therefore,
[TABLE]
The same argument gives
[TABLE]
Thus, combining (B) and (16), we get
[TABLE]
The conclusion follows immediately.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Foygel and L. Mackey, “Corrupted sensing: Novel guarantees for separating structured signals,” IEEE Trans. Inf. Theory , vol. 60, no. 2, pp. 1223–1247, Feb. 2014.
- 2[2] J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, “Robust face recognition via sparse representation,” IEEE Trans. Pattern Anal. Mach. Intell. , vol. 31, no. 2, pp. 210–227, Feb. 2009.
- 3[3] E. Elhamifar and R. Vidal, “Sparse subspace clustering,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. , Miami Beach, FL, 2009, pp. 2790–2797.
- 4[4] J. Haupt, W. U. Bajwa, M. Rabbat, and R. Nowak, “Compressed sensing for networked data,” IEEE Signal Process. Mag. , vol. 25, no. 2, pp. 92–101, Mar. 2008.
- 5[5] J. Wright and Y. Ma, “Dense error correction via ℓ 1 subscript ℓ 1 \ell_{1} -minimization,” IEEE Trans. Inf. Theory , vol. 56, no. 7, pp. 3540–3560, Jul. 2010.
- 6[6] X. Li, “Compressed sensing and matrix completion with constant proportion of corruptions,” Constructive Approximation , vol. 37, no. 1, pp. 73–99, Feb. 2013.
- 7[7] N. H. Nguyen and T. D. Tran, “Exact recoverability from dense corrupted observations via ℓ 1 subscript ℓ 1 \ell_{1} -minimization,” IEEE Trans. Inf. Theory , vol. 59, no. 4, pp. 2017–2035, Jan. 2013.
- 8[8] ——, “Robust lasso with missing and grossly corrupted observations,” IEEE Trans. Inf. Theory , vol. 4, no. 59, pp. 2036–2058, Apr. 2013.
