Low-rank matrix recovery via regularized nuclear norm minimization
Wendong Wang, Feng Zhang, Jianjun Wang

TL;DR
This paper provides a theoretical analysis of low-rank matrix recovery using regularized nuclear norm minimization, establishing conditions under which robust recovery from noisy measurements is guaranteed, and introduces new coefficient estimates for null space properties.
Contribution
It is the first to establish $tk$-order RIC based coefficient estimates for the null space property in the case of $0<t\, extless=1$, extending recovery guarantees.
Findings
Recovery condition matches previous sharp bounds for $t>4/3$
Robust recovery is possible with noisy measurements under certain RIC constraints
First to analyze coefficient estimates for null space property when $0<t extless=1$
Abstract
In this paper, we theoretically investigate the low-rank matrix recovery problem in the context of the unconstrained regularized nuclear norm minimization (RNNM) framework. Our theoretical findings show that, the RNNM method is able to provide a robust recovery of any matrix (not necessary to be exactly low-rank) from its few noisy measurements with a bounded constraint , provided that the -order restricted isometry constant (RIC) of satisfies a certain constraint related to . Specifically, the obtained recovery condition in the case of is found to be same with the sharp condition established previously by Cai and Zhang (2014) to guarantee the exact recovery of any rank- matrix via the constrained nuclear norm minimization method. More importantly, to the best of our knowledge,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Low-rank matrix recovery via regularized nuclear norm minimization111Email addresses: [email protected] (Wendong Wang), [email protected] (Feng Zhang), [email protected] (Jianjun Wang). The corresponding author is Jianjun Wang.
Wendong Wang1, Feng Zhang2, and Jianjun Wang2
-
College of Artificial Intelligence, Southwest University, Chongqing, 400715, China
-
School of Mathematics and Statistics, Southwest University, Chongqing, 400715, China
Abstract.
In this paper, we theoretically investigate the low-rank matrix recovery problem in the context of the unconstrained regularized nuclear norm minimization (RNNM) framework. Our theoretical findings show that, the RNNM method is able to provide a robust recovery of any matrix (not necessary to be exactly low-rank) from its few noisy measurements with a bounded constraint , provided that the -order restricted isometry constant (RIC) of satisfies a certain constraint related to . Specifically, the obtained recovery condition in the case of is found to be same with the sharp condition established previously by Cai and Zhang (2014) to guarantee the exact recovery of any rank- matrix via the constrained nuclear norm minimization method. More importantly, to the best of our knowledge, we are the first to establish the -order RIC based coefficient estimate of the robust null space property in the case of .
Key words.
Low-rank matrix recovery, regularized nuclear norm minimization, restricted isometry property, robust null space property
1 Introduction
Over the past decade, low-rank matrix recovery (LRMR) problem has attracted considerable interest of researchers in many fields, including computer vision [1], recommender systems [2], and machine learning [3], to name a few. Mathematically, this problem aims to recover an unknown low-rank matrix from
[TABLE]
where is an observed vector, is the unknown noise, and is a known linear measurement map defined as
[TABLE]
Here, for is denoted as a matrix with size , and is the trace function.
A popular approach for the LRMR problem is to solve a convex nuclear norm minimization (NNM) model
[TABLE]
So far, much work has been done to explore the theoretical performance of (1.2) in exact/robust recovery of any matrix that is not necessary to be exactly low-rank, see, e.g., [4, 5, 6, 7, 8, 9, 10, 11, 12]. More specifically, one may seek the sufficient conditions under which the upper-bound estimate of the recovery error will take the form
[TABLE]
where and are denoted by the optimal solution of (1.2) and the best rank- approximate of , respectively, and are two constants only related to the map . Note that (1.3) also indicates that under these conditions any rank- matrices, i.e., the matrices whose rank is at most , can be exactly recovered from (1.2) provided that there is no noise involved, i.e., =0 and . As one of the most powerful and widely used theoretical tools, restricted isometry property (RIP) captures particular attention in establishing these desired conditions and their resulting upper-bound estimates of the recovery error.
Definition 1.1** ([5]).**
A linear map given in (1.1) is said to satisfy the RIP with restricted isometry constant (RIC) of order , denoted by 222When is not an integer, we define as ., if is the smallest value such
[TABLE]
holds for every rank- matrix .
Some representative conditions include and in [13], and in [14], and and in [8]. In particular, the sharp conditions for exactly rank- matrix recovery, which takes the form of , have been completely given by Cai and Zhang in [10] and Zhang and Li in [11] for the cases of and , respectively. To be specific, we can write these sharp conditions into a compact form as below,
[TABLE]
In fact, under the condition (1.4) any (nearly) low-rank matrix can still be robustly recovered from (1.2) in the presence of noise, and more details can be found within [10, 11].
Generally, when confronted with the relatively small problems where a high degree of numerical precision is required, one can easily formulate (1.2) as a semidefinite program (SDP), see, e,.g., [4, 5], and thus numerically solve it by any of the standard SDP solvers. However, when the scale of the input data is relatively large, it is often not convenient (sometimes maybe impossible) to solve (1.2) by any standard SDP solvers. Moreover, it is also difficult to estimate a proper parameter value of in (1.2) to well accommodate the unknown noise. Instead of solving (1.2) directly, many algorithms, see, e.g., [15, 16, 17, 18], were proposed to solve the following unconstrained regularized NNM (RNNM) model
[TABLE]
where is a trade-off parameter. Compared with the constrained optimization problem (1.2), the unconstrained optimization problem (1.5) can well balance the low-rankness of the desired output matrix and the resultant recovery error with properly chosen values of parameter . It has been proved in the practical application that (1.5) is much more suitable for noisy measurements and approximately low-rank matrix recovery [18]. Nevertheless, one would hope that a result similar to (1.3) can be proved for (1.5) as well. To the best of our knowledge, Candès and Plan [5] gave the first RIP-based performance guarantee for (1.5), and their results show that, when the noise obeys , and the map satisfies , the robust recovery of any rank- matrices can be guaranteed through (1.5). However, after their initial work, the theoretical investigation of (1.5) is rarely reported. Note that their noise setting is based on the Dantzig selector rather than the often used -norm setting (i.e., ), and the obtained sufficient condition still has room to improve.
In this paper, by means of the powerful RIP tool, we theoretically investigate the performance guarantees of the unconstrained RNNM model (1.5) when the noise obeys . In summary, our contributions are two-fold. First, we show that if obeys for certain , then the unconstrained RNNM model (1.5) will be able to provide a robust matrix recovery performance. The obtained sufficient condition is in line with the sharp recovery condition (1.4) in the case of for the constrained problem (1.2). Second, by establishing the -order RIC based coefficient estimate of the robust null space property (RNSP) in the case of , we develop another -order RIC based sufficient condition for (1.5), and also obtain some new upper-bound estimates of recovery error.
The remainder of the paper is organized as follows. Section 2 introduces some necessary notations and lemmas. Section 3 presents a performance guarantee of the RNNM model (1.5) by means of the -order RIC with . In Section 4, we first establish a -order RIC based coefficient estimate of the RNSP with , and then obtain another parallel performance guarantee result for (1.5). Finally, conclusion and future work are given in Section 5.
2 Notations and preliminaries
2.1 Notations
Without loss of generality we assume that . For any positive integer , we denote , and for any , we denote . We denote the singular value decomposition (SVD) of as
[TABLE]
where is the th largest singular value of , and and are the left and right singular value vectors of , respectively. If there is no confusion caused we will write , and as , and for simplicity, respectively. For convenience, we denote , , and also denote by the vector whose element is equal to for and 0 otherwise. Then clearly and . In the end, for any given positive number , we denote as
[TABLE]
and for any , we further denote as
[TABLE]
where is denoted as the number of the nonzero elements in .
2.2 Three key lemmas
Before presenting our main results, we need some auxiliary lemmas. We start with introducing the first one, which provides a powerful tool to represent a non-sparse vector by the sparse ones. This lemma was first established by Cai and Zhang in [10], and later was extended by Zhang and Li in [12].
Lemma 2.1**.**
Suppose that is a positive number and is a positive integer with . Then obeys if and only if is in the convex hull of . In particular, any can be expressed as
[TABLE]
where , and . Moreover,
[TABLE]
We also need the following Lemma 2.2, which provides a family of RIC-based conditions under which the RNSP can be guaranteed. More importantly, under these conditions, we will show in Theorem 3.1 that the RNNM model (1.5) is able to robustly recover any matrix that is not necessary to be exactly low-rank.
Lemma 2.2**.**
For any fixed and any positive integer with , if the map obeys the RIP of order with
[TABLE]
then have the RNSP with and . Specifically, for any matrix and with , it holds that
[TABLE]
where
[TABLE]
The RNSP, including the classical NSP as a special case, has been demonstrated to be a powerful theoretical tool in providing the robust recovery guarantees of sparse signals or low-rank matrices via some certain constrained optimization problems, see, e.g., [6, 19, 20, 21]. However, so far it is still an open problem to verify whether a given matrix/map obeys the NSP or not, and also to determine the values of two coefficients (i.e., and ) in RNSP. We note that there exist few researchers who focused on the characterization of these two coefficients with some other theoretical tools (such as the RIC and coherence) that are relatively easy to be checked. To the best of our knowledge, the first -order RIC based coefficient estimate of RNSP was obtained independently by Shen, et al. in [22, Lemma 1] and Foucart in [19, Theorem 5] to fit the sparse recovery scenarios. Later, by using the -order RIC tool with , Ge, et al. in [23, Lemma 2] extended their results to a more general case. Recently, the coherence-based coefficient estimate of RNSP was obtained by Wang, et al. in [24, Lemma 3] to deal with the robust signal recovery from the basis pursuit de-noising [25]. In fact, our Lemma 2.2 can be viewed as an extension of [23, Lemma 2] established for the measurement matrix to that for the measurement map.
Proof of Lemma 2.2.
The proof mainly follows from [23]. When is not an integer, let , then and is an integer. In view of this, we here only need to prove Lemma 2.2 when is a positive integer for a given . Let’s denote the SVD of as , and
[TABLE]
Then clearly and . In what follows, we start with proving that
[TABLE]
To do so, we first show that . In fact it holds naturally if . When , we know that
[TABLE]
This also yields the desired result. On the other hand, we can easily induce from the definition of and that
[TABLE]
which, together with Lemma 2.1, indicates that we can express as
[TABLE]
with satisfying
[TABLE]
By further defining
[TABLE]
where , we can easily induce that both and are all rank-, and . Next, we consider estimating the upper and lower bounds of
[TABLE]
As to the upper bound of , we have
[TABLE]
where we have applied the -order RIP in the last inequality. As to the lower bound of , by applying the -order RIP on , we get
[TABLE]
where we have used in the first equality and (2.4) in the last inequality. Therefore, combing (2.2) and (2.2) gives
[TABLE]
Therefore,
[TABLE]
where we have used for any in the last inequality. This, together with , directly yields (2.2). The obtained condition (2.1) follows trivially from (2.2) by enforcing . ∎
In the end, we introduce the last lemma (i.e., Lemma 2.2), which characterizes the relationship between the original solution and the optimal solution of (1.5).
Lemma 2.3**.**
Assume that is the solution of (1.5) and . If the noisy measurements are observed with the noise level , then for any subset with , we have
[TABLE]
and
[TABLE]
Proof of Lemma 2.3.
Since is the optimal solution of (1.5), we have
[TABLE]
which is equivalent to
[TABLE]
As to the left-hand side (LHS) of (2.9), we have
[TABLE]
As to the right-hand side (RHS) of (2.9), we know
[TABLE]
where we have used [26, Theorem 1] in the first inequality. Then combing (2.9), (2.10), and (2.2) leads to the desired result (2.7), and (2.8) follows trivially from (2.7). ∎
3 Performance guarantee of RNMM model under -order RIC with
With previous preparations in mind, we present our first theoretical result.
Theorem 3.1**.**
For any observed vector with a bounded constraint , if the -order RIC of with satisfies condition (2.1), then we have
[TABLE]
where is the optimal solution of (1.5), and
[TABLE]
Remark 3.2**.**
The condition (2.1) has been obtained previously by Cai and Zhang in [10] for exact/robust signal recovery from (1.2), and it has been proved to be sharp for the exactly rank- matrix recovery when . To the best of our knowledge, we first extend nontrivially this condition from the constrained NNM model (1.2) to its unconstrained counterpart, i.e., the unconstrained RNNM model (1.5). On the other hand, note that the obtained coefficients (for ) might seem a bit complicated since they not only involve , but also involve , and . To remedy this problem, we need to do some simplification. We here only take and for examples. Since
[TABLE]
and
[TABLE]
we thus can induce from (3.2) that
[TABLE]
where and are two constants only relying on the map and the value of , and they are given as below.
[TABLE]
Note that the induced upper-bound estimate (3.3) also coincide with the ones established in [22, 23, 24, 27] in form.
Remark 3.3**.**
Theorem 3.1 states that if the measurement map obeys a certain -order RIP condition related to , any matrix that is not necessary to be exactly low-rank can be robustly recovered from (1.5) for any fixed parameter . According to the obtained results, it is difficult to determine a “good” parameter to yield a “good” solution in general case. In fact, so far it is still an open problem to theoretically determine a general parameter to make sure that the unconstrained RNNM model (1.5) can perform well. However, if taking a close look at the obtained (3.2) and (3.3), one will find that the selected parameter should not be much too large or small. Furthermore, if the desired matrix is assumed to be exactly rank-, then we can induce from (3.2) that
[TABLE]
Obviously, if we desire a optimal solution with recovery error as small as possible from (1.5), we need to make sure that the value of is also as small as possible with respect to the parameter . Considering that
[TABLE]
where the equality in (a) holds when satisfies
[TABLE]
a ideal selection of parameter is to set it as in (3.4), which is related to the two coefficient estimates of RNSP of the map , noise level and also the rank parameter . In realistic situations, such a setting of is impractical. However, from (3.4) we can capture some information to help set a proper , i.e., the value of is proportional to that of , and inversely proportional to that of .
Now, we present the proof of Theorem 3.1 as follows.
Proof.
We start with proving (3.1). Let’s define and . Then by using Lemma 2.2 and Lemma 2.3 with , we have
[TABLE]
Due to (2.1), and thus we induce from (3) that
[TABLE]
This directly leads to
[TABLE]
which is the desired (3.1).
Before proving (3.2), let’s define , , , and so on. Thus for , we have , and therefore,
[TABLE]
where the last inequality is due the fact that
[TABLE]
Note that by combining Lemma 2.2 and Lemma 2.3 with again, we can provide two upper-bound estimates of and , respectively, which are independent from each other. First, as to that of , we have
[TABLE]
which is equivalent to
[TABLE]
Similarly, we can also easily get the upper-bound estimate of as below.
[TABLE]
On the other hand, by combining (3) and Lemma 2.2 with , we have
[TABLE]
which, together with (3) and (3.8), yields
[TABLE]
Now by combining (3), (3), (3.8) and (3), we can estimate as follows.
[TABLE]
which, together with (3.1), yields
[TABLE]
This completes the proof of (3.2). ∎
4 -order RIC based coefficient estimate of RNSP with
In the previous section, a family of -order RIP conditions and their resultant recovery error estimate results are established for the robust matrix recovery from the unconstrained RNNM model (1.5). As is seen from Theorem 3.1 and its proof, Lemma 2.2, i.e., the RNSP with -order RIC based coefficient estimate, plays a vital role in establishing the desired results. Unfortunately, Lemma 2.2, as well as its resultant Theorem 3.1, only considers the case of . In this section, we will show that under the -order RIP condition with , (1.5) is still able to provide a robust recovery performance. Before moving on, we have to introduce [11, Lemma 1] since it will be frequently used in the proof of our main results, and one can find it in Lemma 4.1.
Lemma 4.1**.**
Let be a vector with . Choose all subsets with , with , then we have
[TABLE]
and
[TABLE]
Now we are ready to present our second theoretical result, i.e., the -order RIC based coefficient estimate of RNSP with .
Theorem 4.2**.**
For any fixed and any positive integer with , if the map obeys the RIP of order with
[TABLE]
where , then obeys the RNSP with and . Specifically, for any matrix and any subset with , it holds that
[TABLE]
where
[TABLE]
with , and and being given in (4) and (4), respectively.
Remark 4.3**.**
To the best of our knowledge, Theorem 4.2 for the first time presents the -order RIC based coefficient estimate of the RNSP in the case of . This theorem, together with Lemma 2.2, affirmatively answers under what kind of -order RIP condition with , RNSP will hold. Note that we can also resort to Theorem 4.2 to yield a similar result with Theorem 3.1, and one can find it in Theorem 4.4. Since the proof Theorem 4.4 is almost same with that of Theorem 3.1, we here omit it.
Theorem 4.4**.**
For any observed vector with a bounded constraint , if the -order RIC of satisfies (4.1), then we have
[TABLE]
where is denoted as the optimal solution of (1.5).
Remark 4.5**.**
Theorem 4.4 and Theorem 3.1 indicate that under a certain -order RIP condition with , the unconstrained RNNM model (1.5) is able to provide a robust recovery performance for any matrix that is not necessary to be exactly low-rank. Note that the previous analysis results on Theorem 3.1 still apply to Theorem 4.4 if one replaces and with and , respectively. On the other hand, one may wonder how the obtained condition (4.1) performs when compared with the sharp condition (1.4) established for the constrained NNM model (1.2).
Figure 1 plots the comparison between these two recovery conditions. Unfortunately, our condition (4.1) is a bit weaker than the sharp condition (1.4). Considering the fact that problems of type (1.5) and type (1.2) are not completely equivalent in both theoretical and applied aspects, it is still an open problem to determine whether the sharp condition (1.4) for is appropriate for (1.5) or not. According to the established Lemma 2.2 and Theorem 3.1, it is expected that the condition (4.1) with for both Theorem 4.2 and Theorem 4.4 can be further improved to the sharp condition (1.4) with . We hope we can solve this problem in the future.
Proof of Theorem 4.2.
Our proof is inspired by [11]. We here only prove the case when is an integer since the case when is not an integer can be induced easily. Note that for any subset with and a fixed subset , and always hold, and hence we will prove
[TABLE]
to complete the proof of (4.2). We start with denoting the SVD of as and . We denote when is an even number, and and when is an odd number. We also denote all the possible index with and , respectively, where and . According to Lemma 4.1, we directly have
[TABLE]
Besides, we need to introduce the following partition for , i.e.,
[TABLE]
By using the similar manipulations as in proof of Lemma 2.2, we can express and as
[TABLE]
respectively, with and satisfying
[TABLE]
Let’s denote and , where
[TABLE]
then we have and . We also need to denote
[TABLE]
where is a number which will be determined later. Since , , , are rank-, -, -, and -, respectively, we can easily induce that any linear combination of and , as well as that of and , is rank-. Let’s first apply -order RIP on . This directly gives
[TABLE]
where we have used (4.4) and (4.7) in the last equality. Similarly, we can also apply -order RIP on to get
[TABLE]
Therefore, combing (4) and (4) yields
[TABLE]
Due to the fact that
[TABLE]
we can further induce from (4) that
[TABLE]
On the other hand, according to the definition of and we can induce that
[TABLE]
To simplify and , from the definition of and the SVD of we can induce that
[TABLE]
and
[TABLE]
where we have used Lemma (4.1) again. Similarly, we can also get
[TABLE]
and . Those indicate that
[TABLE]
and
[TABLE]
and thus we can equivalent write (4) as
[TABLE]
where . Since , , and have been well defined, we can easily know that . Note that there are exactly sets for a fixed , exactly sets for a fixed , and exactly sets with for two fixed indices with and . Therefore, by means of Lemma 4.1 and some simple calculation, we get
[TABLE]
Moreover, applying -order RIP on the left-hand side (LHS) of (4) and also using (4.4) again, we also have
[TABLE]
Now combining (4) and (4) yields
[TABLE]
As to the LHS of (4), we can directly induce from (4) that
[TABLE]
where we have used . As to the right-hand side (RHS) of (4), we induce from (4) that
[TABLE]
where, with the aid of [8, Lemma 4.1], we have used
[TABLE]
Therefore, we can induce from (4.18) and (4) that
[TABLE]
which is also equivalent to
[TABLE]
where .
Case 1: . In this case, we know from that
[TABLE]
To make sure , we only need to set , i.e.,
[TABLE]
To obtain as large an upper bound as possible, we need to set , and thus the largest upper bound with respect to will take the form
[TABLE]
One can easily check that holds for any . Based on the above settings, we can further know from (4.20) that
[TABLE]
and hence get
[TABLE]
which is the desired (4.2) for . To guarantee that (4.2) satisfies the RNSP with and , we have to set , i.e.,
[TABLE]
Obviously, the obtained condition (4.23) is the desired condition (4.1) for , which is also included in (4.22).
Case 2: . In this case, by using , we can also know from that
[TABLE]
To make sure that and has as large an upper bound as possible, By using the similar manipulations as in Case 1, we select , and thus get . Obviously, holds for any . Furthermore, we can also know from (4.20) that
[TABLE]
and hence get
[TABLE]
which is the desired (4.2) for . Similarly, to enforce (4.2) to obey the RNSP with and , we have to set , i.e.,
[TABLE]
which is the desired condition (4.1) for . Combining Case 1 and Case 2, we obtain the desired (4.3), and thus establish the results showed in Theorem 4.2.
∎
5 Conclusion and future work
This paper has considered the robust matrix recovery from the unconstrained RNMM model. First, equipped with the powerful -order RIP tool for , we developed a family of -order RIC based coefficient estimates for the RNSP. To the best of our knowledge, the obtained RNSP results in the case of have not been explored before. Furthermore, by mean of these RNSP results, some upper-bound estimates of error were established for the unconstrained RNMM model to guarantee the robust matrix recovery. As we have pointed out in Remark 4.5, one of our future work will focus on extending the condition (4.1) with to the sharp condition (1.4) with . Besides, determining a proper from the theoretical aspect for the unconstrained RNMM model will be another future work.
Acknowledgement
The authors would like to thank the editors and referees for their valuable comments that improve the presentation of this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E.J. Candès, X.D. Li, Y. Ma, and J. Wright. Robust principal component analysis? J. ACM , 58(3):1–37, 2011.
- 2[2] R. Mazumder, T. Hastie, and R. Tibshirani. Spectral regularization algorithms for learning large incomplete matrices. J. Mach. Learn. Res. , 11:2287–2322, 2010.
- 3[3] A. Argyriou, T. Evgeniou, and M. Pontil. Convex multitask feature learning. Mach. Learn. , 73(3):243–272, 2008.
- 4[4] B. Recht, M. Fazel, and P.A. Parrilo. Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization. SIAM review , 52(3):471–501, 2010.
- 5[5] E.J. Candès and Y. Plan. Tight oracle inequalities for low-rank matrix recovery from a minimal number of noisy random measurements. IEEE Trans. Inf. Theory , 57(4):2342–2359, 2011.
- 6[6] Simon Foucart and Holger Rauhut. A mathematical introduction to compressive sensing . Birkhäuser Verlag, Basel, 2013.
- 7[7] M.-J. Lai and W.T. Yin. Augmented ℓ 1 subscript ℓ 1 \ell_{1} and nuclear-norm models with a globally linearly convergent algorithm. SIAM J. Imaging Sci. , 6(2):1059–1091, 2013.
- 8[8] T. T. Cai and A. R. Zhang. Sharp RIP bound for sparse signal and low rank matrix recovery. Appl. Comput. Harmon. Anal. , 35(1):74–93, 2013.
