Study of Compressed Randomized UTV Decompositions for Low-Rank Matrix Approximations in Data Science
M. Kaloorazi, R. C. de Lamare

TL;DR
This paper introduces CoR-UTV, a novel randomized matrix decomposition method that efficiently computes low-rank approximations of large matrices with fewer data passes, suitable for modern computational platforms.
Contribution
The paper proposes the CoR-UTV algorithm, a new randomized low-rank matrix decomposition technique with variants, improving efficiency and robustness in data science applications.
Findings
CoR-UTV requires only a few passes over the data.
It runs in $O(mnk)$ operations, suitable for large matrices.
Simulations show CoR-UTV outperforms existing methods.
Abstract
In this work, a novel rank-revealing matrix decomposition algorithm termed Compressed Randomized UTV (CoR-UTV) decomposition along with a CoR-UTV variant aided by the power method technique is proposed. CoR-UTV computes an approximation to a low-rank input matrix by making use of random sampling schemes. Given a large and dense matrix of size with numerical rank , where , CoR-UTV requires a few passes over the data, and runs in floating-point operations. Furthermore, CoR-UTV can exploit modern computational platforms and can be optimized for maximum efficiency. CoR-UTV is also applied for solving robust principal component analysis problems. Simulations show that CoR-UTV outperform existing approaches.
| Methods | Time(s) | Iter. | ||||||
| 1000 | 50 | 5e4 | InexactALM 50 5e4 4.1 12 2.1e-6 ALM-CoRUTV 50 5e4 0.6 12 9.6e-6 | |||||
| 2000 | 100 | 2e5 | InexactALM 100 2e5 27.4 12 2.7e-6 ALM-CoRUTV 100 2e5 3.7 12 8.3e-6 | |||||
| 3000 | 150 | 45e4 | InexactALM 150 45e4 75.6 12 3.1e-6 ALM-CoRUTV 150 45e4 9.4 12 8.7e-6 | |||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Tensor decomposition and applications · Stochastic Gradient Optimization Techniques
Study of Compressed Randomized UTV Decompositions for Low-Rank Matrix Approximations in Data Science
Abstract
In this work, a novel rank-revealing matrix decomposition algorithm termed Compressed Randomized UTV (CoR-UTV) decomposition along with a CoR-UTV variant aided by the power method technique is proposed. CoR-UTV computes an approximation to a low-rank input matrix by making use of random sampling schemes. Given a large and dense matrix of size with numerical rank , where , CoR-UTV requires a few passes over the data, and runs in floating-point operations. Furthermore, CoR-UTV can exploit modern computational platforms and can be optimized for maximum efficiency. CoR-UTV is also applied for solving robust principal component analysis problems. Simulations show that CoR-UTV outperform existing approaches.
**Index Terms— ** Rank-revealing decompositions, low-rank approximations, randomized algorithms, robust PCA.
1 Introduction
Low-rank matrix approximations play an increasingly important role in signal processing and its applications. Such compact representations which retain the key features of a high-dimensional matrix provide a significant reduction in memory requirements, and more importantly, computational costs when the latter scales, e.g., according to a high-degree polynomial, with the dimensionality. Matrices with low-rank structures have found many applications in background subtraction [1, 2], system identification [3], IP network anomaly detection [4, 5], latent variable graphical modeling [6], subspace clustering [7, 8] and sensor and multichannel signal processing [9, 10, 11, 12, 13, 14, 15], [16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27]. .
Singular value decomposition (SVD) [28] and the rank-revealing QR (RRQR) decomposition [29, 30] are among the most commonly used algorithms for computing a low-rank approximation of a matrix. On the other hand, a UTV decomposition [31] is a compromise between the SVD and the RRQR decomposition with the virtues of both: UTV (i) is more efficient than the SVD, and (ii) provides information on the numerical null space of the matrix [31]. Given a matrix , the UTV algorithm computes a decomposition , where and have orthonormal columns, and is triangular (either lower or upper triangular). These deterministic algorithms, however, are computationally expensive for large data sets. Furthermore, standard techniques for their computation are challenging to parallelize in order to utilize advanced computer architectures [32, 33]. Recently developed algorithms for low-rank approximations based on random sampling schemes, however, have been shown to be remarkably efficient, highly accurate and robust, and are known to outperform existing algorithms in many practical situations [34, 35, 32, 33]. The power of randomized algorithms lies in that (i) they are computationally efficient, and (ii) their main operations can be optimized for maximum efficiency on modern architectures.
This work presents a novel randomized rank-revealing method termed compressed randomized UTV (CoR-UTV) decomposition. Given a large and dense rank- matrix , CoR-UTV computes a low-rank approximation of such that
[TABLE]
where and have orthonormal columns, and is triangular (either lower or upper, whichever is preferred). CoR-UTV only requires a few passes through data, for a matrix stored externally, and runs in floating-point operations (flops). The operations of CoR-UTV involve matrix-matrix multiplication, the QR and RRQR decompositions. Due to recently developed Communication-Avoiding QR algorithms [36, 37, 38], which can perform the computations with optimal/minimum communication costs, CoR-UTV can be optimized for peak machine performance on modern architectures. We illustrate, through numerical examples, that CoR-UTV is rank-revealer and provides a highly accurate low-rank approximation to a given matrix. Furthermore, we apply CoR-UTV to solve the robust principal component analysis (robust PCA) problem [39, 40], i.e., to decompose a given matrix with grossly corrupted entries into a low-rank matrix plus a sparse matrix of outliers.
The rest of this paper is structured as follows. In Section 2, we introduce the mathematical model of the data and discuss related works. In Section 3, we describe the proposed CoR-UTV method in detail. In Section 4, we develop an algorithm for robust PCA using CoR-UTV. In Section 5, we present and discuss simulation results and conclusions are given in Section 6.
2 Mathematical Model and Related Works
Given a matrix , where , with numerical rank , its SVD [28] is defined as:
[TABLE]
where , , and have orthonormal columns, and are diagonal matrices containing the singular values, i.e., and . can be written as , where , and . The SVD constructs the optimal rank- approximation to , [28] i.e.,
[TABLE]
where and denote the spectral norm and the Frobenius norm, respectively. In this paper we focus on the matrix defined.
The SVD is highly accurate for computing singular subspaces and singular values. However, its computation is costly for large data sets. Moreover, standard techniques for its computation are challenging to parallelize in order to take advantage of modern processors [32, 33]. An economical version of the SVD is the partial SVD based on Krylov subspace methods, such as the Lanczos and Arnoldi algorithms, which constructs an approximate SVD of an input matrix, for instance , at a cost . However, the partial SVD suffers from two drawbacks: (i) it is numerically unstable [41, 28], and (ii) it does not lend itself to parallel implementations [32, 33], which makes it unsuitable for modern architectures. Other approaches for low-rank matrix approximations include the RRQR [29] and the UTV decompositions [31]. Even though the QR with column pivoting (QRCP) and UTV decompositions provide highly accurate approximations to , they suffer from two drawbacks: (i) they are costly, i.e., , and (ii) methods for their computation are challenging to parallelize and hence they cannot exploit modern computational platforms [32, 33].
Recently developed algorithms for low-rank approximations based on randomization [34, 35, 32, 33, 42] have attracted significant attention. The randomized algorithms project a large input matrix onto a lower dimensional space using a random matrix, and apply deterministic methods on the smaller matrix to give an approximation of the matrix. Hence (i) they are computationally efficient, and (ii) lend themselves to parallel implementation. Halko et al. [32] proposed randomized SVD (R-SVD) in which a smaller matrix is formed by linear combinations of columns of the given matrix. The low-rank approximation is then given through the SVD of a reduced-size matrix. Gu [33] applied a slightly modified version of the R-SVD algorithm to improve subspace iteration methods, and presents a new error analysis. Another algorithm proposed in [32, Section 5.5], which we call two-sided randomized SVD (TSR-SVD), is a single-pass method, i.e., it required only one pass through data. It captures most attributes of the data by means of forming the smaller matrix through linear combinations of both rows and columns of the given matrix, and then applies the SVD for further computations. The work in [43] proposed a randomized algorithm termed subspace-orbit randomized SVD (SOR-SVD). SOR-SVD alternately projects the matrix onto its column and row space. The matrix is then transformed into a lower dimensional space, and a truncated SVD follows in order to construct an approximation.
TSR-SVD gives poor approximation compared to the optimal SVD due to the single-pass strategy. SOR-SVD has shown better performance than TSR-SVD, however both methods apply the SVD on the reduced-size matrix. This computation may be burdensome in terms of communication cost [36] for large matrices. In this work, we develop a randomized algorithm for low-rank approximation that with comparable flops (i) outperforms the TSR-SVD in terms of accuracy, and (ii) can utilize advanced computer architectures better than TSR-SVD as well as SOR-SVD.
3 Compressed Randomized UTV Decompositions
In this section, we present a randomized rank-revealing decomposition algorithm termed compressed randomized UTV (CoR-UTV) decomposition [44], which computes a low-rank approximation of a given matrix. We focus on the matrix with , where CoR-UTV, in the form of (1), produces an upper triangular middle matrix . The modifications required for a CoR-UTV for the case that produces a lower triangular middle matrix is straightforward.
3.1 Proposed CoR-UTV Decompositions
Given the matrix and an integer , the basic version of CoR-UTV is computed as follows: using a random number generator, we form a matrix with entries drawn independent, identically distributed (i.i.d.) from the standard Gaussian distribution. We then compute the matrix product:
[TABLE]
where is, in fact, a projection onto the subspace spanned by columns of . Having , we form :
[TABLE]
where is, in fact, a projection onto the subspace spanned by rows of . Using a QR decomposition, we factor and such that:
[TABLE]
where and are approximate bases for and , respectively. We now compress by left and right multiplications by the orthonormal bases obtained, forming the matrix :
[TABLE]
We then compute a QRCP of :
[TABLE]
The CoR-UTV-based low-rank approximation of is given by
[TABLE]
where and construct approximations to the leading left and right singular vectors of , respectively, and is upper triangular with diagonals approximating the first singular values of .
CoR-UTV requires three passes through data, for a matrix stored externally, but it can be altered to revisit the data only once. To this end, the compressed matrix (7) can be approximated as follows: both sides of the currently unknown are postmultiplied by . Having defined and , then .
CoR-UTV is accurate for matrices whose singular values display some decay, however in applications where the data matrix has a slowly decaying singular spectrum, it may produce a poor approximation compared to the SVD. Thus, we incorporate steps of a power iteration [35, 32] to improve the accuracy of the algorithm in these circumstances. Given the matrix , and integers and , the resulting algorithm is described in Alg. 1.
3.2 Computational Complexity
The cost of an algorithm involves both arithmetic, i.e., the number of flops, and communication, i.e., data movement either between different levels of a memory hierarchy or between processors [36]. On multicore and accelerator-based computers, for a data matrix stored externally, the communication cost becomes substantially more expensive compared to the arithmetic [36, 45]. The randomized algorithms operate on a compressed version of the data matrix rather than a matrix itself and therefore can be organized to exploit modern computational environments better than their classical counterparts.
To decompose , the simple version of CoR-UTV incurs the following costs: Step 1 costs , Step 2 costs , Step 3 costs , Step 4 costs , Step 5 costs (if is approximated by , this step costs ), Step 6 costs , Step 7 costs . The dominant cost of Steps 1-7 occurs when multiplying and with the corresponding matrices. Thus
[TABLE]
The sample size parameter is typically close to the minimal rank . The simple form of CoR-UTV requires either three or two passes (when is approximated by ) through data to factor . When the power method is incorporated, CoR-UTV requires either or passes (when is approximated by ) over the data with arithmetic costs of or , respectively.
In addition to matrix-matrix multiplications and QR decompositions, CoR-UTV performs one QRCP on an matrix, however TSR-SVD and SOR-SVD perform an SVD on the matrix. The SVD is more expensive than QRCP and, furthermore, recently developed QRCP algorithms based on randomization can perform the factorization with minimum communication costs [37, 38, 46], while standard techniques to compute an SVD are challenging for parallelization [32, 33]. Hence for large matrices to be factored on high performance computing architectures, where the compressed matrix does not fit into fast memory, the execution time to compute CoR-UTV can be substantially less than those of TSR-SVD and SOR-SVD. This is an advantage of CoR-UTV over TSR-SVD and SOR-SVD.
4 Robust PCA with CoR-UTV
This section describes how to solve the robust PCA problem using the proposed CoR-UTV method. Robust PCA [39, 40] represents an input low-rank matrix whose fraction of entries being corrupted, as a linear superposition of a low-rank matrix and a sparse matrix of outliers such as , by solving the following convex program:
[TABLE]
where is the nuclear norm of any matrix B, is the -norm of B, and is a tuning parameter. One efficient method to solve (14) is the method of augmented Lagrange multipliers (ALM) [47]. The ALM method yields the optimal solution, however its bottleneck is computing the costly SVD at each iteration to approximate the low-rank component of [40, 48]. To address this issue and to speed up the convergence of the ALM method, the work in [48] proposes a few techniques including predicting the principal singular space dimension, a continuation technique [49], and a truncated SVD by using PROPACK package [50]. The modified algorithm [48] substantially improves the convergence speed, however the truncated SVD [50] employed uses the Lanczos algorithm that (i) is unstable, and (ii) due to the limited data reuse in its operations, has very poor performance on modern architectures [41, 28, 32, 33].
To address this issue, by considering the original objective function proposed in [39, 40, 48], we apply CoR-UTV as a surrogate to the truncated SVD to solve the robust PCA problem. We adopt the continuation technique [49, 48], which increases in each iteration. The proposed ALM-CoRUTV method is given in Alg. 2.
In Alg. 2, for any matrix with a CoR-UTV decomposition described in Section 3, refers to a CoR-UTV thresholding operator defined as:
[TABLE]
where is the number of diagonals of greater than , refers to a singular value thresholding operator defined as , where is a shrinkage operator [51], , , , , , and are initial values. The main operation of ALM-CoRUTV is computing CoR-UTV in each iteration, which is efficient in terms of flops, , and can be computed with minimum communication costs.
5 Numerical Experiments
In this section, we present simulations that evaluate the performance of CoR-UTV for approximating a low-rank input matrix. We show that CoR-UTV provides highly accurate singular values and low-rank approximations, and compare CoR-UTV against competing algorithms from the literature. We also employ CoR-UTV for solving the robust PCA problem.
5.1 Rank-Revealing Property & Singular Values Estimation
We first show that CoR-UTV (i) is rank revealer, i.e., the gap in the singular value spectrum of the matrix is revealed, and (ii) provides highly accurate singular values. For the randomized algorithms considered, namely CoR-UTV, TSR-SVD, and SOR-SVD, the results presented are averaged over 20 trials. Each trial was run with the same input matrix with an independent draw of the test matrix. Due to space constraints, we only consider one class of low-rank matrices, and for simplicity we focus on a square matrix.
We construct a noisy rank- matrix of order generated as , where and are random orthonormal matrices, is diagonal containing the singular values s that decrease linearly from to , , and is a normalized Gaussian matrix. We set .
We compare the singular values of the matrix computed by CoR-UTV against those of competing methods such as the SVD [28], QRCP [29], UTV [52] and TSR-SVD [32]. For CoR-UTV and TSR-SVD, we arbitrarily set the sample size parameter to . Both algorithms require the same number of passes over , either two or when the power method is used, to perform a factorization.
The results are shown in Fig. 1. It is observed that (i) CoR-UTV strongly reveals the numerical rank , (ii) with no power iterations (), CoR-UTV provides very good approximations to singular values and outperforms TSR-SVD in approximating both leading and trailing singular values, (iii) with , CoR-UTV delivers singular values as accurate as the optimal SVD, (iv) QRCP only suggests the gap in the singular spectrum, and gives a fuzzy approximation to singular values of the matrix.
5.2 Low-Rank Approximation
We now compare the low-rank approximation constructed by our method against those of the SVD, QRCP, TSR-SVD, and SOR-SVD [43]. We construct a rank- approximation to by varying the sample size parameter with the rank fixed, and calculate the error:
[TABLE]
The results are shown in Fig. 2. It is observed that (i) when , CoR-UTV and SOR-SVD show similar performances, while TSR-SVD shows the worst performance, (ii) when , the errors resulting from CoR-UTV show no loss of accuracy compared to the optimal SVD. In this case, QRCP has the poorest performance.
5.3 Robust Principal Component Analysis
Here, we examine the efficiency and efficacy of ALM-CoRUTV in Alg. 2 for recovering the low-rank and sparse components of data. We compare the results obtained with those of the efficient inexact ALM method by [48], called InexactALM hereafter.
Robust PCA represents an input low-rank matrix , whose a fraction of entries being corrupted, as a linear superposition of a low-rank matrix and a sparse matrix of outliers such as , by solving the following convex program:
[TABLE]
where is the nuclear norm of any matrix B, is the -norm of B, and is a tuning parameter [53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63], [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75], [76, 77, 78, 79, 80, 81, 82, 83, 84]. The matrix is generated as , where , have standard Gaussian distributed entries. The error matrix has non-zero entries independently drawn from the set -80, 80. We apply the ALM-CoRUTV and InexactALM algorithms to to recover and . The numerical results are summarized in Table 1, where the rank of and .
In our experiments, we adopt the initial values suggested in [48]. The algorithms are terminated when is satisfied, where is the pair of output of either algorithm. In the Table, refers to the runtime in seconds, refers to the number of iterations, and refers to the relative error.
CoR-UTV requires a prespecified rank to perform the factorization. Thus, we set , as a random start, and . The results in Table 1 show that ALM-CoRUTV detects the exact rank of the input matrix, provides the exact optimal solution, and outperforms InexactALM in terms of runtime.
6 Conclusions
In this paper, we have presented CoR-UTV for computing low-rank approximations of an input matrix. Simulations show that CoR-UTV reveals the numerical rank more sharply than QRCP, and provides results as good as those of the optimal SVD. CoR-UTV is more efficient than SVD, QRCP, UTV, TSR-SVD and SOR-SVD in terms of cost. CoR-UTV can better exploit advanced computational platforms by leveraging higher levels of parallelism than all compared algorithms. We have also applied CoR-UTV to solve the robust PCA problem via the ALM method. Simulations show that ALM-CoRUTV outperforms efficiently implemented InexactALM.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. Bouwmans, S. Javed, H. Zhang, Z. Lin, and R. Otazo, “On the applications of robust pca in image and video processing,” Proceedings of the IEEE , vol. 106, no. 8, pp. 1427–1457, Aug 2018.
- 2[2] M. F. Kaloorazi and R. C. de Lamare, “Low-Rank and Sparse Matrix Recovery Based on a Randomized Rank-revealing Decomposition,” in 22nd Intl Conf. on DSP 2017, UK , Aug 2017.
- 3[3] M. Fazel, T. K. Pong, D. Sun, and P. Tseng, “Hankel Matrix Rank Minimization with Applications to System Identification and Realization,” SIAM. J. Matrix Anal. & Appl. , vol. 34, no. 3, pp. 946––977, Apr 2013.
- 4[4] M. F. Kaloorazi and R. C. de Lamare, “Anomaly Detection in IP Networks Based on Randomized Subspace Methods,” in ICASSP , 2017.
- 5[5] A. M. J. Niyaz Hussain and M. Priscilla, “A Survey on Various Kinds of Anomalies Detection Techniques in the Mobile Adhoc Network Environment,” Int J S Res CSE & IT. , vol. 3, no. 3, pp. 1538–1541, 2018.
- 6[6] V. Chandrasekaran, P. Parrilo, and A. Willsky, “Latent variable graphical model selection via convex optimization,” The Ann. of Stat. , vol. 40, no. 4, pp. 1935––1967.
- 7[7] M. Rahmani and G. Atia, “Coherence Pursuit: Fast, Simple, and Robust Principal Component Analysis,” IEEE Trans. Signal Process. , vol. 65, no. 23, pp. 6260–6275, Dec 2017.
- 8[8] T.-H. Oh, Y. Matsushita, Y.-W. Tai, and I. S. Kweon, “Fast randomized singular value thresholding for low-rank optimization,” IEEE Trans. Pattern Anal. Mach. Intell. , vol. 40, no. 2, pp. 376–391, Mar. 2017.
