Co-regularized Multi-view Sparse Reconstruction Embedding for Dimension Reduction
Huibing Wang, Jinjia Peng, Xianping Fu

TL;DR
This paper introduces CMSRE, a novel multi-view dimension reduction method that effectively captures nonlinear manifold structures by exploiting sparse reconstructions and correlations across multiple views.
Contribution
The paper proposes a co-regularized multi-view sparse reconstruction embedding method that handles high-dimensional nonlinear features and preserves cross-view correlations for better low-dimensional representations.
Findings
Effective in document classification tasks
Improves face recognition accuracy
Enhances image retrieval performance
Abstract
With the development of information technology, we have witnessed an age of data explosion which produces a large variety of data filled with redundant information. Because dimension reduction is an essential tool which embeds high-dimensional data into a lower-dimensional subspace to avoid redundant information, it has attracted interests from researchers all over the world. However, facing with features from multiple views, it's difficult for most dimension reduction methods to fully comprehended multi-view features and integrate compatible and complementary information from these features to construct low-dimensional subspace directly. Furthermore, most multi-view dimension reduction methods cannot handle features from nonlinear spaces with high dimensions. Therefore, how to construct a multi-view dimension reduction methods which can deal with multi-view features from…
| Solving for all views |
|---|
| Input: |
| : features from all views; : the regularized parameter. |
| Output: |
| : the low-dimensional representations for all views. |
| Optimization Procedure: |
| 1. Initialize all for all views using single-view optimization in Eq.5. |
| 2. Iterate |
| 2.1. Utilized sparse representation to construct correlation between neighbors for all views. |
| 2.2. Construct using for all views. |
| 2.3. Combining with in Eq.7, can be calculated according to Eq.10. |
| 3. Until Convergence |
| 4. Return |
| Methods | Dimension=10 | Dimension=20 | Dimension=30 | |||
|---|---|---|---|---|---|---|
| Mean | Max | Mean | Max | Mean | Max | |
| CMSRE | 84.58 | 89.86 | 84.64 | 92.75 | 86.16 | 94.20 |
| Pariwise[40] | 79.36 | 85.51 | 80.62 | 89.86 | 82.61 | 91.30 |
| Centroid[40] | 82.00 | 86.96 | 82.90 | 89.86 | 85.29 | 89.86 |
| MSE | 79.14 | 85.51 | 79.29 | 88.41 | 84.57 | 89.86 |
| LE_BSV | 81.59 | 88.41 | 82.46 | 89.86 | 83.86 | 86.96 |
| LLE_BSV | 78.04 | 85.51 | 80.66 | 86.96 | 82.17 | 88.41 |
| PCA_BSV | 80.02 | 88.41 | 80.81 | 89.86 | 84.06 | 89.86 |
| Methods | Dimension=10 | Dimension=20 | Dimension=30 | |||
|---|---|---|---|---|---|---|
| Mean | Max | Mean | Max | Mean | Max | |
| CMSRE | 55.79 | 60.03 | 66.46 | 70.90 | 68.35 | 71.61 |
| Pariwise[40] | 50.06 | 54.52 | 51.73 | 55.93 | 52.24 | 56.15 |
| Centroid[40] | 50.82 | 55.79 | 56.49 | 60.17 | 58.56 | 63.42 |
| MSE | 53.10 | 57.49 | 58.36 | 62.85 | 58.99 | 62.99 |
| LE_BSV | 44.34 | 47.74 | 57.05 | 61.02 | 57.48 | 61.30 |
| LLE_BSV | 46.67 | 50.71 | 50.67 | 55.79 | 51.38 | 55.93 |
| PCA_BSV | 51.61 | 55.23 | 52.41 | 56.92 | 52.87 | 58.19 |
| Methods | Dimension=10 | Dimension=20 | Dimension=30 | |||
|---|---|---|---|---|---|---|
| Mean | Max | Mean | Max | Mean | Max | |
| CMSRE | 70.38 | 80.00 | 75.08 | 87.69 | 80.78 | 90.77 |
| Pariwise[40] | 62.46 | 72.31 | 72.68 | 86.15 | 73.98 | 87.69 |
| Centroid[40] | 62.92 | 69.23 | 76.53 | 88.17 | 77.78 | 90.77 |
| MSE | 59.69 | 66.15 | 65.90 | 76.92 | 66.99 | 83.08 |
| LE_BSV | 53.92 | 63.08 | 60.60 | 76.92 | 58.78 | 73.85 |
| LLE_BSV | 48.08 | 56.92 | 57.42 | 72.31 | 57.88 | 72.31 |
| PCA_BSV | 56.69 | 63.08 | 60.58 | 73.01 | 60.65 | 73.85 |
| Parameter | ||||||
|---|---|---|---|---|---|---|
| Accuracy | 65.85 | 74.31 | 76.78 | 78.32 | 80.78 | 80.03 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications
Co-regularized Multi-view Sparse Reconstruction Embedding for Dimension Reduction
Huibing Wang
Jinjia Peng
Xianping Fu
Information Science and Technology College, Department of Computer Science and Technology, Dalian Maritime University, Dalian, Liaoning, 116026, China.
Abstract
With the development of information technology, we have witnessed an age of data explosion which produces a large variety of data filled with redundant information. Because dimension reduction is an essential tool which embeds high-dimensional data into a lower-dimensional subspace to avoid redundant information, it has attracted interests from researchers all over the world. However, facing with features from multiple views, it’s difficult for most dimension reduction methods to fully comprehended multi-view features and integrate compatible and complementary information from these features to construct low-dimensional subspace directly. Furthermore, most multi-view dimension reduction methods cannot handle features from nonlinear spaces with high dimensions. Therefore, how to construct a multi-view dimension reduction methods which can deal with multi-view features from high-dimensional nonlinear space is of vital importance but challenging. In order to address this problem, we proposed a novel method named Co-regularized Multi-view Sparse Reconstruction Embedding (CMSRE) in this paper. By exploiting correlations of sparse reconstruction from multiple views, CMSRE is able to learn local sparse structures of nonlinear manifolds from multiple views and constructs significative low-dimensional representations for them. Due to the proposed co-regularized scheme, correlations of sparse reconstructions from multiple views are preserved by CMSRE as much as possible. Furthermore, sparse representation produces more meaningful correlations between features from each single view, which helps CMSRE to gain better performances. Various evaluations based on the applications of document classification, face recognition and image retrieval can demonstrate the effectiveness of the proposed approach on multi-view dimension reduction.
keywords:
Multi-view , Dimension Reduction , Sparse Reconstruction , Multi-view Sparse Reconstruction Embedding.
††journal: Neurocomputing
1 Introduction
In recent years, with the development of information technology, researchers have proposed a plenty of algorithms or equipments to describe samples from various perspectives [1, 2]. For one single sample, multi-view features can reflect different properties of one same sample from different viewpoints. Take image processing as examples, one image can be represented by multiple feature descriptors, such as Local Binary Patterns (LBP) [3], Locality-constrained Linear Coding (LLC) [4], Histograms of Oriented Gradients (HOG) [5], etc [6, 7]. Even though features extracted from different descriptors show multiple properties, they are all descriptions of one same image, which contain some common interior relations between each other. Specifically, because more information exists in multiple views rather than in one single view, how to fully integrate compatible and complementary information from multiple views, which can make better decisions for whole system is of vital importance but challenging.
For most research fields, such as image classification [8, 9], image retrieval [10, 11, 12, 13, 14], image understanding [15, 16], most features which are utilized in these fields always locate in high-dimensional spaces. Therefore, it will consume much time and computation space to cope with multi-view features with high dimensions. It’s essential for researchers to construct meaningful dimension reduction (DR) methods which can find optimal subspaces to maintain some properties of original features into subspaces with lower dimensions. To address this problem, there are many effective DR methods have been proposed, which can be categorized into two divisions. The first category contains various linear DR methods, such as Principal Component Analysis (PCA) [17], Linear Discriminant Analysis (LDA) [18], Locality Preserving Projections (LPP) [19], etc [20, 21, 22]. PCA [17] and LDA are two most famous ones which can exploit global properties of data and maintain them into low-dimensional subspace. However, both of them care more about global Euclidean structure while neglecting local information between adjacent samples. To address this problem, Locality Preserving Projections (LPP) [19] and Neighborhood Preserving Embedding (NPE) [20] are proposed to maintain adjacent structure using different reconstruction tricks. Meanwhile, some sparse subspace learning methods [21] have also arose much attentions. Sparsity Preserving Projections (SPP) [21] is a well-known methods which exploits sparse correlations [23] between samples and preserves them into a sparse subspace. However, all DR methods mentioned above are linear DR ones which cannot deal with nonlinear datasets well. Locally Linear Embedding (LLE) [24] and Laplacian Eigenmaps (LE) [25] are two popular nonlinear dimension reduction techniques, which aim to obtain the representations with lower dimensions directly by preserving certain correlations between samples from nonlinear manifolds. Even though there are many DR methods which can deal with various circumstances, most of them can only utilize features from one single view but multiple views. They failed to integrate compatible and complementary information from multi-view features and construct optimal subspaces with lower dimensions, which is a hot research topic during the last decades.
The past decade has also witnessed the blossom of multi-view learning in many practical areas, such as clustering [26, 27, 28, 29, 30, 31], metric learning [32], etc [33, 27, 34, 35]. Xia et al. [28] proposed a multi-view spectral embedding (MSE) which fully encodes different features in different ways and finds a low-dimensional embedding wherein the distribution of each view is sufficiently smooth. Kumar et al. [27] develops a co-regularized scheme to combine features from multiple view together and obtain good results for spectral clustering. Meanwhile, they proposed an iterative procedure to deal with the co-regularized optimization problem. Xu et al. [29] proposed a novel method called Multi-view Self-Packed Learning (MSPL) which adopts a novel probabilistic smoothed weighting scheme for clustering. Multi-view discriminant analysis [36] is an extension of traditional LDA [18], which projects features from multiple views to one discriminative common subspace. However, as we all known, there is few multi-view DR method which directs at nonlinear datasets to find their low-dimensional representations.
In this paper, we proposed a multi-view nonlinear DR method named Co-regularized Multi-view Sparse Reconstruction Embedding (CMSRE) to deal with the problem of multi-view nonlinear dimension reduction. CMSRE first exploits the correlations of sparse reconstruction between features from each single view and obtains the initial low-dimensional representations. Then, it utilizes a co-regularized scheme to update the low-dimensional representations of all views. It seems to be worth mentioning that CMSRE considers the correlations of sparse reconstructions from all views while updating the low-dimensional representations of each single view. Therefore, the low-dimensional representation of each single view can fully utilize the information from the other views, which can improve the representational abilities of CMSRE. Finally, an iterative solving procedure is constructed to obtain optimal low-dimensional representations for CMSRE. And we summarized the contributions of this paper as follows:
We have carefully surveyed the filed of multi-view DR method and proposed a method named Co-regularized Multi-view Sparse Reconstruction Embedding (CMSRE) to fill the blank of this field.
- 2.
CMSRE fully considers correlations of sparse reconstruction from all views and maintains these correlations into low-dimensional subspaces for all views as much as possible.
- 3.
In order to update low-dimensional representations of features, we introduced a co-regularized scheme to integrate compatible and complementary information from all views.
The rest of paper is organized as follows: in Section II, we introduced some basic elements of multi-view learning and some typical related works. Section III described the construction procedure of our proposed CMSRE to deal with the problem of multi-view nonlinear DR problem. In Section IV, we have carried out various experiments on applications of document classification, face recognition [37] and image retrieval to demonstrate the effectiveness of our proposed CMSRE. Then, we added an experiment to show the convergence property of our proposed method. Finally, Section V made a conclusion of this paper.
2 Related works
In this section, we introduced some element backgrounds of multi-view learning in detail to help readers to improve the readability of this paper. Then, a multi-view embedding method called Multiview Spectral Embedding (MSE) [28] is illustrated, which has attracted widely attentions.
2.1 Background
As mentioned above, multi-view dimension reduction is a hot topic and has been started on in the past decade. Therefore, it’s essential to introduce the background of this field. For multi-view learning methods, multi-view features with samples having views can be represented as , where is a feature matrix which consists of features from the th view. is the dimension of features from th view. However, features from multiple views always locate in high-dimensional spaces with different dimensions, which cannot be utilized directly due to the high time and space complexities. Meanwhile, multi-view features contains integrate compatible and complementary information, which can improve the performances of traditional single-view approaches greatly. Therefore, the goal of our proposed methods is to construct an excellent architecture to consider features from multiple views simultaneously. As described, CMSRE aims to obtain low-dimensional representations as , where is the low-dimensional representations for the th view and is dimension of common subspace constructed by CMSRE.
2.2 Multiview Spectral Embedding (MSE)
Multiview Spectral Embedding (MSE) is a well-known multi-view DR method which has attracted widely attentions from researchers. MSE finds a low-dimensional embedding wherein the distribution of each view is sufficiently smooth, and MSE explores the complementary property of different views. It first builds a patch for each features on a view and performs the procedure of the single-view optimization to get the optimal low-dimensional embedding for each view. Finally, MSE proposed a method called global coordinate alignment to unify all low-dimensional embeddings from different patches as a whole one. Alternating optimization is utilized to obtain the solution for MSE. In order to finish its aim, MSE constructs the objective function as follows:
[TABLE]
Where is an unnormalized graph Laplacian matrix for the th view. It is the reflection of the neighborhood relationships between features in the th views. is a set of nonnegative weights and can reflect the importance of the th view played for MSE to obtain the low-dimensional representations. represents the trace of the matrix. MSE has verified that multi-view features contains more discriminative information and effectively explores the complementary property of different views to obtain an effective low-dimensional embedding for multi-view data sets. Meanwhile, authors argued that MSE achieve better performances facing with multimedia data but text data whose feature from different views are nearly independent.
3 Co-regularized Multi-view Sparse Reconstruction Embedding
In this section, we described our proposed CMSRE which finds a low-dimensional embedding over all views simultaneously. Given multi-view features with samples having views, , where is a feature matrix which consists of features from the th view. is the dimension of features from the th view. CMSRE aims to fully exploit the correlations of sparse correlations for all views and utilizes a co-regularized scheme to construct low-dimensional representations as , where is the low-dimensional representations for the th view and is dimension of common subspace constructed by CMSRE.
Based on the aforementioned statements, CMSRE first builds a patch for each sample on a view. Then, based on the patches from different views, the single-view optimization is performed for CMSRE to get the optimal low-dimension representations for each view. Afterward, a co-regularized scheme is adopted by CMSRE to unify multiple views and fully exploits information from multi-view features. Finally, the solution of CMSRE is obtained by using an iterative optimization procedure. And Fig.1 shows the working procedure of CMSRE.
3.1 Single-view Optimization
In this section, we introduced how we conducted single-view optimization for CMSRE. Single-view optimization can help CMSRE to obtain best low-representations for features from each single view. Firstly, we utilized sparse representation [38] to realize sparse reconstruction between each feature and its local neighbours as follows:
[TABLE]
Where is a subset of and contains the local neighbours of is the th feature from the th view. reflects the correlations of sparse reconstruction between and its neighbours. It contains most information of its location and correlations with its neighbours. is a vector of all ones and is a small constant. The solution in Eq.2 is a sparse vector which can reconstruct each feature using its neighbours from the same view.
For our proposed CMSRE, it’s essential to fully exploit correlations of sparse reconstruction and maintain them into the low-dimensional representations. Therefore, CMSRE construct the following single-view optimization function which can maintain the correlations of sparse reconstruction for each view as much as possible:
[TABLE]
Where is a subset of , which contains neighbours of . I is a unit matrix whose diagonal elements are all ones, zeros the other. Eq.3 aims to find low-dimensional representations for . In order to obtain a more general expression, we rewrite Eq.3 as follows:
[TABLE]
Where we utilized to replace . The elements in is contained in at the corresponding locations while the other elements are supplemented using zeros. In order to obtain a more compact expression, CMSRE can be reorganized as follows and the inference process can be found in Appendix I.
[TABLE]
Where and . is the low-dimensional representations for features from the th view. Because the smallest eigenvalue is close to [math], is formed by eigenvectors corresponding the smallest to eigenvalues. Because Eq.5 can calculate all optimal from each single view without multi-view coordination, it’s essential for us to develop a co-regularized scheme to integrate compatible and complementary information from multiple views.
3.2 Co-regularized Scheme
In section III-, we have shown how CMSRE conduct single-view optimization to obtain low-dimensional representations. However, single-view optimization only considers features from each single view and cannot integrate compatible and complementary information from multiple views. Even though low-dimensional representations preserve correlations of sparse reconstructions from each view, CMSRE fails to consider multi-view features from multiple views simultaneously. In this section, we illustrate the co-regularized scheme which is adopted by CMSRE to achieve multi-view learning. The co-regularized scheme provide a tool which can combine multiple kernels (or similarity matrices) from multiple views for multi-view dimension reduction problem. And we clearly show how we construct the co-regularized scheme for CMSRE in the following content.
For CMSRE, the eigenvector matrix contains low-dimensional representations for th view. In the co-regularized scheme, we encourage the pairwise similarities of low-dimensional representations (in terms of rows of ’s, ) be similar across all the views.
Firstly, we work with two-view case for the ease of explanation and then extend it to multi-view situation. Because features from multiple views are all descriptors of one same sample, low-dimensional representations from different views should be in line with consensus as much as possible. Therefore, we define the following cost function as a measure of disagreement between low-dimensional representations from two views:
[TABLE]
Where is the similarity matrix for and CMSRE defines the th row with the th column element of is , which reflects the correlations between the th and th features from the th view. Therefore, . is the Frobenius norms to make comparable across all views. In order to obtain a compact expression, Eq.6 can be further transfered as follows without considering the constant additive and scaling terms. And the inference process can be found in Appendix II :
[TABLE]
Because features from different views are all descriptors of one same sample, minimizing the disagreement aims to integrate compatible and complementary information from the th and views. And Due to Appendix II, it’s equivalent to minimize to finish this goal. In order to combine it with single-view optimization, a co-regularized scheme is constructed and we get the following joint minimization problem for two views:
[TABLE]
Where is a regularized parameter which balances the trade-off. Eq.8 simultaneously maintains sparse reconstructions from two views and minimizes disagreement between these two views. It can help features from two views to learn correlations of sparse reconstructions from each other to finish the cooperation for dimension reduction. And we always set in our experiments. Then, in order to extend Eq.8 to multiple views, we further give the optimization objective as follows:
[TABLE]
can be calculated using the optimization problem as Eq.9. And for each from the th view, it is constructed by considering the disagreements between the th view and the other views. Through the co-regularized scheme, low-dimensional representations from all views have fully utilized information from the other views, which leads CMSRE to be meaningful for multi-view features. Furthermore, we illustrate the optimization procedure of CMSRE in the next part.
3.3 Optimization Procedure of CMSRE
In this section, we introduce the optimization procedure of CMSRE in detail. It is clearly that CMSRE aims to obtain the low-dimensional representations for features from multiple views simultaneously. The optimization procedure should optimize the solutions of CMSRE at the same time. Firstly, we utilize Eq.5 to initialize which are trained without any information from other views. Then, in order to utilized Eq.9 to obtain for the th view, CMSRE maintain the low-dimensional representations from the other views to update individually. By replace using Eq.7, CMSRE update as follows:
[TABLE]
can be calculated using eigen decomposition once all the other low-dimensional representations are determined. The optimization procedure utilizes iterative eigen decomposition to update low-dimensional representations from different views separately to obtain the optimal solution for all views. In order to help readers to understand the solving procedure, we show the optimization procedure as 1.
4 Experiment
In order to show the excellent performance of our proposed method, we conduct various experiments(including document classification, image retrieval, etc.) on several benchmark multi-view datasets in this paper. All these experiments can verify that our proposed CMSRE achieves better performances in most situations.
4.1 Datasets and comparing methods
In our experiments, 6 datasets are utilized to show the excellent performances of CMSRE, including document datasets (such as, 3Sources111http://mlg.ucd.ie/datasets/3sources.html, Cora222http://lig-membres.imag.fr/grimal/data.html) and face datasets (Yale333http://cvc.yale.edu/projects/yalefaces/yalefaces.html), and image datasets (such as, Holidays444http://lear.inrialpes.fr/ jegou/data.php[39] and Corel-1K555https://sites.google.com/site/dctresearch/Home/content-based-image-retrieval). All document datasets are benchmark multi-view datasets. For those image datasets, we extracted features using multiple descriptors as multi-view features for our experiments. Some images from these image datasets are shown as Fig.2.
The performance of CMSRE is evaluated by comparing the following methods: 1. Pairwise[40], which is a multi-view spectral embedding methods using pairwise constraints. 2. Centroid[40], which utilizes centroid constraints for multi-view learning. 3. MSE[28]. 4. LE_BSV which is laplacian eigenmaps[25] with best view. 5. LLE_BSV is locally linear embedding[24] with best view. 6. PCA_BSV is principle component analysis with best view.
4.2 Document classification
In this section, in order to show the advantages CMSRE on document classification, we conducted 2 experiments on benchmark multi-view document datasets, including 3Sources and Cora.
3Sources was collected from 3 online news sources, BBC, Reuters and Guardian. Each source was treated as one view in this dataset. All samples have three views and are utilized to train all multi-view DR methods in our experiments. Then, among all 169 samples, 69 samples are randomly assigned as those ones which need to be classified. After all DR methods, 1NN classifier is adopted to show the classification results. And we conducted this experiment for 20 times to show the mean and max results as Table.2:
Cora consists of 2708 scientific publications which come from 7 classes. Each document is represented by content and cites views. Among all 2708 samples, 708 samples are randomly assigned as those ones which need to be classified. After all DR methods, 1NN classifier is adopted to show the classification results. And we conducted this experiment for 20 times to show the mean and max results as Table.3:
It can be found easily that our proposed CMSRE can achieve best performances to deal with these two document datasets. Meanwhile, experiment results show that performances of single-view methods are always lower than those multi-view learning ones, which can verify that multi-view learning is a valuable topic and need further researches.
4.3 Face Recognition
In this section, we carried on a face recognition experiment on Yale dataset to show the effectiveness of our method. There are 165 faces corresponding to 15 peoples in Yale datasets. We extract features using gray-scale intensity, local binary patterns [41] and edge direction histogram [42] as 3 views. Among all 165 faces, we randomly selected 65 faces as those ones to be recognized. After all DR methods, 1NN classifier is adopted to calculate the recognition results. This experiment has been conducted for 20 times and we show the mean and max recognition accuracies as Table.4.
This experiment can verify that CMSRE is a good multi-view dimension reduction method for face recognition. It can achieve good performances in most situations. Meanwhile, Centroid[40] is also a good method to deal with multi-view datasets.
4.4 Image Retrieval
In this section, we conducted 2 experiments on different image datasets (including Holidays and Corel-1k datasets) for image retrieval, which can verify the excellent performances of CMSRE.
For Holidays dataset, there are 1491 images corresponding to 500 categories, which are mainly captured for sceneries. Among all these images, 500 images are assigned as the query images while the other 991 are assigned as the corresponding relevant images. We utilized MSD [43], Gist [44] and HOC [45] to extract features as 3 views for all images. And all these methods were conducted to project all samples to a 50-dimensional subspace. We selected the best view for all these methods. Distance metric [46] is essential for image retrieval and utilized distance to measure similarities between samples. This experiment has been conducted 20 times and we show the mean results of precision, recall, mean average precision (MAP) and -Measure in Table 5.
It can be found that our proposed CMSRE is the best method to deal with multi-view features. Meanwhile, Pariwise [40] can also achieve good performance in most situations. And those single-view methods methods always perform poorly.
For Corel-1k dataset, there are 1000 images corresponding to 10 categories, which are collected just for image retrieval. For each category, we randomly select 10 images as query ones. We utilized the 3 features descriptors [43, 44, 45] to extract features. All the methods were conducted to project all samples to a 50-dimensional subspace. The procedure of this experiment is same with that one on Holidays. We conducted this experiment 20 times and show the results as Fig.3.
Through these 2 experiments for image retrieval, we can clearly find that our proposed CMSRE outperforms the other DR methods in most situations. CMSRE can integrate compatible and complementary information from multi-view features and obtain better subspaces from these views. And the first 4 multi-view methods outperform the other single-view methods, which can show multi-view learning is a valuable research field indeed.
4.5 Convergence Analysis of CMSRE
Because CMSRE utilized iterative optimization procedure to obtain its solution, we should show the convergence property of CMSRE in this section. Therefore, we designed an experiment on Yale dataset, which is according to TABLE 4. We still project features into a 30-dimensional subspace and show the objective values and recognition accuracies with the increase of iterations. It is obvious that the change variation trend of objective values can show the convergence properties of CMSRE as Fig.4:
It can be found that the objective values of CMSRE tend to be stable when 10 iterations finished. The recognition accuracies of Yale dataset is raising with the increase of iterations. Therefore, Fig.4 can verify that CMSRE converges once enough iterations are finished.
4.6 The Impact of the Regularized Parameter
In order to show the impact of the regularized parameter , we conducted another experiment to show the performances of CMSRE with different values of . The experiment is based on the former experiment on Yale dataset. The experimental settings are the same as before in section 4.3. This experiment utilizes CMSRE with different to project features into a 30-dimensional subspace. And we conducted this experiment for 20 times and showed the mean results as Table 6.
It can be found clearly that value of can influence the performance of CMSRE to some extent. And the performance of CMSRE is best when . Therefore, we set in our experiment. Meanwhile, CMSRE degenerates into single view method when and the performance of CMSRE is still better than LLE_BSV. It is because CMSRE can exploit the correlations of sparse reconstruction for these views.
5 Conclusion
In this paper, we proposed a novel method named Co-regularized Multi-view Sparse Reconstruction Embedding (CMSRE) to deal with features from multiple views. By exploiting correlations of sparse reconstruction from multiple views, CMSRE is able to learn local sparse structures of nonlinear manifolds from multiple views and constructs significative low-dimensional representations for them. Furthermore, in order to preserve correlations of sparse reconstructions from multiple views, CMSRE proposes a co-regularized scheme to integrate compatible and complementary information from these features to construct low-dimensional subspace directly. Due to the novel construction of CMSRE, it can be utilized to deal with multi-view features for dimension reduction. Various experiments have shown that our proposed CMSRE can achieve excellent performances.
Acknowledge
We would like to thank the anonymous reviewers for their valuable comments and suggestions to significantly improve the quality of this paper. This work was supported in part by the National Natural Science Foundation of China Grant 61370142 and Grant 61272368, by the Fundamental Research Funds for the Central Universities Grant 3132016352, by the Fundamental Research of Ministry of Transport of P. R. China Grant 2015329225300.
Appendix I
This appendix shows how to obtain Eq.5 from Eq.4 using matrix operations.
[TABLE]
Where and .
Acknowledge II
Because from different views are all orthogonal matrices, . Therefore, we can transfer Eq.6 as follows:
[TABLE]
Where is a unit matrix whose diagonal elements are all , [math] others. By ignoring the constant additive and scaling terms, we can utilize to represent and minimize it to shrink the disagreement between low-dimensional representations between the th view and the th view.
Reference
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Li, K. Chang, S. C. Hoi, Multiview semi-supervised learning with consensus, IEEE Transactions on Knowledge and Data Engineering 24 (11) (2012) 2040.
- 2[2] S. C. Hoi, R. Jin, P. Zhao, T. Yang, Online multiple kernel classification, Machine Learning 90 (2) (2013) 289–316.
- 3[3] D. Huang, C. Shan, M. Ardabilian, Y. Wang, L. Chen, Local binary patterns and its application to facial image analysis: a survey, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 41 (6) (2011) 765–781.
- 4[4] J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, Y. Gong, Locality-constrained linear coding for image classification, in: Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, IEEE, 2010, pp. 3360–3367.
- 5[5] N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, Vol. 1, IEEE, 2005, pp. 886–893.
- 6[6] X. Bi, C.-M. Pun, X.-C. Yuan, Multi-level dense descriptor and hierarchical feature matching for copy–move forgery detection, Information Sciences 345 (2016) 226–242.
- 7[7] J. Yang, Z. Cao, Q. Zhang, A fast and robust local descriptor for 3d point cloud registration, Information Sciences 346 (2016) 163–179.
- 8[8] S. C. Hoi, R. Jin, J. Zhu, M. R. Lyu, Batch mode active learning and its application to medical image classification, in: Proceedings of the 23rd international conference on Machine learning, ACM, 2006, pp. 417–424.
