Consistent Representation Learning for High Dimensional Data Analysis
Stan Z. Li, Lirong Wu, Zelin Zang

TL;DR
This paper introduces CRL, a neural network method that jointly performs dimensionality reduction, clustering, and visualization to ensure consistency among these tasks, improving data interpretation in high-dimensional analysis.
Contribution
The paper proposes a novel end-to-end neural network framework, CRL, that simultaneously achieves dimensionality reduction, clustering, and visualization with improved consistency across tasks.
Findings
CRL outperforms t-SNE and UMAP in evaluation metrics.
CRL provides more consistent clustering and visualization results.
The method introduces a new metric, CVI, for measuring task inconsistency.
Abstract
High dimensional data analysis for exploration and discovery includes three fundamental tasks: dimensionality reduction, clustering, and visualization. When the three associated tasks are done separately, as is often the case thus far, inconsistencies can occur among the tasks in terms of data geometry and others. This can lead to confusing or misleading data interpretation. In this paper, we propose a novel neural network-based method, called Consistent Representation Learning (CRL), to accomplish the three associated tasks end-to-end and improve the consistencies. The CRL network consists of two nonlinear dimensionality reduction (NLDR) transformations: (1) one from the input data space to the latent feature space for clustering, and (2) the other from the clustering space to the final 2D or 3D space for visualization. Importantly, the two NLDR transformations are performed to best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications · Gaussian Processes and Bayesian Inference
