Supervised Visualization for Data Exploration
Jake S. Rhodes, Adele Cutler, Guy Wolf, Kevin R. Moon

TL;DR
This paper introduces a new supervised visualization method using random forest proximities and diffusion techniques, which effectively preserves data structure and is robust to noise and parameter tuning, enhancing data exploration.
Contribution
The paper presents a novel supervised visualization approach that improves data structure retention and robustness over existing methods by leveraging random forest proximities and diffusion-based dimensionality reduction.
Findings
Better preservation of local and global data structures.
Robustness to noise and parameter tuning.
Effective variable importance highlighting.
Abstract
Dimensionality reduction is often used as an initial step in data exploration, either as preprocessing for classification or regression or for visualization. Most dimensionality reduction techniques to date are unsupervised; they do not take class labels into account (e.g., PCA, MDS, t-SNE, Isomap). Such methods require large amounts of data and are often sensitive to noise that may obfuscate important patterns in the data. Various attempts at supervised dimensionality reduction methods that take into account auxiliary annotations (e.g., class labels) have been successfully implemented with goals of increased classification accuracy or improved data visualization. Many of these supervised techniques incorporate labels in the loss function in the form of similarity or dissimilarity matrices, thereby creating over-emphasized separation between class clusters, which does not realistically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Clustering Algorithms Research · Face and Expression Recognition
MethodsPrincipal Components Analysis
