Exploration of Heterogeneous Data Using Robust Similarity
Mahsa Mirzargar, Ross T. Whitaker, Robert M. Kirby

TL;DR
This paper introduces a generic similarity-based exploration method for heterogeneous data that enhances visualization and analysis by revealing subtle structures, outliers, and modes across diverse data types.
Contribution
It proposes a novel similarity measure applicable to various data types and a visual encoding framework for multi-level exploration of heterogeneous datasets.
Findings
Effective detection of multiple modes and outliers.
Applicable to diverse heterogeneous datasets including ensembles.
Enhances data exploration and visualization capabilities.
Abstract
Heterogeneous data pose serious challenges to data analysis tasks, including exploration and visualization. Current techniques often utilize dimensionality reductions, aggregation, or conversion to numerical values to analyze heterogeneous data. However, the effectiveness of such techniques to find subtle structures such as the presence of multiple modes or detection of outliers is hindered by the challenge to find the proper subspaces or prior knowledge to reveal the structures. In this paper, we propose a generic similarity-based exploration technique that is applicable to a wide variety of datatypes and their combinations, including heterogeneous ensembles. The proposed concept of similarity has a close connection to statistical analysis and can be deployed for summarization, revealing fine structures such as the presence of multiple modes, and detection of anomalies or outliers. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Anomaly Detection Techniques and Applications · Video Analysis and Summarization
