Exploration of Heterogeneous Data Using Robust Similarity

Mahsa Mirzargar; Ross T. Whitaker; Robert M. Kirby

arXiv:1710.02862·cs.GR·October 10, 2017

Exploration of Heterogeneous Data Using Robust Similarity

Mahsa Mirzargar, Ross T. Whitaker, Robert M. Kirby

PDF

Open Access

TL;DR

This paper introduces a generic similarity-based exploration method for heterogeneous data that enhances visualization and analysis by revealing subtle structures, outliers, and modes across diverse data types.

Contribution

It proposes a novel similarity measure applicable to various data types and a visual encoding framework for multi-level exploration of heterogeneous datasets.

Findings

01

Effective detection of multiple modes and outliers.

02

Applicable to diverse heterogeneous datasets including ensembles.

03

Enhances data exploration and visualization capabilities.

Abstract

Heterogeneous data pose serious challenges to data analysis tasks, including exploration and visualization. Current techniques often utilize dimensionality reductions, aggregation, or conversion to numerical values to analyze heterogeneous data. However, the effectiveness of such techniques to find subtle structures such as the presence of multiple modes or detection of outliers is hindered by the challenge to find the proper subspaces or prior knowledge to reveal the structures. In this paper, we propose a generic similarity-based exploration technique that is applicable to a wide variety of datatypes and their combinations, including heterogeneous ensembles. The proposed concept of similarity has a close connection to statistical analysis and can be deployed for summarization, revealing fine structures such as the presence of multiple modes, and detection of anomalies or outliers. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Anomaly Detection Techniques and Applications · Video Analysis and Summarization