VISPUR: Visual Aids for Identifying and Interpreting Spurious Associations in Data-Driven Decisions
Xian Teng, Yongsu Ahn, Yu-Ru Lin

TL;DR
VISPUR is a visual analytic system designed to help humans identify, interpret, and prevent spurious associations in data-driven decisions by providing causal analysis tools and visualizations.
Contribution
It introduces a comprehensive visual analytic framework with dashboards and views to detect confounders, visualize subgroup patterns, and reason about paradoxes in causal data analysis.
Findings
Effective in helping users identify spurious associations
Improves understanding of confounding factors and subgroup effects
Enhances accountable decision-making in data analysis
Abstract
Big data and machine learning tools have jointly empowered humans in making data-driven decisions. However, many of them capture empirical associations that might be spurious due to confounding factors and subgroup heterogeneity. The famous Simpson's paradox is such a phenomenon where aggregated and subgroup-level associations contradict with each other, causing cognitive confusions and difficulty in making adequate interpretations and decisions. Existing tools provide little insights for humans to locate, reason about, and prevent pitfalls of spurious association in practice. We propose VISPUR, a visual analytic system that provides a causal analysis framework and a human-centric workflow for tackling spurious associations. These include a CONFOUNDER DASHBOARD, which can automatically identify possible confounding factors, and a SUBGROUP VIEWER, which allows for the visualization and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Explainable Artificial Intelligence (XAI) · Big Data and Business Intelligence
