VISLIX: An XAI Framework for Validating Vision Models with Slice   Discovery and Analysis

Xinyuan Yan; Xiwei Xuan; Jorge Piazentin Ono; Jiajing Guo; Vikram; Mohanty; Shekar Arvind Kumar; Liang Gou; Bei Wang; Liu Ren

arXiv:2505.03132·cs.CV·May 7, 2025

VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis

Xinyuan Yan, Xiwei Xuan, Jorge Piazentin Ono, Jiajing Guo, Vikram, Mohanty, Shekar Arvind Kumar, Liang Gou, Bei Wang, Liu Ren

PDF

Open Access

TL;DR

VISLIX is a visual analytics framework that leverages foundation models to automatically analyze and validate computer vision models by discovering and interpreting data slices without needing additional metadata.

Contribution

The paper introduces VISLIX, a novel framework that enables interactive, metadata-free data slice analysis for vision models using foundation models and natural language insights.

Findings

01

Effective in identifying data slices where models underperform

02

Provides natural language explanations of data slices

03

Supports hypothesis testing interactively

Abstract

Real-world machine learning models require rigorous evaluation before deployment, especially in safety-critical domains like autonomous driving and surveillance. The evaluation of machine learning models often focuses on data slices, which are subsets of the data that share a set of characteristics. Data slice finding automatically identifies conditions or data subgroups where models underperform, aiding developers in mitigating performance issues. Despite its popularity and effectiveness, data slicing for vision model validation faces several challenges. First, data slicing often needs additional image metadata or visual concepts, and falls short in certain computer vision tasks, such as object detection. Second, understanding data slices is a labor-intensive and mentally demanding process that heavily relies on the expert's domain knowledge. Third, data slicing lacks a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Semantic Web and Ontologies · Multimodal Machine Learning Applications

MethodsVisual Analytics · Sparse Evolutionary Training