VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis
Xinyuan Yan, Xiwei Xuan, Jorge Piazentin Ono, Jiajing Guo, Vikram, Mohanty, Shekar Arvind Kumar, Liang Gou, Bei Wang, Liu Ren

TL;DR
VISLIX is a visual analytics framework that leverages foundation models to automatically analyze and validate computer vision models by discovering and interpreting data slices without needing additional metadata.
Contribution
The paper introduces VISLIX, a novel framework that enables interactive, metadata-free data slice analysis for vision models using foundation models and natural language insights.
Findings
Effective in identifying data slices where models underperform
Provides natural language explanations of data slices
Supports hypothesis testing interactively
Abstract
Real-world machine learning models require rigorous evaluation before deployment, especially in safety-critical domains like autonomous driving and surveillance. The evaluation of machine learning models often focuses on data slices, which are subsets of the data that share a set of characteristics. Data slice finding automatically identifies conditions or data subgroups where models underperform, aiding developers in mitigating performance issues. Despite its popularity and effectiveness, data slicing for vision model validation faces several challenges. First, data slicing often needs additional image metadata or visual concepts, and falls short in certain computer vision tasks, such as object detection. Second, understanding data slices is a labor-intensive and mentally demanding process that heavily relies on the expert's domain knowledge. Third, data slicing lacks a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Semantic Web and Ontologies · Multimodal Machine Learning Applications
MethodsVisual Analytics · Sparse Evolutionary Training
