Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions

Sujan Sai Gannamaneni; Rohil Prakash Rao; Michael Mock; Maram Akila; Stefan Wrobel

arXiv:2502.12360 · cs.CV · March 7, 2025

Open Access

TL;DR

This paper introduces a new method combining foundation models and combinatorial search to identify human-understandable weaknesses in vision models, validated on synthetic and real datasets.

Contribution

The paper presents a novel algorithm that leverages foundation models for zero-shot classification to find systematic weaknesses aligned with human-understandable dimensions in image data.
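The metadata-generation step can be illustrated with a minimal sketch of zero-shot labeling, assuming image and text-prompt embeddings have already been produced by some foundation model. The 3-dimensional vectors, the `time_of_day` dimension, and the helper names `cosine` and `zero_shot_label` are all hypothetical and chosen for illustration; they are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def zero_shot_label(image_emb, prompt_embs):
    """Pick the attribute value whose prompt embedding is closest to the image."""
    return max(prompt_embs, key=lambda value: cosine(image_emb, prompt_embs[value]))


# Toy embeddings (made up); a real pipeline would obtain them from a
# vision-language model by encoding prompts such as "a photo taken at night".
prompt_embs = {"day": [1.0, 0.1, 0.0], "night": [0.0, 0.2, 1.0]}
images = {"img_001": [0.9, 0.2, 0.1], "img_002": [0.1, 0.1, 0.8]}

# One metadata entry per image along the hypothetical "time_of_day" dimension.
metadata = {
    name: {"time_of_day": zero_shot_label(emb, prompt_embs)}
    for name, emb in images.items()
}
print(metadata)
```

Because the labels come from a similarity ranking rather than ground truth, some of them will be wrong, which is the metadata noise the findings refer to.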

Findings

01 Successfully identifies weaknesses in state-of-the-art vision models
02 Effective on both synthetic and real-world datasets
03 Addresses noise in semantic metadata

Abstract

Slice discovery methods (SDMs) are prominent algorithms for finding systematic weaknesses in DNNs. They identify the top-k semantically coherent slices (subsets) of data on which a DNN-under-test performs poorly. To be directly useful, slices should align with human-understandable and relevant dimensions, which are defined, for example, by safety and domain experts as part of the operational design domain (ODD). While SDMs can be applied effectively to structured data, their application to image data is complicated by the lack of semantic metadata. To address these issues, we present an algorithm that combines foundation models for zero-shot image classification, used to generate semantic metadata, with methods for combinatorial search to find systematic weaknesses in images. In contrast to existing approaches, ours identifies weak slices that are in line with pre-defined…
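The idea of searching metadata for top-k weak slices can be sketched as follows. This is a minimal illustration that exhaustively enumerates attribute combinations on a toy table; it is not the paper's search procedure, and the function `find_weak_slices`, its parameters, and the attributes `time_of_day` and `occlusion` are assumptions made for the example.

```python
from itertools import combinations


def find_weak_slices(records, attributes, k=3, max_order=2, min_support=2):
    """Return the k slices with the highest error rate.

    A slice is a combination of attribute values, e.g. night AND heavy occlusion.
    Each record is a dict with metadata values plus a boolean "correct" flag.
    """
    slices = []
    for order in range(1, max_order + 1):
        for attrs in combinations(attributes, order):
            # Group per-sample outcomes by the values of the chosen attributes.
            groups = {}
            for rec in records:
                key = tuple(rec[a] for a in attrs)
                groups.setdefault(key, []).append(rec["correct"])
            for values, outcomes in groups.items():
                if len(outcomes) < min_support:
                    continue  # too few samples to call the weakness systematic
                error = 1 - sum(outcomes) / len(outcomes)
                slices.append((error, len(outcomes), dict(zip(attrs, values))))
    # Highest error first; larger slices win ties.
    slices.sort(key=lambda s: (-s[0], -s[1]))
    return slices[:k]


# Toy per-image results of a hypothetical DNN-under-test.
rows = [
    ("day", "none", True), ("day", "none", True),
    ("day", "heavy", True), ("day", "heavy", False),
    ("night", "none", True), ("night", "none", False),
    ("night", "heavy", False), ("night", "heavy", False),
]
records = [{"time_of_day": t, "occlusion": o, "correct": c} for t, o, c in rows]

top = find_weak_slices(records, ["time_of_day", "occlusion"])
for error, support, description in top:
    print(f"error={error:.2f} n={support} slice={description}")
```

Exhaustive enumeration grows combinatorially with the number of attributes and values, so a practical search would prune or bound the combinations it visits.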

Taxonomy

Topics: Industrial Vision Systems and Defect Detection · Advanced Vision and Imaging