Understanding Failures of Deep Networks via Robust Feature Extraction

Sahil Singla; Besmira Nushi; Shital Shah; Ece Kamar; Eric Horvitz

arXiv:2012.01750·cs.CV·June 15, 2021

Understanding Failures of Deep Networks via Robust Feature Extraction

Sahil Singla, Besmira Nushi, Shital Shah, Ece Kamar, Eric Horvitz

PDF

1 Repo

TL;DR

This paper presents a novel method for identifying and visualizing failure modes in deep networks by leveraging robust features, aiding understanding and debugging beyond traditional aggregate metrics.

Contribution

It introduces a feature-based failure analysis approach that does not rely on crowdsourced labels and includes visualization tools for interpretability.

Findings

01

Effective discovery of failure modes on ImageNet

02

Visualization aids human understanding of features

03

Insights assist engineers in debugging models

Abstract

Traditional evaluation metrics for learned models that report aggregate scores over a test set are insufficient for surfacing important and informative patterns of failure over features and instances. We introduce and study a method aimed at characterizing and explaining failures by identifying visual attributes whose presence or absence results in poor performance. In distinction to previous work that relies upon crowdsourced labels for visual attributes, we leverage the representation of a separate robust model to extract interpretable features and then harness these features to identify failure modes. We further propose a visualization method aimed at enabling humans to understand the meaning encoded in such features and we test the comprehensibility of the features. An evaluation of the methods on the ImageNet dataset demonstrates that: (i) the proposed workflow is effective for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

singlasahil14/barlow
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.