TL;DR
This paper introduces a versatile, model-agnostic framework for generating input exemplars to explain black-box AI models across various data types using generative models and evolutionary strategies.
Contribution
It presents a novel, generic, and model-agnostic exemplar synthesis method that works with any generator and data type without needing internal model details.
Findings
Effective across image, text, and tabular data formats.
Comparable exemplar quality to gradient-based methods with faster computation.
Works with any black-box model without access to internal parameters.
Abstract
With the growing complexity of deep learning methods adopted in practical applications, there is an increasing and stringent need to explain and interpret the decisions of such methods. In this work, we focus on explainable AI and propose a novel generic and model-agnostic framework for synthesizing input exemplars that maximize a desired response from a machine learning model. To this end, we use a generative model, which acts as a prior for generating data, and traverse its latent space using a novel evolutionary strategy with momentum updates. Our framework is generic because (i) it can employ any underlying generator, e.g. Variational Auto-Encoders (VAEs) or Generative Adversarial Networks (GANs), and (ii) it can be applied to any input data, e.g. images, text samples or tabular data. Since we use a zero-order optimization method, our framework is model-agnostic, in the sense that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
