Probably Approximately Correct Explanations of Machine Learning Models   via Syntax-Guided Synthesis

Daniel Neider; Bishwamittra Ghosh

arXiv:2009.08770·cs.AI·September 21, 2020

Probably Approximately Correct Explanations of Machine Learning Models via Syntax-Guided Synthesis

Daniel Neider, Bishwamittra Ghosh

PDF

Open Access

TL;DR

This paper introduces a new method combining PAC learning and syntax-guided synthesis to generate high-probability, accurate, and interpretable explanations for complex machine learning models, especially neural networks.

Contribution

It presents a novel framework that leverages PAC and SyGuS to produce reliable and understandable explanations for deep learning models, advancing interpretability methods.

Findings

01

Produces explanations with high probability and few errors

02

Generates small, human-interpretable explanations

03

Effective in explaining complex models empirically

Abstract

We propose a novel approach to understanding the decision making of complex machine learning models (e.g., deep neural networks) using a combination of probably approximately correct learning (PAC) and a logic inference methodology called syntax-guided synthesis (SyGuS). We prove that our framework produces explanations that with a high probability make only few errors and show empirically that it is effective in generating small, human-interpretable explanations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Software Engineering Research · Adversarial Robustness in Machine Learning