Answer-Set Programs for Reasoning about Counterfactual Interventions and Responsibility Scores for Classification

Leopoldo Bertossi; Gabriela Reyes

arXiv:2107.10159 · cs.AI · September 3, 2021

TL;DR

This paper uses answer-set programming to model counterfactual interventions on classified entities and to compute responsibility scores, giving attribution-based explanations of classification outcomes that can take domain knowledge into account.

Contribution

The paper presents a novel declarative approach, based on answer-set programs, for reasoning about counterfactuals and responsibility scores in classification, with support for incorporating domain knowledge.

Findings

01 Effective modeling of counterfactual interventions

02 Ability to compute responsibility scores for explanations

03 Supports domain knowledge and query answering

Abstract

We describe how answer-set programs can be used to declaratively specify counterfactual interventions on entities under classification, and reason about them. In particular, they can be used to define and compute responsibility scores as attribution-based explanations for outcomes from classification models. The approach allows for the inclusion of domain knowledge and supports query answering. A detailed example with a naive-Bayes classifier is presented.
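
To make the idea of a responsibility score concrete, here is a minimal sketch in plain Python. It is not the authors' answer-set encoding and does not use a naive-Bayes classifier: it brute-forces counterfactual interventions on a toy rule-based classifier. It assumes the causality-style definition in which a feature's score is 1 / (1 + |G|), where G is a smallest contingency set of other features whose change alone keeps the label but, combined with a change to the feature, flips it. The feature names, domains, and the `classify` rule are all hypothetical.

```python
from itertools import combinations, product

# Hypothetical feature domains; all names here are illustrative assumptions.
DOMAINS = {"a": (0, 1), "b": (0, 1), "c": (0, 1)}


def classify(entity):
    """Toy stand-in classifier: label 1 when at least two features are 1."""
    return 1 if sum(entity.values()) >= 2 else 0


def alternatives(entity, features):
    """Yield copies of `entity` where every feature in `features` gets a new value."""
    choices = [[v for v in DOMAINS[f] if v != entity[f]] for f in features]
    for values in product(*choices):
        changed = dict(entity)
        changed.update(zip(features, values))
        yield changed


def responsibility(entity, feature):
    """Return 1 / (1 + |G|) for a smallest contingency set G, or 0.0 if none exists."""
    label = classify(entity)
    others = [f for f in DOMAINS if f != feature]
    # Try contingency sets in order of increasing size, so the first hit is minimal.
    for size in range(len(others) + 1):
        for contingency in combinations(others, size):
            for intervened in alternatives(entity, contingency):
                if classify(intervened) != label:
                    continue  # changing the contingency alone must keep the label
                # Additionally changing `feature` must flip the label.
                if any(classify(e) != label
                       for e in alternatives(intervened, (feature,))):
                    return 1.0 / (1 + size)
    return 0.0


entity = {"a": 1, "b": 1, "c": 1}
scores = {f: responsibility(entity, f) for f in DOMAINS}
```

For the entity above, no single change flips the label, but any two changes do, so every feature needs a contingency set of size one. The exhaustive search is exponential in the number of features; a declarative specification such as the one the abstract describes leaves that search to the solver.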

Taxonomy

Topics: Logic, Reasoning, and Knowledge · Bayesian Modeling and Causal Inference · Topic Modeling