TL;DR
This paper introduces a method for revising neuro-symbolic models using semantic explanations at the object level, enabling better identification of confounders and targeted model interventions, demonstrated on a new complex scene dataset.
Contribution
It proposes a novel semantic explanation approach for neuro-symbolic models and a new dataset, enabling effective model revision by interacting with explanations at the concept level.
Findings
Semantic explanations identify confounders not seen in visual explanations
Model revision is effective when based on semantic feedback
The approach improves understanding and control of model behavior
Abstract
Most explanation methods in deep learning map importance estimates for a model's prediction back to the original input space. These "visual" explanations are often insufficient, as the model's actual concept remains elusive. Moreover, without insights into the model's semantic concept, it is difficult -- if not impossible -- to intervene on the model's behavior via its explanations, called Explanatory Interactive Learning. Consequently, we propose to intervene on a Neuro-Symbolic scene representation, which allows one to revise the model on the semantic level, e.g. "never focus on the color to make your decision". We compiled a novel confounded visual scene data set, the CLEVR-Hans data set, capturing complex compositions of different objects. The results of our experiments on CLEVR-Hans demonstrate that our semantic explanations, i.e. compositional explanations at a per-object level,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
