Semantic Reasoning from Model-Agnostic Explanations
Timen Stepi\v{s}nik Perdih, Nada Lavra\v{c}, Bla\v{z} \v{S}krlj

TL;DR
ReEx is a novel method that enhances black-box model explanations by translating feature-level explanations into semantically meaningful descriptions using ontologies, improving interpretability especially in biological contexts.
Contribution
ReEx introduces a way to generalize and semantically interpret explanations from tools like SHAP using ontologies, bridging the gap between raw explanations and human-understandable knowledge.
Findings
ReEx produces more informative, compact explanations.
Semantic explanations outperform direct ontology mappings.
Applicable to biological datasets with improved interpretability.
Abstract
With the wide adoption of black-box models, instance-based \emph{post hoc} explanation tools, such as LIME and SHAP became increasingly popular. These tools produce explanations, pinpointing contributions of key features associated with a given prediction. However, the obtained explanations remain at the raw feature level and are not necessarily understandable by a human expert without extensive domain knowledge. We propose ReEx (Reasoning with Explanations), a method applicable to explanations generated by arbitrary instance-level explainers, such as SHAP. By using background knowledge in the form of ontologies, ReEx generalizes instance explanations in a least general generalization-like manner. The resulting symbolic descriptions are specific for individual classes and offer generalizations based on the explainer's output. The derived semantic explanations are potentially more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
