Comparing Neural Network Encodings for Logic-based Explainability
Levi Cordeiro Carvalho, Saulo A. F. Oliveira, Thiago Alves Rocha

TL;DR
This paper compares two logical encodings of neural networks for explainability, demonstrating that a more efficient encoding can improve scalability and performance in generating logic-based explanations.
Contribution
It introduces an adapted, more efficient encoding for neural networks in logic-based explainability and empirically compares it to existing methods.
Findings
The adapted encoding uses fewer variables and constraints.
Both encodings have similar explanation computation times.
The adapted encoding improves overall efficiency by up to 16%.
Abstract
Providing explanations for the outputs of artificial neural networks (ANNs) is crucial in many contexts, such as critical systems, data protection laws and handling adversarial examples. Logic-based methods can offer explanations with correctness guarantees, but face scalability challenges. Due to these issues, it is necessary to compare different encodings of ANNs into logical constraints, which are used in logic-based explainability. This work compares two encodings of ANNs: one has been used in the literature to provide explanations, while the other will be adapted for our context of explainability. Additionally, the second encoding uses fewer variables and constraints, thus, potentially enhancing efficiency. Experiments showed similar running times for computing explanations, but the adapted encoding performed up to 18\% better in building logical constraints and up to 16\% better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
