Factor Graph-based Interpretable Neural Networks
Yicong Li, Kuanjiu Zhou, Shuo Yu, Qiang Zhang, Renqiang Luo, Xiaodong, Li, Feng Xia

TL;DR
The paper introduces AGAIN, a factor graph-based neural network that generates comprehensible explanations under unknown perturbations by integrating logical rules and rectifying explanations during inference without retraining.
Contribution
The novel AGAIN model leverages factor graphs and logical rules to improve explanation comprehensibility under unknown perturbations without retraining.
Findings
Outperforms state-of-the-art baselines on three datasets
Effectively identifies and rectifies logical errors in explanations
Demonstrates strong correlation between factor graph and explanation quality
Abstract
Comprehensible neural network explanations are foundations for a better understanding of decisions, especially when the input data are infused with malicious perturbations. Existing solutions generally mitigate the impact of perturbations through adversarial training, yet they fail to generate comprehensible explanations under unknown perturbations. To address this challenge, we propose AGAIN, a fActor GrAph-based Interpretable neural Network, which is capable of generating comprehensible explanations under unknown perturbations. Instead of retraining like previous solutions, the proposed AGAIN directly integrates logical rules by which logical errors in explanations are identified and rectified during inference. Specifically, we construct the factor graph to express logical rules between explanations and categories. By treating logical rules as exogenous knowledge, AGAIN can identify…
Peer Reviews
Decision·ICLR 2025 Poster
- The proposed method enables both the detection of logical errors and the correction of these errors within a single framework. - Compared to other concept-based interpretable neural networks, the proposed method achieves higher comprehensiveness in explanations, regardless of whether the perturbations are known or unknown.
- While the proposed method assumes that explanations change due to perturbations without affecting predictions, this seems unrealistic. Particularly in interpretable neural networks with a concept bottleneck structure, as assumed in this study, changes in the concepts outputted by the neural network would naturally lead to changes in predictions, which undermines this assumption. - The proposed method requires predefined logic rules between concepts and categories. If these rules are already kn
- **Good presentation and structure** The paper is well-structured and easy to read. The mathematical definition of the proposed method is clear and well-defined. - **Nice experimental campaign**: Extensive experiments on three datasets (CUB, MIMIC-III EWS, and Synthetic-MNIST) demonstrate the superior performance of AGAIN over the compared baselines, although these results hold mainly for a metric (LSM) that has been created ad-hoc.
## Major Issues - **Novelty, Related work and Compared methods**: the main issue with the current paper is that it only considers methods injecting knowledge into the models by means of factor graphs. However, the integration of knowledge into model predictions has been fully studied under different point of views: probabilistic (e.g., DeepProblog [1] and all its variants), logic constraints (see the survey of Giunchiglia et al. [2]). Also, it is not the first method defending against adversari
1. The paper proposes a novel and interesting idea that uses the logical correctness of explanation to detect and defense against noises and adversaries. 2. The three stage framework is reasonable to me. 3. The examples used in the paper are intuitive.
1. The concept bottleneck model is not easy to scale, as it requires manual annotations of concepts. 2. The implementation details of Section 4.2 is not very clear (the introduction is too conceptual). For example, is $\mathcal{G}$ a graph or a model? At the beginning I thought $\mathcal{G}$ is a graph, but in Line 221 it "reasons about a conditional probability". 3. The datasets used in this work are not very strong. I doubt if the work is applicable to real-world situations. At least, the data
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
