ReX: A Framework for Incorporating Temporal Information in Model-Agnostic Local Explanation Techniques
Junhao Liu, Xin Zhang

TL;DR
ReX is a framework that enhances local explanation techniques for machine learning models with variable-length inputs by incorporating temporal information, leading to more accurate and understandable explanations.
Contribution
ReX introduces a novel way to include temporal data in model-agnostic explanations by modifying sampling and features, applicable to multiple explanation methods.
Findings
Significantly improves explanation fidelity.
Outperforms model-specific techniques on target models.
Enhances user understanding of model behavior.
Abstract
Existing local model-agnostic explanation techniques are ineffective for machine learning models that consider inputs of variable lengths, as they do not consider temporal information embedded in these models. To address this limitation, we propose \textsc{ReX}, a general framework for incorporating temporal information in these techniques. Our key insight is that these techniques typically learn a model surrogate by sampling model inputs and outputs, and we can incorporate temporal information in a uniform way by only changing the sampling process and the surrogate features. We instantiate our approach on three popular explanation techniques: Anchors, LIME, and Kernel SHAP. To evaluate the effectiveness of \textsc{ReX}, we apply our approach to six models in three different tasks. Our evaluation results demonstrate that our approach 1) significantly improves the fidelity of…
Peer Reviews
Decision·Submitted to ICLR 2024
1. The authors' predicate definitions (Def. 3.1 and 3.2) are intuitive to interpret (e.g., the 2D predicates can be used to specify the number of tokens between two particular tokens in a sentence). The authors illustrate the potential utility of attributing importance to such predicates rather than individual features in Figure 1. 2. The authors' experimental results (Table 2 and Figure 4) demonstrate the potential value of REX. The explanations provided by REX have higher coverage, precision,
* **Weakness #1: Clarity.** My primary critique of this work is that I found the present draft difficult to understand. The notation used was not sufficiently explained, and I found the authors' descriptions of their methodology and experiments to be severely underspecified. Unless these details are clarified, I do not believe this draft is ready to be published in its present state. Specifically: * Section 2 (Notation). The description that you provided in the second paragraph is confusing,
Overall, the idea of incorporating temporal information using position/distance between features is simple and novel. Temporal information is useful in general and it does help in minimizing the ambiguity in existing explanations as shown by authors for text and time series examples. The fact that existing explanation methods can be extended simply by using an alternate perturbation approach as claimed by authors, is useful.
- Empirical evaluation can include more models and datasets. - The user study could be extended in size and diversity of users. Although positional/distance between features is useful, in case of timeseries the shape of a curve (e.g. shaplets) provides richer information and more useful information to SMEs. It might be good to present use cases where SMEs value the distance information in specific domains.
1. This paper addresses an important research question concerning the impact of temporal relationships between features in the input on the quality of explanations for model decisions. The proposed explanation method is versatile enough to handle varying input lengths, an advantage over traditional model-agnostic explanation techniques. The examples provided offer compelling motivation for the algorithm's design. 2. The experimental evaluation of REX is thorough, covering two distinct tasks: se
1. Algorithm for "Extending Vocabularies": The paper would benefit from a more detailed explanation of how 1-D and 2-D predicates are extracted from the model inputs, as this is a core component of REX. Additional analyses on the algorithm for extending vocabularies would be beneficial. Specifically, do all 1-D and 2-D predicates positively impact explanations? Including more examples beyond those discussed in Section 3.1 could strengthen the paper's argument for the advantages of REX. Additiona
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Scientific Computing and Data Management
MethodsLocal Interpretable Model-Agnostic Explanations
