Differentiable Rule Induction from Raw Sequence Inputs
Kun Gao, Katsumi Inoue, Yongzhi Cao, Hanpin Wang, Feng Yang

TL;DR
This paper introduces a novel self-supervised differentiable ILP approach that learns interpretable rules directly from raw sequence data, overcoming label leakage issues and enhancing rule learning from complex data types.
Contribution
It presents a new integrated model combining self-supervised clustering with differentiable ILP to enable rule induction from raw data without explicit labels.
Findings
Successfully learns generalized rules from time series data.
Effectively induces rules from image data.
Addresses label leakage in differentiable ILP.
Abstract
Rule learning-based models are widely used in highly interpretable scenarios due to their transparent structures. Inductive logic programming (ILP), a form of machine learning, induces rules from facts while maintaining interpretability. Differentiable ILP models enhance this process by leveraging neural networks to improve robustness and scalability. However, most differentiable ILP methods rely on symbolic datasets, facing challenges when learning directly from raw data. Specifically, they struggle with explicit label leakage: The inability to map continuous inputs to symbolic variables without explicit supervision of input feature labels. In this work, we address this issue by integrating a self-supervised differentiable clustering model with a novel differentiable ILP model, enabling rule learning from raw data without explicit label leakage. The learned rules effectively describe…
Peer Reviews
Decision·ICLR 2025 Poster
Originality: This provides a new and instructive implementation to address the raw-to-symbolic data problem, while avoiding the data leakage issue. Quality and Clarity: The paper explains and illustrates the method in a clear and compelling way. Significance: This appears to be an important contribution to the problem of addressing the problem of learning symbolic rules about raw sequential data.
The approach may require significant data invariance (orientation of images, sampling regularity). Many important datasets have these properties, although image data often does not. See questions for further clarification. As a demonstration of the advantage of this approach as an interpretable and explainable method, it would be helpful to have the interpretation of rules on each dataset from UCR discussed in more detail. How interpretable are the rules in terms of the data patterns? Are t
This paper tackles an important topic in the field of differentiable ILP, specifically the need for pre-trained models to ground raw input into symbols for effective rule learning. The approach of combining differentiable k-means and VAE with gradient-based rule learners is noteworthy, even though the concept is relatively straightforward. The manuscript is well-written, with a high-quality presentation overall. It clearly outlines the research question and effectively conveys its core idea. T
My primary concern is that it remains unclear whether the main claim is adequately supported by the methods and experiments sections. The paper asserts: > (Line 42-45) Learning logic programs from raw data is hindered by the label leakage problem common in neuro-symbolic research (Topan et al., 2021): This leakage occurs when labels of ground objects are introduced for inducing rules (Evans & Grefenstette, 2018; Shindo et al., 2023). and > (Line 82-85) In our work, we do not require a pre-tra
1. The paper is overall clear **Clarity**. However the text should be polished to rewrite/rearrange words in some sentences and correct the several typos present throughout. Please refer the MINOR paragraph below for a non-exhaustive list. 2. The considered problem is relevant and timely **Relevance**. 3. Code is available, but no further check has been performed **Code availability**. MINOR \ L.109 -> $\neg\alpha_2$ \ L.129 -> a substitution or an interpretation \ L.151 -> is ordered real-valu
1. The paper lacks adequate context within the existing neuro-symbolic learning literature, making its contribution appear incremental in terms of novelty **Quality**. For example, neuro-symbolic (NeSy) autoencoders have been previously explored (see [1,2]), and differentiable clustering has also been investigated (refer to [3,4]). Additionally, the rule learner shows a strong resemblance to $\delta$ILP. At a minimum, a discussion of related work should be included to clarify the paper's novelty
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Data Mining Algorithms and Applications · Machine Learning and Data Classification
