Induction of Decision Trees based on Generalized Graph Queries
Pedro Almagro-Blanco, Fernando Sancho-Caparrini

TL;DR
This paper introduces GGQ-ID3, a novel multi-relational decision tree algorithm that employs Generalized Graph Queries to incorporate complex, structured, and cyclic patterns in relational data for improved learning.
Contribution
The paper presents GGQ-ID3, a new decision tree method using dynamically built Generalized Graph Queries for multi-relational data, enabling complex pattern evaluation and structure-aware learning.
Findings
Successfully applied to real-world multi-relational datasets.
Generated interpretable classification trees with semantic patterns.
Maintained manageable complexity in multi-relational learning.
Abstract
Usually, decision tree induction algorithms are limited to work with non relational data. Given a record, they do not take into account other objects attributes even though they can provide valuable information for the learning task. In this paper we present GGQ-ID3, a multi-relational decision tree learning algorithm that uses Generalized Graph Queries (GGQ) as predicates in the decision nodes. GGQs allow to express complex patterns (including cycles) and they can be refined step-by-step. Also, they can evaluate structures (not only single records) and perform Regular Pattern Matching. GGQ are built dynamically (pattern mining) during the GGQ-ID3 tree construction process. We will show how to use GGQ-ID3 to perform multi-relational machine learning keeping complexity under control. Finally, some real examples of automatically obtained classification trees and semantic patterns are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Imbalanced Data Classification Techniques
