Schema Independent Relational Learning
Jose Picado, Arash Termehchy, Alan Fern, Parisa Ataei

TL;DR
This paper addresses the problem of schema dependence in relational learning algorithms, introduces the concept of schema independence, and proposes a new algorithm, Castor, that achieves schema independence.
Contribution
It formalizes schema independence for relational learning, analyzes existing algorithms' dependence on schemas, and presents Castor, a schema-independent relational learning algorithm.
Findings
Current algorithms are generally schema dependent.
Schema transformations affect query complexity in query-based algorithms.
Castor achieves schema independence by leveraging data dependencies.
Abstract
Learning novel concepts and relations from relational databases is an important problem with many applications in database systems and machine learning. Relational learning algorithms learn the definition of a new relation in terms of existing relations in the database. Nevertheless, the same data set may be represented under different schemas for various reasons, such as efficiency, data quality, and usability. Unfortunately, the output of current relational learning algorithms tends to vary quite substantially over the choice of schema, both in terms of learning accuracy and efficiency. This variation complicates their off-the-shelf application. In this paper, we introduce and formalize the property of schema independence of relational learning algorithms, and study both the theoretical and empirical dependence of existing algorithms on the common class of (de) composition schema…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
