Learning Class-Level Bayes Nets for Relational Data
Oliver Schulte, Hassan Khosravi, Flavia Moser, Martin Ester

TL;DR
This paper introduces efficient algorithms for learning class-level Bayes nets from relational data, enabling fast statistical inference over database attributes and links, which is useful for policy making and query optimization.
Contribution
The paper presents novel, faster algorithms for learning class-level relational Bayes nets that leverage single-table nonrelational Bayes net learners, improving scalability and efficiency.
Findings
Algorithms are computationally feasible for realistic data sizes.
Learned structures accurately represent database statistics.
Querying via Bayes nets is faster than SQL and database size independent.
Abstract
Many databases store data in relational format, with different types of entities and information about links between the entities. The field of statistical-relational learning (SRL) has developed a number of new statistical models for such data. In this paper we focus on learning class-level or first-order dependencies, which model the general database statistics over attributes of linked objects and links (e.g., the percentage of A grades given in computer science classes). Class-level statistical relationships are important in themselves, and they support applications like policy making, strategic planning, and query optimization. Most current SRL methods find class-level dependencies, but their main task is to support instance-level predictions about the attributes or links of specific entities. We focus only on class-level prediction, and describe algorithms for learning class-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Mining Algorithms and Applications · Imbalanced Data Classification Techniques
