VC-Dimension Based Generalization Bounds for Relational Learning

Ondrej Kuzelka; Yuyi Wang; Steven Schockaert

arXiv:1804.06188·cs.LG·July 5, 2018

VC-Dimension Based Generalization Bounds for Relational Learning

Ondrej Kuzelka, Yuyi Wang, Steven Schockaert

PDF

TL;DR

This paper develops VC-dimension based generalization bounds specifically tailored for relational learning scenarios where data is sampled uniformly without replacement from an unknown larger structure, providing theoretical error bounds.

Contribution

It introduces a novel VC-dimension based bound applicable to relational data under specific sampling assumptions, advancing theoretical understanding in relational learning.

Findings

01

Derived a VC-dimension based error bound for relational models

02

Applicable to scenarios with uniform sampling without replacement

03

Provides theoretical guarantees for relational learning accuracy

Abstract

In many applications of relational learning, the available data can be seen as a sample from a larger relational structure (e.g. we may be given a small fragment from some social network). In this paper we are particularly concerned with scenarios in which we can assume that (i) the domain elements appearing in the given sample have been uniformly sampled without replacement from the (unknown) full domain and (ii) the sample is complete for these domain elements (i.e. it is the full substructure induced by these elements). Within this setting, we study bounds on the error of sufficient statistics of relational models that are estimated on the available data. As our main result, we prove a bound based on a variant of the Vapnik-Chervonenkis dimension which is suitable for relational data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.