Non-Parametric Inference of Relational Dependence
Ragib Ahsan, Zahra Fatemi, David Arbour, Elena Zheleva

TL;DR
This paper introduces a scalable, non-parametric kernel test for assessing independence in relational data, addressing the limitations of traditional i.i.d. assumptions in observational network datasets.
Contribution
It proposes a novel relational independence testing framework using kernel mean embeddings, extending independence testing to non-i.i.d. relational data.
Findings
Effective on synthetic and semi-synthetic networks
Outperforms existing kernel-based independence tests
Scalable to large relational datasets
Abstract
Independence testing plays a central role in statistical and causal inference from observational data. Standard independence tests assume that the data samples are independent and identically distributed (i.i.d.) but that assumption is violated in many real-world datasets and applications centered on relational systems. This work examines the problem of estimating independence in data drawn from relational systems by defining sufficient representations for the sets of observations influencing individual instances. Specifically, we define marginal and conditional independence tests for relational data by considering the kernel mean embedding as a flexible aggregation function for relational variables. We propose a consistent, non-parametric, scalable kernel test to operationalize the relational independence test for non-i.i.d. observational data under a set of structural assumptions. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Complex Network Analysis Techniques · Bioinformatics and Genomic Networks
MethodsTest
