Large-Scale Kernel Methods for Independence Testing

Qinyi Zhang; Sarah Filippi; Arthur Gretton; Dino Sejdinovic

arXiv:1606.07892·stat.CO·June 11, 2018·Stat. Comput.

Large-Scale Kernel Methods for Independence Testing

Qinyi Zhang, Sarah Filippi, Arthur Gretton, Dino Sejdinovic

PDF

1 Repo

TL;DR

This paper explores large-scale kernel methods for independence testing, balancing computational efficiency and test accuracy, and demonstrates their effectiveness on synthetic datasets.

Contribution

It introduces and evaluates large-scale kernel approximation techniques like block-based, Nystrom, and Fourier features for independence testing.

Findings

01

Large-scale methods achieve comparable accuracy to traditional approaches.

02

Significantly reduced computation time and memory usage.

03

Effective in synthetic data experiments for independence testing.

Abstract

Representations of probability measures in reproducing kernel Hilbert spaces provide a flexible framework for fully nonparametric hypothesis tests of independence, which can capture any type of departure from independence, including nonlinear associations and multivariate interactions. However, these approaches come with an at least quadratic computational cost in the number of observations, which can be prohibitive in many applications. Arguably, it is exactly in such large-scale datasets that capturing any type of dependence is of interest, so striking a favourable tradeoff between computational efficiency and test performance for kernel independence tests would have a direct impact on their applicability in practice. In this contribution, we provide an extensive study of the use of large-scale kernel approximations in the context of independence testing, contrasting block-based,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/SIC
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.