Training Normalizing Flows from Dependent Data

Matthias Kirchler; Christoph Lippert; Marius Kloft

arXiv:2209.14933·cs.LG·May 31, 2023

Training Normalizing Flows from Dependent Data

Matthias Kirchler, Christoph Lippert, Marius Kloft

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new likelihood-based training method for normalizing flows that accounts for dependencies between data points, improving density estimation and data generation in dependent data scenarios.

Contribution

It proposes a novel likelihood objective and learning algorithm for normalizing flows that explicitly models dependencies among data points, addressing a key limitation of existing methods.

Findings

01

Improved empirical results on synthetic and real-world data.

02

Enhanced statistical power in genome-wide association studies.

03

Better density estimation when data dependencies are modeled.

Abstract

Normalizing flows are powerful non-parametric statistical models that function as a hybrid between density estimators and generative models. Current learning algorithms for normalizing flows assume that data points are sampled independently, an assumption that is frequently violated in practice, which may lead to erroneous density estimation and data generation. We propose a likelihood objective of normalizing flows incorporating dependencies between the data points, for which we derive a flexible and efficient learning algorithm suitable for different dependency structures. We show that respecting dependencies between observations can improve empirical results on both synthetic and real-world data, and leads to higher statistical power in a downstream application to genome-wide association studies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mkirchler/dependent_data_flows
pytorchOfficial

Videos

Training Normalizing Flows from Dependent Data· slideslive

Taxonomy

TopicsGenetic Associations and Epidemiology · Genetic and phenotypic traits in livestock · Bayesian Methods and Mixture Models

MethodsNormalizing Flows