TL;DR
This paper introduces Monte Carlo Dependency Estimation (MCDE), a new framework for quantifying variable dependency in static and streaming data, using Monte Carlo simulations to compare distributions.
Contribution
The paper proposes MCDE as a theoretical framework and introduces MWP, a novel dependency estimator that outperforms existing methods in various scenarios.
Findings
MWP satisfies key properties for dependency estimation.
MWP can handle any numerical data type.
MWP demonstrates superior performance over state-of-the-art measures.
Abstract
Estimating the dependency of variables is a fundamental task in data analysis. Identifying the relevant attributes in databases leads to better data understanding and also improves the performance of learning algorithms, both in terms of runtime and quality. In data streams, dependency monitoring provides key insights into the underlying process, but is challenging. In this paper, we propose Monte Carlo Dependency Estimation (MCDE), a theoretical framework to estimate multivariate dependency in static and dynamic data. MCDE quantifies dependency as the average discrepancy between marginal and conditional distributions via Monte Carlo simulations. Based on this framework, we present Mann-Whitney P (MWP), a novel dependency estimator. We show that MWP satisfies a number of desirable properties and can accommodate any kind of numerical data. We demonstrate the superiority of our estimator…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
