Beta-trees: Multivariate histograms with confidence statements
Guenther Walther, Qian Zhao

TL;DR
Beta-trees provide an efficient, data-adaptive multivariate histogram method with guaranteed confidence intervals, effectively addressing the curse of dimensionality for data visualization and analysis.
Contribution
The paper introduces Beta-trees, a novel data-dependent partitioning method that offers guaranteed confidence intervals for multivariate densities, overcoming dimensionality challenges.
Findings
Confidence intervals depend only on probability content, not dimension.
Beta-trees adapt to regions with near-uniform distribution.
Method effectively visualizes data and identifies modes.
Abstract
Multivariate histograms are difficult to construct due to the curse of dimensionality. Motivated by -d trees in computer science, we show how to construct an efficient data-adaptive partition of Euclidean space that possesses the following two properties: With high confidence the distribution from which the data are generated is close to uniform on each rectangle of the partition; and despite the data-dependent construction we can give guaranteed finite sample simultaneous confidence intervals for the probabilities (and hence for the average densities) of each rectangle in the partition. This partition will automatically adapt to the sizes of the regions where the distribution is close to uniform. The methodology produces confidence intervals whose widths depend only on the probability content of the rectangles and not on the dimensionality of the space, thus avoiding the curse of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Greenhouse Technology and Climate Control · Forest ecology and management
