Resolving Histogram Binning Dilemmas with Binless and Binfull Algorithms
Abram Krislock, Nathan Krislock

TL;DR
This paper introduces two 'debinning' algorithms that eliminate binning bias in histograms by reconstructing the underlying distribution from data, enhancing analysis accuracy in sciences like high energy physics.
Contribution
It presents novel binless and binfull algorithms to remove histogram binning bias, allowing more accurate probability distribution estimation from data.
Findings
Algorithms successfully reconstruct distributions without binning bias
Comparison highlights strengths and weaknesses of each method
Discussion on future applications and improvements
Abstract
The histogram is an analysis tool in widespread use within many sciences, with high energy physics as a prime example. However, there exists an inherent bias in the choice of binning for the histogram, with different choices potentially leading to different interpretations. This paper aims to eliminate this bias using two "debinning" algorithms. Both algorithms generate an observed cumulative distribution function from the data, and use it to construct a representation of the underlying probability distribution function. The strengths and weaknesses of these two algorithms are compared and contrasted. The applicability and future prospects of these algorithms is also discussed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Particle physics theoretical and experimental studies · Bayesian Methods and Mixture Models
