Two-Sample Inference in Highly Dispersed Negative Binomial Models
David Shilane, Derek Bean

TL;DR
This paper investigates two-sample inference for differences in means under highly dispersed Negative Binomial models, showing the Normal approximation's robustness and proposing methods to improve confidence interval coverage.
Contribution
It extends existing one-sample methods to two-sample Negative Binomial models and introduces a mixture approach to enhance inference accuracy.
Findings
Normal approximation is more robust in two-sample settings.
Mixture of methods improves coverage in small samples.
Simulation studies confirm the effectiveness of proposed techniques.
Abstract
Two-sample inference for the difference of population means typically relies upon a Central Limit Theorem approximation. When data are drawn from a Negative Binomial distribution, previous work of Shilane et al. (2010) showed that a Normal approximation is often unreliable in one-sample inference and proposed alternative techniques. We seek to extend these methods to the problem of two-sample inference on the difference of sample means in highly dispersed Negative Binomial models. We demonstrate that the Normal approximation is considerably more robust in the two-sample setting and may often be applied even when it is not appropriate for either sample individually. We also provide an intuitive extension of Bernstein's Inequality to the two-sample case. A simple mixture of these two methods may improve coverage in borderline and small sample settings. We subsequently investigate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods in Clinical Trials · Statistical Methods and Inference
