An Empirical Bayes Approach for Constructing the Confidence Intervals of Clonality and Entropy
Zhongren Chen, Lu Tian, Richard Olshen

TL;DR
This paper introduces an empirical Bayes method combined with resampling to construct robust confidence intervals for diversity measures like clonality and entropy in immune response data, based on high-dimensional multinomial observations.
Contribution
It proposes a novel empirical Bayes approach with resampling calibration for accurate confidence interval construction of diversity parameters.
Findings
Method performs well in numerical studies
Applied successfully to real immune response data
Provides robust uncertainty quantification for diversity measures
Abstract
This paper is motivated by the need to quantify human immune responses to environmental challenges. Specifically, the genome of the selected cell population from a blood sample is amplified by the well-known PCR process of successive heating and cooling, producing a large number of reads. They number roughly 30,000 to 300,000. Each read corresponds to a particular rearrangement of so-called V(D)J sequences. In the end, the observation consists of a set of numbers of reads corresponding to different V(D)J sequences. The underlying relative frequencies of distinct V(D)J sequences can be summarized by a probability vector, with the cardinality being the number of distinct V(D)J rearrangements present in the blood. Statistical question is to make inferences on a summary parameter of the probability vector based on a single multinomial-type observation of a large dimension. Popular summary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolution and Genetic Dynamics · Gene expression and cancer classification · Bayesian Methods and Mixture Models
