Estimating Viral Genetic Linkage Rates in the Presence of Missing Data
Tyler Vu, Tuo Lin, Jingjing Zou, Vladimir Novitsky, Xin Tu, Victor, De Gruttola

TL;DR
This paper develops a new statistical estimator for viral linkage rates that accounts for missing data, reducing bias and providing reliable inference in network analysis, demonstrated on HIV data from a large trial.
Contribution
It introduces a subsampling-based estimator using U-statistics to handle missing data in network linkage analysis, with proven consistency and asymptotic normality.
Findings
Estimator reduces bias in linkage probability estimation
Method achieves asymptotic normality under missing data
Application to HIV data demonstrates practical utility
Abstract
Although the interest in the the use of social and information networks has grown, most inferences on networks assume the data collected represents the complete. However, when ignoring missing data, even when missing completely at random, this results in bias for estimators regarding inference network related parameters. In this paper, we focus on constructing estimators for the probability that a randomly selected node has node has at least one edge under the assumption that nodes are missing completely at random along with their corresponding edges. In addition, issues also arise in obtaining asymptotic properties for such estimators, because linkage indicators across nodes are correlated preventing the direct application of the Central Limit Theorem and Law of Large Numbers. Using a subsampling approach, we present an improved estimator for our parameter of interest that accommodates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Complex Network Analysis Techniques · HIV/AIDS Research and Interventions
