Joint Modeling of Longitudinal EHR Data with Shared Random Effects for Informative Visiting and Observation Processes
Cheng-Han Yang, Xu Shi, Bhramar Mukherjee

TL;DR
This paper introduces a joint modeling approach for longitudinal EHR data that accounts for irregular visits and selective biomarker measurement, reducing bias in association estimates from complex missing data mechanisms.
Contribution
We develop a unified semiparametric framework with shared latent variables to simultaneously model visiting, observation, and outcome processes, improving bias correction in EHR data analysis.
Findings
Our method produces unbiased estimates in simulations.
Existing methods can be biased, especially when only adjusting for visit irregularity.
Application to All of Us data reveals meaningful biomarker associations.
Abstract
Longitudinal electronic health record (EHR) data offer opportunities to study biomarker trajectories; however, association estimates-the primary inferential target-from standard models designed for regular observation times may be biased by a two-stage hierarchical missingness mechanism. The first stage is the visiting process (informative presence), where encounters occur at irregular times driven by patient health status; the second is the observation process (informative observation), where biomarkers are selectively measured during visits. To address these mechanisms, we propose a unified semiparametric joint modeling framework that simultaneously characterizes the visiting, biomarker observation, and longitudinal outcome processes. Central to this framework is a shared subject-specific Gaussian latent variable that captures unmeasured frailty and induces dependence across all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth, Environment, Cognitive Aging · Advanced Causal Inference Techniques · Machine Learning in Healthcare
