A model to predict participant retention in longitudinal acute pancreatitis studies
Ila Lahooti, Lydia Noh, Melica Nikahd, Kathleen Tong, Anna Evans Phillips, Stacey Culp, Matthew Yoder, Patrick Rowan, Vikesh K. Singh, Elham Afghani, Ali Lahooti, Philip A. Hart, Erica Park, Somashekar G. Krishna, Raj Shah, Mitchell L. Ramsey, Georgios I. Papachristou

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPancreatitis Pathology and Treatment · Healthcare cost, quality, practices · Food Security and Health in Diverse Populations
Acute pancreatitis (AP) is an inflammatory response to pancreatic injury that is one of the leading causes of gastrointestinal hospitalizations, resulting in nearly $2.6 billion in healthcare costs each year [1]. Recently, research surrounding AP has increasingly focused on studying long-term disease outcomes [2,3]. However, participant retention is a critical challenge in longitudinal AP research, where loss to follow-up (LTFU) can compromise study validity and generalizability [4]. Thus, it is imperative to identify factors predictive of retention. Given this need, we conducted a secondary analysis of the multicenter Post-Acute Pancreatitis Pancreatic Exocrine Insufficiency (PAPPEI) study to evaluate factors predictive of participant retention, including the development of prediction models at 3- and 12-months follow-up.
Using the Revised Atlanta Criteria, the PAPPEI study enrolled adults admitted with AP across three tertiary care centers from September 2017 to September 2021. Participants completed comprehensive physical and mental baseline questionnaires, blood, and stool collection, with follow-up at 3 and 12 months. Retention was defined as completion of follow-up questionnaires and stool submission at the 3- and 12-month timepoints. Concerted efforts were made to contact participants including attempts by the research team via the participant’s communication style of choice (call, text, email), EMR messaging by the physician investigator, as well as outreach to their primary care providers. Additional details related to the study protocol have been previously described [2].
Candidate predictors collected included baseline demographic variables including age, sex, race, education, marital, and employment statuses, and income. Additionally, clinical characteristics such as alcohol-induced etiology, comorbidity burden, AP severity, and specimen submissions at baseline were considered (Supplementary Table 1). All analyses were performed using SAS 9.4. Next, two separate multivariable logistic regression models were developed for each follow-up timepoint using forward selection. Model performance was assessed with receiver operating characteristic (ROC) analysis as well as area under the ROC curve (AUC), sensitivity, specificity, and accuracy of the models. Internal validation was applied with leave out one cross-validation (LOOCV) in order to guard against model overfitting.
A total of 184 participants completed the baseline questionnaires and thus represented the baseline cohort. Of these 184 participants, ninety completed the 3-month follow-up and 100 completed the 12-month follow-up, representing retention rates of 49 % and 54 %, respectively (Supplementary Tables 2 and 3). At both follow-up timepoints, older age, higher education level, and submission of a baseline stool sample were significantly associated with higher retention. The strongest predictor of retention was baseline stool submission (OR 19.42 at 3-months; OR 7.34 at 12-months, both p < 0.001). At the 3-month timepoint, participants who were married, retired, and had a household income of at least $50,000 had statistically significant increased retention rates (Supplementary Table 2). The final 3-month model included the predictors of baseline age, education, and stool sample submission and achieved an AUC of 0.86. The 12-month model included the predictors of baseline age, stool and blood submission, education, comorbidity burden, and AP severity and had an AUC of 0.79 (Fig. 1). Internal validation was performed using LOOCV and demonstrated robust model performances. Additionally, it is important to note that Black participants were disproportionately LTFU (86 % attrition rate vs. 49 % of White participants) at 12 months.
Overall, these findings underscore the substantial attrition in longitudinal AP research despite intensive outreach efforts. The low follow-up rate among Black participants highlights a pressing need to investigate AP-specific barriers to engagement in this population [5–7]. Prior studies in other diseases of this patient population have shown that barriers include mistrust of the medical and research community, logistical and technical barriers, lack of perceived relevance and benefit, socioeconomic factors, as well as communication and engagement challenges [8]. These factors also likely contribute to attrition rates in longitudinal AP research and should serve as a foundation for future studies aimed at improving retention in this patient population.
Our study also identified the notable influence of sociodemographic factors on participant retention. Higher education levels and increased age were significantly associated with follow-up completion at both the 3- and 12-month periods, suggesting that older participants may have more flexibility and time that enable them to complete follow-ups; additionally, in a study by Kim et al., it was found that patients with higher levels of education may experience less barriers to study participation including, but not limited to, transportation, childcare, work schedules, and psychological challenges [9]. Further understanding these socioeconomic influences on retention can help inform targeted strategies, such as tailored outreach efforts and resource allocation, that may reduce barriers to participant follow-up and thereby improve retention in diverse patient populations.
Finally, our results demonstrate that an early willingness to submit a stool sample was the strongest indicator of long-term participation, suggesting patient engagement at baseline is critical. Yet, nearly 40 % of participants did not submit a stool sample, highlighting a persistent challenge in AP research retention. Prior interview-based studies have shown that patients often find stool collection inconvenient and lack confidence or adequate instruction on how to complete the process [10]. Addressing these barriers by simplifying the collection process and offering greater patient support and education is an actionable opportunity for future longitudinal AP studies to improve retention.
To our knowledge, our study is the first to develop internally validated models to predict participant retention in longitudinal AP research. A strength of our study is that our model uses an accessible formula (Supplementary Materials) with a small set of readily available variables that can easily be replicated for external application. Our study is limited by a modest sample size, due to the practical limitations of resource constraints, that precluded the evaluation of model fairness. Additionally, participants were enrolled non-consecutively, therefore our AP cohort may not be representative of the overall AP cohort. Future studies should apply this model to a larger, more representative cohort.
In conclusion, participant retention remains a major challenge in longitudinal AP studies. Our study provides an accessible prediction model to assess retention risk in patients with AP, allowing targeted interventions to maximize participant follow-up and thereby strengthen the generalizability of AP research.
Supplementary Material
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Peery AF, Crockett SD, Murphy CC, Burden and cost of gastrointestinal, liver, and pancreatic diseases in the United States: update 2018. Gastroenterology 2019;156(1):254–272.e 11. 10.1053/j.gastro.2018.08.063.30315778 PMC 6689327 · doi ↗ · pubmed ↗
- 2Paragomi P, Phillips AE, Machicado JD, Post-acute pancreatitis pancreatic exocrine insufficiency: rationale and methodology of a prospective, observational, multicenter cohort study. Pancreas 2021;50(2):147–52. 10.1097/MPA.0000000000001743.33565790 PMC 9194920 · doi ↗ · pubmed ↗
- 3Czapári D, Váradi A, Farkas N, Detailed characteristics of post-discharge mortality in acute pancreatitis. Gastroenterology 2023;165(3):682–95. 10.1053/j.gastro.2023.05.028.37247642 · doi ↗ · pubmed ↗
- 4Hollemans RA, Hallensleben NDL, Mager DJ, Pancreatic exocrine insufficiency following acute pancreatitis: systematic review and study level meta-analysis. Pancreatology 2018;18(3):253–62. 10.1016/j.pan.2018.02.009.29482892 · doi ↗ · pubmed ↗
- 5Herring P, Montgomery S, Yancey AK, Williams D, Fraser G. Understanding the challenges in recruiting blacks to a longitudinal cohort study: the adventist health study. Ethn Dis 2004;14(3):423–30.15328945 · pubmed ↗
- 6Ejiogu N, Norbeck JH, Mason MA, Cromwell BC, Zonderman AB, Evans MK. Recruitment and retention strategies for minority or poor clinical research participants: lessons from the healthy aging in neighborhoods of diversity across the life span study. Gerontol 2011;51(Suppl 1):S 33–45. 10.1093/geront/gnr 027.PMC 309297821565817 · doi ↗ · pubmed ↗
- 7Bontemps-Jones JE, Mc Cullough LE, Kirkland EG, Beyond tuskegee: a contemporary qualitative assessment of barriers to research participation among black women. Cancer 2025;131(1):e 35648. 10.1002/cncr.35648.39602086 · doi ↗ · pubmed ↗
- 8Yousafi S, Rangachari P, Holland ML. Barriers to recruitment and retention among underrepresented populations in cancer clinical trials: a qualitative study of the perspectives of clinical trial research coordinating staff at a cancer center. J Healthc Leader 2024;16:427–41. 10.2147/JHL.S 488426.PMC 1153720039502080 · doi ↗ · pubmed ↗
