Development and Validation of a Novel Risk Calculator to Predict Sub-optimal HIV Outcomes Among Pregnant and Postpartum Women with HIV in Kenya

Kevin Owuor; Janet M. Turan; Jeff M. Szychowski; Maricianah Onono; Linet Ongeri; Laura K. Beres; Anna Helova; Emmah Ouma; Mercelline Onyando; Rena C. Patel; Patrick Oyaro; Lisa L. Abuogi; Karen Hampanda

PMC · DOI:10.1007/s10461-025-04814-8·July 10, 2025

Development and Validation of a Novel Risk Calculator to Predict Sub-optimal HIV Outcomes Among Pregnant and Postpartum Women with HIV in Kenya

Kevin Owuor, Janet M. Turan, Jeff M. Szychowski, Maricianah Onono, Linet Ongeri, Laura K. Beres, Anna Helova, Emmah Ouma, Mercelline Onyando, Rena C. Patel, Patrick Oyaro, Lisa L. Abuogi, Karen Hampanda

PDF

Open Access

TL;DR

This study created a risk calculator to predict poor HIV outcomes in pregnant and postpartum women in Kenya, helping identify those needing targeted support.

Contribution

A novel risk calculator was developed and validated to predict disengagement from care and treatment failure in pregnant and postpartum women with HIV.

Findings

01

The risk calculator includes demographic, clinical, and psychosocial variables to predict outcomes.

02

The model for women with known HIV diagnosis showed strong discrimination (AUROC 0.843) and moderate calibration.

03

The calculator can stratify risk into low, moderate, and high categories for targeted interventions.

Abstract

No tool currently exists to predict the cumulative risk of suboptimal clinical outcomes among pregnant and postpartum women with HIV (PPWH). This study sought to develop and validate a parsimonious risk calculator capable of predicting disengagement from care and HIV treatment failure among PPWH. We created the risk calculator using data from 1,331 PPWH from Southwestern Kenya (Homabay, Migori, and Kisumu Counties) in the Mother Infant Visit Adherence and Treatment Engagement Trial. Least absolute shrinkage and selection operator logistic regression retained the most predictive variables from 16 candidate factors to estimate the probability of treatment failure or disengagement from care. Three risk quintiles were calculated. We assessed external validation with an independent dataset (Opt4Mamas; N = 820). Cross-validated area under the curve of receiver operating characteristic (AUROC)…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species2

Human immunodeficiency virus 1(no rank)Homo sapiens(human · species)

Diseases5

intimate depression failure HIV food insecurity

Figures2

Click any figure to enlarge with its caption.

Internal and External Model Discrimination for participants with known HIV diagnosis at the time of pregnancy

Probability of treatment failure/disengagement from care by risk group in final models

Funding1

—http://dx.doi.org/10.13039/100000025National Institute of Mental Health

Keywords

HIVPregnantPostpartumRiskVirologic FailureCare Disengagement

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHIV/AIDS Impact and Responses · HIV/AIDS Research and Interventions · HIV-related health complications and treatments

Full text

Introduction

Pregnant and postpartum women with HIV (PPWH) experience varied success with HIV treatment and maternal and child health outcomes [1]. The proportion of PPWH accessing antiretroviral therapy (ART) has steadily increased, with the most recent estimates (2023) indicating that 84% of PPWH globally have access to ART [1]. There has been a reduction in mother-to-child transmission (MTCT) globally from 22% in 2010 to 10% in 2023; however, the target of elimination of MTCT has not been achieved [2, 3]. Poor retention of women on antiretroviral therapy (ART) during pregnancy and breastfeeding is a major contributing factor for MTCT (5), with a quarter of African PPWH lost to follow-up within six months of starting ART [4]. In Eastern Africa, 40% of new pediatric infections are estimated to be attributable to PPWH stopping ART [5]. Additionally, maintaining high adherence to ART and continuous viral suppression during the postpartum period remains challenging [6–8].

Prediction tools using a risk score have successfully been implemented in both high and low-resource settings to identify patients with various health conditions at risk of specific outcomes, including HIV [9–13]. Prediction tools have been used to detect patients who are at high risk of acquiring HIV [14–19], disengaging from care after HIV diagnosis [20] and to identify non-pregnant/postpartum patients with HIV who are likely to have treatment failure due to a combination of clinical (prior treatment failure, reported non-adherence, missed clinic visits) [21] and psychosocial (homelessness, mental health conditions, violence/trauma, stigma) factors [17, 22]. A study from western Kenya with pregnant/postpartum women without HIV reported the utility of a risk stratification tool; capturing male partner HIV status, number of sexual partners, and diagnosis of sexually transmitted infections (STIs); for predicting acquisition of HIV [23].

Despite numerous known risk factors for poor treatment outcomes and loss to follow up (LTFU), including intimate partner violence (IPV), stigma, depression, young age, new HIV diagnosis [4, 24–39], no tool currently exists to predict a PPWH’s cumulative risk of disengagement from care or treatment failure. There is a pressing need for such a tool, given the differential success of PPWH and limited resources available to support maternal and child health goals and eliminate MTCT. A validated prediction tool has the potential to help researchers and clinicians identify PPWH at-risk of disengagement from care and treatment failure to intervene before negative outcomes, such as MTCT, occur [20, 40–42]. The objective of this study was to develop and validate a novel multivariable prediction tool (i.e., “risk calculator”) for the combined outcome of disengagement from care or elevated viral load among PPWH using risk prioritization scores (high, moderate, low) based on socio-demographic, clinical, and psychosocial indicators known to be associated with sub-optimal outcomes among PPWH.

Methods

Sources of Data

Development Dataset

A longitudinal quantitative dataset from the Mother-Infant Visit Adherence and Treatment Engagement (MOTIVATE) randomized trial with 1,331 PPWH (18–45 years) from 24 government-run clinics in Kisumu, Migori, and Homa Bay Counties and associated sub-studies was used to develop the risk calculator (i.e., derivation dataset) [43]. The study was conducted among PPWH recruited between December 2015 and August 2017 and followed up to March 2019. Data from the parent MOTIVATE study included demographics, retention in care (HIV care visit within 90 days at 12 months postpartum), and routinely collected viral load. The sub-studies included a detailed follow-up psychosocial survey at 12 months postpartum with a sub-sample of MOTIVATE participants (n = 200), and a survey among PPWH who were lost-to-follow-up or discontinued from the main study and who could subsequently be traced (n = 79).

Validation Dataset

The Opt4Mamas study (N = 767 PPWH) was used as the validation dataset. Opt4Mamas was a pre- and post-implementation cohort study of a point-of-care (POC) viral load testing intervention among PPWH (18–45 years) in five government clinics in Kisumu, Kenya [44]. Participants were enrolled starting in February 2019 and followed through December 2021. Viral load and extensive psychosocial data were collected during pregnancy (baseline), delivery, and six months postpartum. The variables used for model validation were similar to those used for model development from the MOTIVATE trial [43].

Setting

Homabay, Migori, and Kisumu Counties are in Western Kenya and all border Lake Victoria. Homabay County has the highest HIV prevalence rate in Kenya, estimated at 19.6% followed by Kisumu (17.5%) and Migori (13.3%) Counties according to the 2018 Kenya Population-based HIV Impact Assessment (KENPHIA) [45]. Homabay, Kisumu, and Migori Counties have a combined annual estimate of over 17,000 pregnant women with HIV, with high perinatal HIV transmission rates of 7.2–8.7% [46, 47]. During the study periods, all Kenya Ministry of Health (MoH) facilities provided ART free of charge for all PPWH and routine viral load (VL) testing every six months (initial VL after 6 months of ART for women with new HIV diagnoses and upon pregnancy for those already on ART) through breastfeeding per MoH guidelines. At the time of the MOTIVATE study, HIV viral suppression was considered < 1,000 copies/ml; at the time of the OPT4MAMAS study and secondary data analysis for the present study, HIV viral suppression was considered < 400 copies/ml per MoH guidelines.

Outcome

The outcome of interest for the risk prediction tool was defined as a combined binary outcome of disengagement from care (missed HIV visit ≥30 days) or elevated viral load (≥400 copies/ml) among PPWH at 12 months postpartum.

Predictors

A total of 16 demographic, clinical, and psychosocial candidate predictors were assessed for potential inclusion in the risk calculator based on our team’s prior research [43, 44, 48–50] and existing literature (Table 1) [4, 24–39].

Table 1. Risk factor variables used in model developmentVariablesKnown HIV diagnosisNew HIV diagnosisDemographic predictorsAssessedIncluded in Final ModelAssessedIncluded in Final ModelAge (Young age [< 26 years] vs. older age [≥26 years])✓✓✓✓Parity (Nulliparous vs. parous)✓✓✓Marital status (Not married vs. Married)✓✓✓✓Clinical predictorsBaseline viral load category (< 1000 copies, ≥1000 copies/ml, or Unknown)✓✓History of viral failure (≥400 copies/ml) after baseline and before 12 months postpartum (Yes vs. No)✓✓Baseline Adherence (Good, Poor, or Unknown)✓✓History of missed visit - Past missed clinic visits of > 14 days before 12 months postpartum (Yes vs. No)✓✓Antiretroviral Therapy Regimen (Second- or third-line regimen vs. First-line)✓✓Late gestational age at first ANC (> 26 weeks at first ANC visit vs. <=26 weeks)✓Psychosocial predictorsDepression symptoms on Patient Health Questionnaire PHQ-9 (Yes [≥10] vs. No [< 10]) [46, 47]✓✓✓Internalized HIV-related stigma - Endorsed stigma on internalized stigma scale (Yes [any of 6 items > 1] vs. No [all items = 1) [51]✓✓✓✓Anticipated HIV-related stigma - Endorsed stigma on anticipated stigma scale (Yes [any of 5 items > 2] vs. No [all items ≤ 2) [48, 52]✓✓✓✓Intimate partner violence (Exposure to physical, sexual, or emotional IPV in the past 12 months vs. No)✓✓✓✓Low male partner support (Low perceived levels of male partner support from measures of affective support [< 4] vs. High levels [≥4]) [53, 54]✓✓✓Low male partner involvement in PMTCT (Low score [> 4] on the male partner involvement scale vs. High [≤ 4]) [49]✓✓Non-disclosure of HIV status to male partner (Yes vs. No)✓✓✓Food insecurity (Food insecure [Mildly, Moderately, or Severely] as per Household Food Insecurity Access Scale vs. Food secure) [55].✓✓✓✓PMTCT– Prevention of mother-to-child transmission of HIV

Sample Size and Power Calculation

The derivation dataset (N = 1331) was adequate (≥880 participants needed) to ensure that the developed prediction model using a binary outcome (the combined outcome of a missed visit (≥30 days) or treatment failure (elevated VL≥400 copies/ml) would have a small mean absolute prediction error (MAPE) when applied in other targeted individuals, based on a set MAPE of 0.05 (recommended largest value) with an anticipated outcome of 20% or more with missed visit or treatment failure (based on MOTIVATE outcome data) and up to 12 candidate predictor parameters [51]. Our validation dataset (N = 767) was adequate (≥650 participants needed) to ensure small expected optimism in the apparent R^2^Nagelkerke = (R^2^cs(cox snell)/max(R^2^cs); anticipated R^2^cs≥ 0.2; max(R^2^cs) = 0.33; delta ≤ 0.05) of the logistic regression model [52].

Missing Data

Both the development and validation datasets had missing data that was handled using Multivariate Imputation by Chained Equations (MICE). The development dataset had many missing values due to limited psychosocial data in the main dataset; these were collected via two surveys administered to a subsample of 8% of participants in MOTIVATE from the LTFU sub-study (n = 78) and MOTIVATE Follow-up sub-study (n = 198). These will be referred to as the ‘LFTU sub-study’ and the ‘Follow-up sub-study’, respectively. Missing data patterns were examined using Little’s test to assess whether the data were missing completely at random. Little’s test for covariate-dependent missingness (CDM) assumption was used to infer whether the data were likely missing at random [53]. MICE was conducted with 193 and 20 imputation iterations based on the fraction of missing data for development and validation data, respectively [54–56].

Statistical Analysis Methods

Separate models were built and assessed for PPWH with a known HIV diagnosis prior to the pregnancy and those with a new HIV diagnosis, because PPWH with a new HIV diagnosis did not have clinically relevant historical data (e.g., history of elevated viral load). Analytical methods were based on previous studies that have successfully developed predictive tools [17, 22, 57] and TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis) criteria [58]. Descriptive analyses of participant characteristics were conducted. Bivariate associations of the outcome and predictors were assessed using Chi-square tests. A least absolute shrinkage and selection operator (Lasso) penalized logistic regression procedure was used to retain the most predictive variables in model development. The Lagrangian form of the Lasso formula is shown below [59].

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{\widehat{\beta\:}}^{lasso}=\genfrac{}{}{0pt}{}{arg\:min}{\beta\:}\left\{\frac{1}{2}\sum\:_{i=1}^{N}{({y}_{i}-{\beta\:}_{0}-\sum\:_{j=1}^{p}{x}_{ij}{\beta\:}_{j})}^{2}+\:\lambda\:\sum\:_{j=1}^{p}\left|{\beta\:}_{j}\right|\right\}$$\end{document}

Where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{\widehat{\beta\:}}^{lasso}$$\end{document}$ is a Lasso estimate, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{\beta\:}_{0}$$\end{document}$ is the intercept, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{\beta\:}_{j}\:$$\end{document}$ are least squares estimates, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{y}_{i}$$\end{document}$ is virologic failure or disengagement from care outcome, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{x}_{ij}$$\end{document}$ are predictors, L_1_ Lasso regularization (penalty) term is $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:{\sum\:}_{1}^{p}\left|{\beta\:}_{j}\right|$$\end{document}$ , and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:\lambda\:$$\end{document}$ is the regularization coefficient.

All predictors were included in the model for the participants with a known HIV diagnosis, while historical clinical predictors were omitted in the model for participants with a new HIV diagnosis. A 10-fold cross-validation method was applied to find the regularization parameter lambda that minimizes the mean squared prediction error (MSPE). The Lasso models were fit to each imputed dataset. Predictors with nonzero coefficients in at least 50% of all models were selected for calculating the predicted risk of the outcome of virologic failure/disengagement in care. Combined linear predictors per participant were calculated for all imputation estimates using Rubin’s rules after confirming the asymptotic normality assumption of the estimates [60]. The linear predictors were used to estimate the probability of treatment failure or disengagement from care from which three quintiles were calculated: Low, Moderate, and High Risk.

Internal validation and performance assessment were done via bootstrapping (1000 resamples) by assessing the cross-validated discrimination and calibration. External validation was done using the separate independent study (Opt4Mamas) dataset. The Brier score as mean squared error (MSE) or MSPE, and concordance statistic (C-statistic; area under the receiver-operating characteristic [AUROC] curve) were used to measure performance and discrimination [61, 62]. Calibration was assessed by plotting predicted probabilities against the observed event proportions.

Risk Groups

The predicted risk of disengagement from care or treatment failure in either group was defined as Risk_KP_ = exp(LP_KP_) / (1 + exp(LP_KP_)) and Risk_NP_ = exp(LP_NP_) / (1 + exp(LP_NP_)) for participants with known HIV positive diagnoses (KP) and those with new HIV positive diagnoses (NP), respectively. The predicted risks were also categorized as quartiles of prediction scores to reflect high (3rd quartile), moderate (2nd quartile), and low (1st quartile) risks of virologic failure or disengagement from care.

Ethical Review Statement

The ethical review approval was granted by the Kenya Medical Research Institute Scientific and Ethics Review Unit, the University of Alabama at Birmingham, and the University of Colorado Denver IRBs in the United States. Only de-identified datasets were used for the study, and procedures were conducted in accordance with the Declaration of Helsinki (version 2024).

Results

Participant Characteristics

The development dataset included 1,331 participants, as described in Table 2. Most were married (91.5%), were 26 years or older (66.9%), and were known to be living with HIV before the index pregnancy (78.8%). Among all 767 participants in the validation dataset, the majority were married (85.1%), over 26 years old (72.2%), and had known HIV diagnoses at the time of pregnancy (93.7%) (Table 2). There was no significant difference in the primary outcome among PPWH with known diagnoses between the development cohort (43.4%) and validation data (36.9%). The characteristics by outcome and timing of HIV diagnosis in both datasets are shown in Table 3.

Table 2. Characteristics of pregnant and postpartum women with HIV (PPWH) in the development and validation datasetsParticipant CharacteristicsDevelopment DatasetValidation DatasetN = 1331N = 767Demographic ItemsCurrently married, n (%)1218 (91.5)653 (85.1)Young age (< 26 years), n (%)440 (33.1)213 (27.8)Parous, n (%)1211 (91.1)664 (86.9)Gestational age < = 26 weeks at first ANC visit, n (%)792 (59.5)622 (83.2)Clinical Items^a^Known HIV positive (diagnosed before most recent pregnancy), n (%)1049 (78.8)719 (93.7)First Line ART Regimen, n (%)1270 (95.4)546 (80.4)Baseline Adherence, n (%)Fair/Poor35 (2.6)2 (0.3)Good1063 (79.9)379 (52.7)Unknown233 (17.5)338 (47.0)History of missed clinic visit(s) > 14 days, n (%)818 (63.4)139 (21.1)Baseline viral load categories, n (%)<1000 copies999 (75.1)383 (53.3)>=1000 copies129 (9.7)84 (11.7)Unknown203 (15.3)252 (35.0)History of viral failure [VL > = 400 copies/ml], n (%)204 (16.8)155 (23.2)Psychosocial Items^b^Physical Intimate Partner Violence in past 12 months, n (%)94 (54.0)100 (13.0)Anticipated HIV-related stigma, n (%)84 (42.4)177 (23.1)Internalized HIV-related stigma, n (%)114 (57.6)99 (12.9)Major Depression, n (%)110 (55.6)56 (7.3)Low Male partner support, n (%)72 (40.4)66 (8.6)Non-disclosure of HIV status to male partner, n (%)33 (16.7)10 (1.3)Household Food Insecurity, n (%)89 (44.9)245 (31.9)^a^Only collected on PPWH with a known HIV diagnosis^b^Psychosocial data was only completed among 198 participants. Imputations were therefore used in the regression models

Table 3. Characteristics of pregnant and postpartum women with HIV (PPWH) with and without missed visits or treatment failureDevelopment Dataset (N = 1331)Validation Dataset^c^ (N = 767)Known HIV Diagnoses N = 1049New HIV Diagnoses N = 282Known HIV Diagnoses N = 658New HIV Diagnoses N = 47Participant CharacteristicsMissed visit or treatment failureMissed visit or treatment failureMissed visit or treatment failureMissed visit or treatment failureNoYesNoYesNoYesNoYesn (%)594 (56.6)455 (43.4)P-value168 (59.6)114 (40.4)P-value415 (63.1)243 (36.9)P-value41 (87.2)6 (12.8)P-valueDemographic ItemsNot married, n (%)52 (8.8)30 (6.6)0.220 (11.9)11 (9.6)0.5560 (14.5)31 (12.8)0.548 (19.5)2 (33.3)0.44Young age (< 26 years), n (%)169 (28.5)110 (24.2)0.1299 (58.9)62 (54.4)0.45110 (26.5)76 (31.3)0.198 (19.5)1 (16.7)0.87Nulliparous, n (%)555 (93.4)429 (94.3)0.5731 (18.7)22 (19.3)0.947 (11.4)38 (15.8)0.101 (2.4)1 (16.7)0.11Gestational age < = 26 weeks at first ANC visit, n (%)351 (59.1)280 (61.5)0.4294 (56.0)67 (58.8)0.64334 (82.3)192 (82.1)0.9537 (90.2)5 (83.3)0.61Clinical Items^a^First Line ART Regimen, n (%)559 (94.1)429 (94.3)0.9364 (87.7)168 (69.1)< 0.001Baseline Adherence, n (%)0.22< 0.001Fair/Poor19 (3.2)12 (2.6)2 (0.5)0 (0.0)Good554 (93.3)417 (91.6)248 (59.8)113 (46.5)Unknown21 (3.5)26 (5.7)165 (39.8)130 (53.5)Baseline viral load categories, n (%)0.441< 0.001<1000 copies484 (81.5)365 (80.2)258 (62.2)115 (47.3)>=1000 copies52 (8.8)50 (11.0)3 (0.7)81 (33.3)Unknown58 (9.8)40 (8.8)154 (37.1)47 (19.3)History of missed clinic visit(s) > 14 days, n (%)224 (39.0)439 (96.5)< 0.001224 (39.0)439 (96.5)< 0.001History of viral failure [VL > = 400 copies/ml], n (%)81 (14.3)84 (18.9)0.0480 (0.0)155 (64.3)< 0.001Psychosocial Items^b^Physical Intimate Partner Violence in past 12 months, n (%)44 (61.1)33 (47.1)0.19 (50.0)8 (57.1)0.6969 (16.6)24 (9.9)0.0212 (4.9)1 (16.7)0.27Anticipated HIV-related stigma, n (%)28 (33.7)41 (51.9)0.0197 (38.9)8 (44.4)0.74114 (27.5)50 (20.6)0.055 (12.2)0 (0.0)0.37Internalized HIV-related stigma, n (%)52 (62.7)39 (49.4)0.0913 (72.2)10 (55.6)0.364 (15.4)26 (10.7)0.094 (9.8)1 (16.7)0.61Major Depression Category, n (%)46 (55.4)45 (57.0)0.8411 (61.1)8 (44.4)0.3231 (7.5)16 (6.6)0.673 (7.3)0 (0.0)0.49Low Male partner support, n (%)31 (42.5)26 (36.1)0.439 (50.0)6 (40.0)0.5733 (8.0)20 (8.2)0.904 (9.8)1 (16.7)0.61Non-disclosure of HIV status to male partner, n (%)16 (19.3)12 (15.2)0.492 (11.1)3 (16.7)0.631 (0.2)6 (2.5)0.0120 (0)0 (0)-Household Food Insecurity, n (%)39 (47.0)33 (41.8)0.5111 (61.1)6 (33.3)0.1150 (36.1)74 (30.5)0.1411 (26.8)1 (16.7)0.59^a^Only collected on PPWH with a known HIV diagnosis^b^Psychosocial data was only collected for 198 participants (n = 162 for participants with Known HIV Diagnoses and n = 36 for those with New HIV Diagnoses). Imputations were used in the regression models^c^Only 705 out of 767 participants had completed the outcome variableNotes: Missed visit or treatment failure outcome was defined as disengagement from care (missed HIV visit ≥30 days) or elevated viral load (≥400 copies/ml) among PPWH at any data point in the parent studies up to 12 months postpartumAll p-values were based on Chi-square tests

Model Development

The set of binary indicator variables used in model development are shown in the models below.

The final Lasso models fit identified with Linear Predictor (LP_KP_) for participants with known HIV diagnosis at the time of pregnancy and Linear Predictor (LP_NP_) for participants with new HIV diagnosis at the time of pregnancy, defined as follows:

LP_KP_ = -2.112 -0.054I(Age < 26 years = Yes)* + 0.047I(Marital Status = Not Married)* + 0.634* I(Baseline VL ³ 1000 copies/ml) + 0.637I(Baseline VL = Unknown)* + 0.506I(History of viral failure [VL* ³400 copies/ml] = Yes) + 3.222I(History of missed clinic visit(s) > 14 days = Yes)* − 0.494I(Physical Intimate Partner Violence in past 12 months = Yes)* + 0.576I(Anticipated HIV-related stigma = Yes)* − 0.470I(Internalized HIV-related stigma = Yes)* − 0.040* I(Major Depression = Yes) + 0.703I(Low Male partner support = Yes)* + 0.245* I(Non-disclosure of HIV status to male partner = Yes) + 0.252I(ART Regimen = Second- or Third-line)* − 0.316**I(Baseline Adherence = Good)* + 0.189* I(Baseline Adherence = Unknown) − 0.248* I(Household Food Insecurity = Yes) and

LP_NP_ = 0.218 − 0.137* I(Age < 26 years = Yes) − 0.311* I(Marital Status = Not Married) + 0.284I(Nulliparous = Yes)* − 0.080I(Gestation age > 26 weeks = Yes)* − 0.148* I(Physical Intimate Partner Violence in past 12 months = Yes) + 0.652* I(Anticipated HIV-related stigma = Yes) -0.598* I(Internalized HIV-related stigma = Yes) + 0.352**I(Low Male Partner Support = Yes)* − 0.397* I(Household Food Insecurity = Yes).

The risk of disengagement from care or treatment failure in either group was defined as follows:

Risk_KP_ = exp(LP_KP_) / (1 + exp(LP_KP_)) and Risk_NP_ = exp(LP_NP_) / (1 + exp(LP_NP_)). A sample implementation of the risk tool in REDCap for the Tatua pilot study with REDCap data dictionary (Online Resource 1) and output are provided (Online Resource 2) [63].

Performance of Risk Models

Participants with Known HIV Diagnosis at the time of Pregnancy

The uncalibrated model for participants with known HIV diagnosis overestimated risks (calibration intercept of -0.17). However, the intercept-calibrated model demonstrated good calibration and discrimination (AUROC/C-statistic by bootstrap validation of 0.883 (95% CI 0.847, 0.892) as shown in Table 4; Fig. 1. The externally validated model underestimated risks (calibration intercept of 0.27). The calibrated and externally validated model demonstrated satisfactory calibration and good discrimination (AUROC/C-statistic by bootstrap validation of 0.843 (95% CI 0.805, 0.866)), as indicated in Table 4; Fig. 1.

Table 4. Internal and external risk model performanceModel PerformanceCross validated Mean AUCROC (95% CI^a^)MSE - Mean Squared Error (MSE)Mean Squared Prediction Error (MSPE)Internal Validation - Known HIV diagnosis Model^b^0.883 (0.847, 0.892)0.150Internal Validation - New HIV diagnosis Model ^d^0.637 (0.631, 0.642)0.236External Validation - Known HIV diagnosis Model ^c^0.843 (0.805, 0.866)0.160External Validation - New HIV diagnosis^e^0.463 (0.347, 0.597)0.126AUCROC– Area Under Curve Receiver Operating Characteristic, Confidence Interval– CI^a^Bootstrap bias corrected CI^b^Intercept calibrated model - intercept of -0.17 and coefficient of 1.37 before calibration^c^Intercept calibrated model - intercept of 0.27 and coefficient of 1.20 before calibration^d^No calibration done - intercept of 0 and coefficient of 1.01^e^No calibration done - intercept of 0 and coefficient of 0.06

Fig. 1. Internal and External Model Discrimination for participants with known HIV diagnosis at the time of pregnancy

Participants with New HIV Diagnosis at the time of Pregnancy

The uncalibrated model among participants with new HIV diagnosis indicated no tendency to over or underestimate risks (calibration intercept of 0) and not systematically low or high. Despite the good calibration, the model did not demonstrate strong discrimination defined by a C-statistic of 0.8 or more (AUROC/C-statistic by bootstrap validation of 0.637 (95% CI 0.631, 0.642)) as shown in Table 4. Similarly, the externally validated new HIV diagnosis model had a good calibration intercept, but risks were systematically low risk (calibration coefficient of 0.06). Overall, the model demonstrated lower discrimination (AUROC/C-statistic by bootstrap validation of 0.463 (95% CI 0.347, 0.597) as shown in Table 4.

Distribution of Predicted Risks of the Outcome by Risk Group

The model-predicted risk groups for the development and external validation (Fig. 2) showed similar distributions in the predicted risks for both participants with known and new HIV diagnoses. The mean predicted risk of the outcome in the low-risk group, moderate-risk group, and high-risk group was 6.1%, 55.8%, and 69.8%, respectively, for PPWH with known HIV diagnoses.

The model-predicted risk groups for the development and external validation (Fig. 2) showed similar distributions in the predicted risks for PPWH with known and new HIV diagnoses. The mean predicted risk of the outcome in the low-risk group, moderate-risk group, and high-risk group was 31.1%, 48.4%, and 64.7%, respectively, for PPWH with new HIV diagnoses.

Fig. 2. Probability of treatment failure/disengagement from care by risk group in final models

Discussion

Using two large longitudinal datasets from southwestern Kenya, this study developed and validated a novel risk calculator capable of predicting treatment failure and disengagement from care among PPWH using risk groups (low, moderate, and high). The findings indicate that the risk calculator can successfully predict PPWH at moderate or high risk of experiencing treatment failure or disengagement from HIV care, especially among those with a known HIV diagnosis prior to pregnancy, which is the majority of pregnant women living with HIV in many Eastern African settings currently. This early identification holds significant potential for both clinical practice and research to allocate resources more effectively and implement targeted interventions for better outcomes among PPWH at greatest risk of treatment failure and disengagement from care [4, 26–38, 43, 64].

The risk calculator exhibited different performance levels depending on the timing of HIV diagnosis of the women [6, 39, 69–71]. The final risk model for PPWH with a known HIV diagnosis outperformed those for PPWH with a new HIV diagnosis, indicating potential to identify those who are at risk of sub-optimal HIV treatment and care outcomes. This disparity suggests that the model benefits from the additional clinical history and data available for women with HIV diagnoses before pregnancy, which likely contributes to more accurate risk predictions. Overall, we expect a varied validation performance of models in different location settings, populations, and over time [65]. Several studies have documented that PPWH with a new HIV diagnosis have a high risk of poor retention and treatment failure [4, 38, 66–68]. The difference could also be due to smaller samples of PPWH with a new HIV diagnosis in the development and validation samples.

The variables included in the final model for PPWH with a known HIV diagnosis were similar to risk factors identified in literature among PPWH or other people living with HIV (PLHIV) [25–30, 33–39, 69, 70]. Similarly, for the model with newly diagnosed PPWH, the variables retained in the final model have been identified in existing literature among PPWH or other PLHIV [4, 30, 39, 69, 70]. Neither of the final models included later gestation age at the first antenatal care visit, parity, or low male partner involvement as key risk factors, even though they have been identified in the literature as potential risk factors of disengagement in care [26, 29, 30, 49]. A limitation of the Lasso model in randomly selecting only one of two or more highly correlated variables could explain this [71].

A major strength of this study is the use of two large, independent longitudinal datasets, which provided a robust basis for model development and validation. One of the key innovations is its ability to integrate and assess risk using multiple domains - clinical, demographic, and psychosocial factors, which strengthens the model’s applicability and relevance to the target population. The tool can stratify risk more comprehensively than models focusing on single or limited factors [23, 40–42]. This approach of including psychosocial factors enables more precise identification of PPWH who need additional support for better outcomes, as reported by Ibrahima et al. (2024) [20]. The utility of such a tool early in the care continuum may be especially beneficial in resource-limited settings, where proactive and efficient use of available resources is crucial. The Tatua study is currently testing screening with the risk calculator within a pilot intervention study in Kenya to identify women who may benefit from a psychosocial support intervention, demonstrating its practical application in ongoing research [63].

However, some limitations should be noted. The datasets we used had some participants exposed to clinical trial interventions and may have introduced some selection bias. The risk calculator model for PPWH who are newly diagnosed with HIV should be used with some caution. The risk calculator models for PPWH newly HIV diagnosed were based on smaller samples and did not perform as well as those with a known diagnosis, highlighting the need for additional predictors for refinement and potentially the inclusion of more development and validation data for this subgroup. Viral suppression thresholds in settings such as Kenya are continually changing with time and more effective ART regimens, which makes a standard viral suppression definition among independent and outcome measures challenging. Missing data in the development dataset was an issue, even though rigorous imputation methods were used to address it. Additionally, while the models have been validated in the context of southwestern Kenya, further external validation is necessary to confirm their broader applicability in different geographic and cultural settings, and sensitivity to changes in viral suppression thresholds and ART regimens (e.g. change from non-nucleoside reverse transcriptase inhibitors [NNRTIs] first line to Dolutegravir).

Future research should focus on refining and (or) updating the developed risk calculator for PPWH with new HIV diagnoses by increasing the sample size and incorporating additional relevant demographic (e.g. household income or proxy wealth index, education level, employment status, household size, travel distance to clinic), clinical (e.g. history of opportunistic infections such as Tuberculosis, pregnancy complications), and psychosocial variables (e.g. level of social support, alcohol or substance use). Prospective studies are also needed to validate both models’ effectiveness in real-world clinical settings and across diverse populations while exploring implementation strategies that are feasible and acceptable. Moreover, exploring further development using advanced machine-learning techniques could further enhance models’ predictive accuracy and improve clinical utility by reducing the number of variables and length of scales/measures required [28, 72–74].

Conclusions

In conclusion, this study developed and validated a novel risk calculator for PPWH, which represents an advancement in the ability to predict treatment failure and disengagement from care in this population. The risk calculator for PPWH who have a known HIV diagnosis performed particularly well. This study’s innovative, multidimensional approach to identifying risk of suboptimal outcomes offers practical benefits for both clinical care and research, providing a valuable tool for early identification and intervention. Continued research and external validation efforts will be essential to optimize the model and fully realize its potential to improve health outcomes for PPWH in real-world settings.

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1The Joint United Nations Programme on HIV/AIDS (UNAIDS). 2024 global AIDS report — The urgency of now: AIDS at a crossroads. 2024; Available from: http://www.wipo.int/
2UNAIDS, Start Free Stay Free AIDS. Free: 2019 report [Internet]. 2019. Available from: https://www.unaids.org/en/resources/documents/2019/20190722_UNAIDS_SFSFAF_2019
3Yee CW, Cunningham SD, Ickovics JR. Application of the social vulnerability index for identifying teen pregnancy intervention need in the united States. Matern Child Health J. 2019;44(5):898–907.10.1007/s 10995-019-02792-731228149 · doi ↗ · pubmed ↗
4Scott H, Vittinghoff E, Irvin R, Liu A, Nelson L, Del Rio C et al. Development and validation of the personalized sexual health promotion (Sex Pro) HIV risk prediction model for men who have sex with men in the united States. AIDS Behav. 2019.10.1007/s 10461-019-02616-3PMC 695495931352633 · doi ↗ · pubmed ↗
5Ibrahima D, Hallee W, Margeret M, Hari I, Gugulethu T, Amanda F et al. A risk prediction model to identify people living with HIV who are high-risk for disengagement from care after HIV diagnosis in South Africa. AIDS Behav. 2024; 28(10):3362–72.10.1007/s 10461-024-04430-y 38985402 · doi ↗ · pubmed ↗
6Meade CM, Badell M, Hackett S, Mehta CC, Haddad LB, Camacho-Gonzalez A et al. HIV care continuum among postpartum women living with HIV in Atlanta. Infect Dis Obstet Gynecol. 2019;2019.10.1155/2019/8161495 PMC 639389130894788 · doi ↗ · pubmed ↗
7Erlwanger AS, Joseph J, Gotora T, Muzunze B, Orne-Gliemann J, Mukungunugwa S et al. Patterns of HIV care clinic attendance and adherence to antiretroviral therapy among pregnant and breastfeeding women living with HIV in the context of option b + in Zimbabwe. J Acquir Immune Defic Syndr (1988). 2017;75:S 198–206.10.1097/QAI.000000000000134728498190 · doi ↗ · pubmed ↗
8Ayuo P, Musick B, Liu H, Braitstein P, Nyandiko W, Otieno-Nyunya B et al. Frequency and factors associated with adherence to and completion of combination antiretroviral therapy for prevention of mother to child transmission in Western Kenya. J Int AIDS Soc. 2013;16(1):17994.10.7448/IAS.16.1.17994 PMC 353694123336727 · doi ↗ · pubmed ↗