A Lung Ultrasound-Integrated Clinical Model for Predicting Pulmonary Arterial Hypertension in Patients with Connective Tissue Disease-Associated Interstitial Lung Disease

Xihua Lian; Shunlan Liu; Jing Bai; Ying Zhang; Jiaohong Yang; Jimin Fan; Zhixing Zhu

PMC · DOI: 10.3390/diagnostics16020203 · January 8, 2026


Open Access

TL;DR

This study develops a noninvasive model that combines lung ultrasound and clinical data to predict pulmonary arterial hypertension in patients with connective tissue disease-associated interstitial lung disease.

Contribution

A novel lung ultrasound-integrated clinical nomogram for predicting PAH in CTD-ILD patients is developed and validated.

Findings

01. The model achieved high discrimination with AUCs of 0.952 in training, 0.935 in validation, and 0.874 in testing cohorts.

02. Five independent predictors were identified: respiratory rate, DLCO%, TLUS score, RBC count, and BNP.

03. Calibration and decision curve analysis confirmed strong clinical utility and applicability across thresholds.

Abstract

Objectives: To develop and validate a transthoracic lung ultrasound (TLUS)-integrated clinical nomogram for predicting pulmonary arterial hypertension (PAH) in patients with connective tissue disease-associated interstitial lung disease (CTD-ILD). Methods: This multicenter retrospective study included 550 patients with CTD-ILD from the Second Affiliated Hospital of Fujian Medical University and 169 external cases from the Xijing Hospital, Fourth Military Medical University. Patients were randomly divided into a training cohort (n = 385) and an internal validation cohort (n = 165); the external dataset served as a testing cohort. Demographic, physiological, laboratory, pulmonary function, and TLUS data were collected. Univariate and multivariate logistic regression analyses identified independent predictors of PAH, which were used to construct a nomogram model. Discrimination was…

Figures

Figure 1. A 72-point scan method of the bilateral anterior, lateral, and posterior chest wall. The numbers represent the 16 lung regions. ICS: intercostal space.

Tables

Table 1. Modified lung ultrasound scoring system.

Ultrasound Finding	Category	Score per Region
B-line	None	0
B-line	<4	1
B-line	4–6	2
B-line	>6 or white lung	3
Pleural line	Normal	0
Pleural line	Thickened	1
Pleural line	Irregular, rough	2
Pleural line	Discontinuous, fragmented	3
Complications	None	0
Complications	Am-line	4
Complications	Pleural effusion	5

Funding

Quanzhou Science and Technology Project
Science and Technology Program of the Fujian Provincial Health Commission

Keywords

connective tissue disease-associated interstitial lung disease; pulmonary arterial hypertension; transthoracic lung ultrasound; nomogram; risk prediction model

Taxonomy

Topics: Ultrasound in Clinical Applications · Interstitial Lung Diseases and Idiopathic Pulmonary Fibrosis · Pulmonary Hypertension Research and Treatments

Full text

1. Introduction

Connective tissue diseases (CTDs) are systemic autoimmune disorders that often involve multiple organ systems, among which the lungs and pulmonary vasculature are frequently affected. Patients with CTD frequently develop interstitial lung disease (ILD), which can progress to compromise gas exchange and pulmonary vascular structure. In CTD-associated ILD (CTD-ILD), the emergence of pulmonary arterial hypertension (PAH) represents a serious complication that portends markedly worse prognosis, increased morbidity, and therapeutic challenges [1,2,3].

Although PAH is classically grouped under World Health Organization (WHO) group 1 pulmonary hypertension (PH), in CTD patients, multiple pathogenic mechanisms may coexist, including parenchymal lung disease, left-heart dysfunction, and vasculopathy [1,4]. The overlap of ILD and PAH in CTD patients often complicates diagnosis and management, because both conditions can cause dyspnea, exercise intolerance, and hypoxia [5]. Therefore, accurate identification of patients at high risk of PAH among CTD-ILD patients is clinically imperative. Structured surveillance pathways are increasingly emphasized across pulmonary vascular diseases to detect progression over time and to triage patients for definitive testing, supporting the need for practical risk-stratification tools [6].

Prior studies and screening strategies have identified several predictors associated with PAH risk in CTD populations, including impaired gas transfer (reduced DLCO), elevated BNP/NT-proBNP reflecting cardiac strain, and echocardiographic features suggestive of increased pulmonary pressure and right-heart remodeling; these parameters are frequently combined in multimodal screening approaches rather than used in isolation [7,8,9,10]. Transthoracic lung ultrasound (TLUS) has also emerged as a bedside tool for assessing interstitial involvement in CTD-ILD by quantifying B-lines and pleural-line abnormalities, which correlate with HRCT extent and may carry prognostic information in systemic sclerosis and related CTDs [11,12]. Moreover, recent cardiopulmonary ultrasound studies suggest that TLUS may reveal subclinical pulmonary congestion patterns even in PAH/right-heart failure states, supporting its potential complementary value alongside functional and biomarker-based markers when constructing practical risk tools [13]. However, many existing studies are limited by single-center designs, small sample sizes, or a lack of external validation, and few have constructed integrated risk models specifically for CTD-ILD populations.

In this study, we aimed to develop and validate a lung ultrasound-integrated clinical nomogram to predict PAH in patients with CTD-ILD. Using training, internal validation, and external testing cohorts, we compared baseline characteristics, conducted univariate and multivariate logistic regression analyses, evaluated multicollinearity, and constructed a nomogram incorporating key predictors. This lung ultrasound-enhanced nomogram is intended as a practical tool for identification and individualized risk stratification of PAH in CTD-ILD patients.

2. Materials and Methods

2.1. Study Design and Participants

This multicenter retrospective observational study was conducted at the Department of Pulmonary and Critical Care Medicine and the Department of Ultrasound Medicine, Fujian Medical University Second Affiliated Hospital (Quanzhou, China), and at Xijing Hospital, the First Affiliated Hospital of Air Force Medical University (formerly Fourth Military Medical University; Xi’an, China). The study protocol was reviewed and approved by the Ethics Committees of both Fujian Medical University Second Affiliated Hospital (approval Nos. 2018-24 and 2025-284) and Xijing Hospital, Fourth Military Medical University (approval No. KY20242041-C-1).

Inclusion criteria were as follows: (1) Age ≥ 18 years; (2) Diagnosis of CTD according to the 2010 ACR/EULAR classification criteria, including rheumatoid arthritis, systemic sclerosis, systemic lupus erythematosus, primary Sjögren’s syndrome, and polymyositis/dermatomyositis [14,15,16,17,18]; (3) Diagnosis of ILD based on the imaging criteria of the 2018 ATS/ERS/JRS/ALAT guidelines, using high-resolution computed tomography (HRCT) [19]; (4) Completion of transthoracic lung ultrasound (TLUS), HRCT, pulmonary function tests (PFTs), and biomarker measurements within 14 days of the PAH reference test.

Exclusion criteria of the participants were as follows: (1) Presence of lung cancer, active pulmonary tuberculosis, or severe infection; (2) History of lung resection, heart failure, or other underlying conditions that may interfere with ultrasound image interpretation; (3) Poor image quality or incomplete pulmonary ultrasound scanning; (4) Inability or refusal to cooperate with the examination.

2.2. Definition of Pulmonary Arterial Hypertension (PAH)

The diagnosis of PAH followed the 2022 European Society of Cardiology (ESC)/European Respiratory Society (ERS) Guidelines [1]. PAH was defined as a mean pulmonary arterial pressure (mPAP) > 20 mmHg, pulmonary artery wedge pressure ≤ 15 mmHg, and pulmonary vascular resistance ≥ 2 Wood units measured by right heart catheterization (RHC). When RHC was available, PAH status was determined primarily by hemodynamic criteria from RHC. When RHC was not performed, PAH was adjudicated using echocardiographic pulmonary artery systolic pressure > 40 mmHg together with prespecified supportive clinical and/or imaging evidence (right heart enlargement/dysfunction on echocardiography and/or CT signs suggestive of pulmonary arterial hypertension) [20]. Outcome adjudication was performed by experienced clinicians based on the complete clinical record. Importantly, biomarkers included as candidate predictors were not used to define PAH status to minimize incorporation bias. Patients were classified into PAH and non-PAH groups accordingly.

2.3. Data Collection and Variables

All demographic, clinical, laboratory, pulmonary function, and imaging data were extracted from the hospital electronic medical record and ultrasound archiving system. The following variables were analyzed:

(1) Demographics: age, gender, height, and weight.
(2) Physiologic measures: respiratory rate, C-reactive protein (CRP), oxygenation index (PaO_2_/FiO_2_), and blood pH.
(3) Lung ultrasound parameters: transthoracic lung ultrasound (TLUS) was performed using GE Voluson E10, Mindray Resona R9, and Mindray Resona R8 ultrasound machines, employing convex (2–8 MHz) and linear array (6–18 MHz) probes for image acquisition. The TLUS score was derived using a standardized 72-point lung ultrasound protocol [12] (Figure 1), covering 16 predefined regions across both lungs (8 per lung). Each region was graded on a 0–11 point scale according to B-line burden, pleural line morphology, and the presence of additional findings/complications (Am-lines and pleural effusion). The total TLUS score was calculated as the sum of all regional scores, yielding a native range of 0–176 points (Table 1).
(4) Pulmonary function tests: diffusing capacity of the lung for carbon monoxide (DLCO% predicted) and forced expiratory volume in one second (FEV_1_% predicted).
(5) Laboratory indicators: white blood cell (WBC), red blood cell (RBC), and platelet (PLT) counts; D-dimer; and brain natriuretic peptide (BNP).
(6) Disease classification: CTD subtypes including polymyositis, primary Sjögren’s syndrome, rheumatoid arthritis, systemic lupus erythematosus, systemic sclerosis, and mixed connective tissue disease.

Because the TLUS score ranged from 0 to 176, we rescaled it by dividing by 5 prior to model fitting to improve numerical stability and interpretability. Accordingly, a 1-unit increase in the rescaled TLUS corresponds to a 5-point increase in the original score. This linear transformation does not affect correlations among predictors; therefore, collinearity diagnostics (VIF and tolerance) remain unchanged. Participants with >20% missing values were excluded, and complete-case analysis was performed for all retained variables.
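The scoring and rescaling described above can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' software: the dictionary keys and function names are hypothetical, and it assumes that each region receives exactly one B-line grade, one pleural-line grade, and at most one complication score, which is consistent with the stated 0–11 range per region.

```python
# Point values follow Table 1; the category labels are hypothetical keys.
B_LINE_SCORES = {"none": 0, "<4": 1, "4-6": 2, ">6_or_white_lung": 3}
PLEURAL_LINE_SCORES = {"normal": 0, "thickened": 1, "irregular": 2, "fragmented": 3}
# Assumption: one complication category per region (none, Am-line, or effusion).
COMPLICATION_SCORES = {"none": 0, "am_line": 4, "pleural_effusion": 5}

N_REGIONS = 16        # 8 regions per lung
RESCALE_DIVISOR = 5   # rescaling applied before model fitting


def region_score(b_line, pleural_line, complication):
    """Score one region on the 0-11 scale."""
    return (
        B_LINE_SCORES[b_line]
        + PLEURAL_LINE_SCORES[pleural_line]
        + COMPLICATION_SCORES[complication]
    )


def tlus_scores(regions):
    """Return (native score on 0-176, rescaled score used for regression)."""
    if len(regions) != N_REGIONS:
        raise ValueError(f"expected {N_REGIONS} regions, got {len(regions)}")
    native = sum(region_score(*region) for region in regions)
    return native, native / RESCALE_DIVISOR


# Upper bound check: every region at the maximum grade in each category.
worst_case = [(">6_or_white_lung", "fragmented", "pleural_effusion")] * N_REGIONS
native_max, rescaled_max = tlus_scores(worst_case)
print(native_max, rescaled_max)
```

The worst-case input reproduces the stated native maximum of 176 points (16 regions × 11 points).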

Figure 1. A 72-point scan method of the bilateral anterior, lateral, and posterior chest wall. The numbers represent the 16 lung regions. ICS: intercostal space.

2.4. Statistical Analysis

All statistical analyses were performed using SPSS version 26.0 (IBM Corp., Armonk, NY, USA) and R software version 4.3.2 (R Foundation for Statistical Computing, Vienna, Austria). Data preprocessing was conducted in Microsoft Excel (version 2019). The Shapiro–Wilk test and histogram inspection were used to assess the normality of continuous variables, and Levene’s test evaluated homogeneity of variances. Normally distributed variables were expressed as mean ± standard deviation ($\bar{x}$ ± s) and compared using the independent-samples t test, while non-normally distributed variables were expressed as median (interquartile range, P25–P75) and compared using the Mann–Whitney U test. Categorical variables were presented as n (%) and compared using the χ^2^ test or Fisher’s exact test. The intraclass correlation coefficient (ICC) was used to analyze intra- and inter-operator reliability for lung ultrasound score calculation.

Candidate predictors were prespecified based on clinical relevance and prior literature, including demographics (age, gender, height, weight), physiological measures (respiratory rate, oxygenation index [PaO_2_/FiO_2_], pH), pulmonary function (DLCO% predicted, FEV_1_% predicted), laboratory indicators (WBC, RBC, PLT, D-dimer, BNP), lung ultrasound (TLUS score), and CTD subtype. To identify predictors of pulmonary arterial hypertension (PAH), univariate logistic regression was first performed to describe unadjusted associations; however, univariate p-values were not used as the sole criterion for predictor inclusion. The primary prediction model was developed in the training cohort using multivariable logistic regression, and results were reported as odds ratios (ORs) with 95% confidence intervals (CIs). Multicollinearity was evaluated using the variance inflation factor (VIF) and tolerance; VIF > 5 or tolerance < 0.2 indicated collinearity.

Independent predictors were incorporated into a nomogram model constructed in R. Discrimination was assessed using receiver operating characteristic (ROC) curves and area under the curve (AUC) values in the training, validation, and testing cohorts. The DeLong test was used to compare AUCs across cohorts. Diagnostic performance was further evaluated using the Youden index, sensitivity, specificity, positive/negative predictive values (PPV, NPV), and positive/negative likelihood ratios (LR^+^, LR^−^). Calibration was assessed using the Brier score and calibration plots, and numeric calibration-in-the-large (CITL; calibration intercept) and calibration slope with 95% confidence intervals were reported for all cohorts.

To quantify overfitting and model optimism, bootstrap internal validation (B = 1000 resamples) was performed in the training cohort to obtain optimism-corrected estimates of AUC, calibration intercept, and calibration slope.

Predictor-selection robustness beyond univariate screening was assessed using penalized logistic regression with the least absolute shrinkage and selection operator (LASSO) across the prespecified candidate set. Predictors were standardized prior to penalization, the penalty parameter was selected by cross-validation, and predictors with non-zero coefficients were recorded as the LASSO-selected set; discrimination and calibration were compared with the primary model.

Decision curve analysis (DCA) and clinical impact curves (CIC) were used as supplementary analyses to illustrate potential clinical utility across threshold probabilities. A two-tailed p value < 0.05 was considered statistically significant.

3. Results

3.1. Patient Inclusion of the Study

A total of 622 consecutive CTD-ILD patients were screened at Fujian Medical University Second Affiliated Hospital between January 2018 and December 2024. After applying the predefined exclusion criteria, 72 patients were excluded: 35 with other concomitant pulmonary diseases (such as COPD, bronchiectasis, or active pulmonary infection), 12 with a history of lung surgery, 10 with cardiac dysfunction that could interfere with ultrasound or pulmonary function testing, 8 who were unable to cooperate or had unsatisfactory lung ultrasound image quality, and 7 who were lost to follow-up. Consequently, 550 patients were included in the analysis. These patients were randomly allocated to a training cohort (n = 385, 70%) and an internal validation cohort (n = 165, 30%) to establish and internally validate the predictive model.
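A 70/30 random allocation of this kind can be sketched as follows. The text does not state the seed, the software routine, or whether the split was stratified by PAH status, so the seed value and the simple unstratified shuffle below are assumptions made purely for illustration.

```python
import random


def split_cohort(patient_ids, seed=2024):
    """Shuffle patient IDs and split them 70/30 (hypothetical seed)."""
    ids = list(patient_ids)
    rng = random.Random(seed)   # local generator, so the split is reproducible
    rng.shuffle(ids)
    n_train = len(ids) * 7 // 10   # integer arithmetic avoids float rounding
    return ids[:n_train], ids[n_train:]


# 550 included patients, represented here by placeholder integer IDs.
train_ids, valid_ids = split_cohort(range(550))
print(len(train_ids), len(valid_ids))
```

With 550 patients the split sizes match the reported cohorts (385 and 165).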

Additionally, 198 CTD-ILD patients were screened at Xijing Hospital, Fourth Military Medical University during the same period. After excluding 10 patients with other pulmonary diseases, 5 with prior lung surgery, 4 with cardiac dysfunction that could interfere with ultrasound or pulmonary function testing, 5 with inadequate ultrasound imaging or poor cooperation, and 5 who were lost to follow-up, 169 cases were included as an external test cohort for independent validation of the model (Figure 2).

3.2. Reproducibility of TLUS Scoring

TLUS scoring demonstrated excellent reproducibility. The intra-operator agreement was high (ICC = 0.9694, 95% CI 0.9494–0.9816), and the inter-operator agreement was also excellent (ICC = 0.9182, 95% CI 0.8667–0.9503).

3.3. Baseline Comparability of the Training and Validation Cohorts

No statistically significant differences were observed between the training cohort (n = 385) and the validation cohort (n = 165) across all baseline characteristics (all p ≥ 0.05), indicating good comparability between the two groups (Table 2). The prevalence of PAH was similar (19.7% vs. 17.0%). Demographic parameters (age, gender, height, and weight), physiologic measures (respiratory rate, CRP, TLUS score, oxygenation index [PaO_2_/FiO_2_], and pH), hematologic and biochemical indicators (WBC, RBC, PLT counts, D-dimer, and BNP), and pulmonary function indices (DLCO% predicted and FEV_1_% predicted) were all comparable. The distribution of CTD subtypes—including polymyositis, primary Sjögren’s syndrome, rheumatoid arthritis, systemic lupus erythematosus, systemic sclerosis, and mixed connective tissue disease—did not differ significantly between cohorts (p = 0.275). The proportions of patients undergoing RHC and the breakdown of PAH adjudication by RHC versus echocardiography across cohorts are summarized in Supplementary Material S1 Table S1.

3.4. Analysis of Independent Risk Factors for PAH in Patients with CTD-ILD and Construction of the Nomogram Model

3.4.1. Univariate Logistic Regression Analysis

Univariate logistic regression analysis identified seven variables that were significantly associated with the presence of PAH in patients with CTD-ILD (p < 0.05) (Table 3). Respiratory rate (OR = 1.24, 95% CI 1.15–1.33, p < 0.001), TLUS score (OR = 1.28, 95% CI 1.21–1.36, p < 0.001), RBC count (OR = 3.59, 95% CI 2.29–5.61, p < 0.001), and BNP (OR = 1.03, 95% CI 1.03–1.04, p < 0.001) were positively associated with PAH, whereas FEV_1_% predicted (OR = 0.96, 95% CI 0.95–0.98, p < 0.001), DLCO% predicted (OR = 0.89, 95% CI 0.87–0.91, p < 0.001), and oxygenation index (PaO_2_/FiO_2_) (OR = 0.99, 95% CI 0.99–0.99, p < 0.001) were inversely associated, meaning that lower values corresponded to higher odds of PAH. Other baseline variables, including demographic, inflammatory, and hematologic parameters, showed no significant associations (all p > 0.05).

3.4.2. Multivariable Logistic Regression Analysis

Multivariate logistic regression identified five independent predictors of PAH in patients with CTD-ILD (Table 4). Higher respiratory rate, TLUS score, RBC count, and BNP were positively associated with PAH, while DLCO% predicted remained strongly and inversely associated with PAH (all p < 0.001). In contrast, FEV_1_% predicted and oxygenation index showed no significant associations after adjustment (p > 0.05). Because dividing the TLUS score by 5 was a simple linear rescaling, it did not change correlations among predictors; therefore, collinearity diagnostics were invariant to this transformation.

3.4.3. Multicollinearity Assessment

To evaluate potential collinearity among variables included in the multivariable model, variance inflation factors (VIFs) were calculated. All predictors demonstrated low VIF values, with the highest observed for the oxygenation index (VIF = 1.708), followed by TLUS score (VIF = 1.349) and BNP (VIF = 1.269). Other variables, including DLCO% predicted, respiratory rate, and red blood cell count, all showed VIF values close to 1.0. Since none of the VIFs exceeded the conventional threshold of 5, no significant multicollinearity was detected, indicating that the coefficient estimates were unlikely to be destabilized by correlated predictors (Table 5).
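The relationship between the reported VIFs and the tolerance criterion from the Methods can be illustrated with a short sketch. It relies only on the standard identities VIF = 1/(1 − R²) and tolerance = 1/VIF, where R² comes from regressing one predictor on the others; the dictionary keys and function names are illustrative, and only the three VIF values stated numerically in the text are included.

```python
# VIF values as reported in the text; keys are illustrative labels.
REPORTED_VIF = {
    "oxygenation_index": 1.708,
    "tlus_score": 1.349,
    "bnp": 1.269,
}

VIF_THRESHOLD = 5.0        # collinearity flagged when VIF > 5
TOLERANCE_THRESHOLD = 0.2  # collinearity flagged when tolerance < 0.2


def tolerance(vif):
    """Tolerance is the reciprocal of the VIF (equal to 1 - R^2)."""
    return 1.0 / vif


def implied_r_squared(vif):
    """R^2 of a predictor regressed on the other predictors."""
    return 1.0 - 1.0 / vif


def is_collinear(vif):
    """Apply both criteria; they coincide because 1/5 = 0.2."""
    return vif > VIF_THRESHOLD or tolerance(vif) < TOLERANCE_THRESHOLD


flags = {name: is_collinear(vif) for name, vif in REPORTED_VIF.items()}
for name, vif in REPORTED_VIF.items():
    print(name, round(tolerance(vif), 3), round(implied_r_squared(vif), 3), flags[name])
```

For the largest reported VIF (1.708), the implied R² is about 0.41 and the tolerance about 0.59, well inside both thresholds.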

3.4.4. Sensitivity Analyses for Predictor Selection

To evaluate the robustness of predictor selection beyond the primary multivariable modeling strategy, we performed a sensitivity analysis using penalized logistic regression with LASSO across the prespecified candidate predictor set. The penalty parameter was selected via cross-validation, and the cross-validated performance profile and coefficient trajectories are presented in Supplementary Material S2 Figures S1 and S2.

At the selected penalty, 13 predictors retained non-zero coefficients, including CRP, BNP, TLUS score, DLCO% predicted, CTD subtype indicator, WBC, respiratory rate, oxygenation index (PaO_2_/FiO_2_), RBC count, D-dimer, blood pH, height, and age. Importantly, the five predictors in the primary model (respiratory rate, DLCO% predicted, TLUS score, RBC count, and BNP) were consistently retained by LASSO, supporting these variables as stable signals (Supplementary Material S2 Figure S2 and Supplementary Material S1 Table S2). Model performance comparisons among the primary five-predictor model, the prespecified clinical model (adding age and gender), and the cross-validated LASSO model across the training, validation, and testing cohorts are summarized in Supplementary Material S1 Table S3. Although LASSO suggested additional predictors, we retained the five-predictor model for the nomogram to preserve parsimony and clinical interpretability, and because the simpler specification showed stable overall performance and calibration when applied to the testing cohort (Supplementary Material S1 Table S3).

3.5. Nomogram Model and Predictive Formula

A nomogram model was developed based on the independent predictors identified in multivariable logistic regression analysis, including respiratory rate, DLCO% predicted, TLUS score, RBC count, and BNP. Each variable was assigned a score proportional to its regression coefficient, and the sum of individual scores corresponded to the predicted probability of PAH. The graphical representation of the nomogram is shown in Figure 3, demonstrating the relative contribution of each predictor to overall risk estimation. The TLUS score was recorded on its native scale (0–176). For regression modeling, we defined the TLUS score as TLUS native/5 to improve numerical stability and interpretability.

The final logistic regression formula for predicting PAH in CTD-ILD patients was:

[eqn]

The predicted probability of PAH is then calculated as:

$$P(\mathrm{PAH}) = \frac{1}{1 + e^{-\mathrm{logit}(P_{\mathrm{PAH}})}}$$

In this model, higher respiratory rate, TLUS score, red blood cell count, and BNP were associated with an increased risk of PAH, whereas higher DLCO% predicted was protective. This nomogram-based formula provides an individualized, quantitative risk assessment tool for estimating the probability of PAH among patients with CTD-ILD.

Worked example. For an example patient with respiratory rate = 23 breaths/min, DLCO% predicted = 56%, TLUS native = 51 (thus TLUS score = 10.2), RBC = 4.83 ×10^12^/L, and BNP = 69.72 pg/mL, the calculated logit (P_PAH_) was −0.535 and the predicted probability was P (PAH) ≈ 0.369 (Figure 3). To facilitate bedside use, we provide an Excel-based calculator as a supplementary file (Supplementary Material S3).
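The final step of the worked example, converting the logit to a probability, can be checked with the standard inverse-logit transform. The sketch below starts from the reported logit rather than from the regression coefficients, which are not reproduced in this text; the function name is illustrative.

```python
import math


def logit_to_probability(logit):
    """Inverse-logit (logistic) transform: p = 1 / (1 + exp(-logit))."""
    return 1.0 / (1.0 + math.exp(-logit))


# TLUS rescaling used in the worked example: native score divided by 5.
tlus_native = 51
tlus_score = tlus_native / 5

# Logit reported for the example patient.
p_pah = logit_to_probability(-0.535)
print(tlus_score, round(p_pah, 3))
```

A logit of −0.535 maps to a probability of about 0.369, matching the value given in the worked example.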

3.6. Evaluation of Model Performance

3.6.1. Accuracy and Discrimination

The occurrence of PAH was defined as the outcome variable, coded as 1 for patients diagnosed with PAH and 0 for those without PAH. The discriminatory performance of the nomogram for predicting PAH in patients with CTD-ILD was evaluated using receiver operating characteristic (ROC) curve analysis across the training, internal validation, and external testing cohorts.

The model demonstrated excellent discriminative ability, with areas under the curve (AUCs) of 0.952 (95% CI: 0.927–0.977) in the training cohort, 0.935 (95% CI: 0.885–0.985) in the validation cohort, and 0.874 (95% CI: 0.806–0.942) in the external testing cohort. These results indicate that the integrated nomogram discriminated well in all three datasets, with a lower point estimate in the external testing cohort (Figure 4 and Table 6).

Using Youden index-derived probability thresholds (0.144, 0.140, and 0.176 in the training, validation, and testing cohorts, respectively), the model achieved sensitivities of 0.908, 0.857, and 0.756 and specificities of 0.854, 0.861, and 0.859. Corresponding confusion-matrix counts (TP/TN/FN/FP) were 69/264/7/45 in the training cohort, 23/119/4/19 in the validation cohort, and 30/111/10/18 in the testing cohort. Positive likelihood ratios ranged from 5.36 to 6.23 and negative likelihood ratios from 0.11 to 0.28; PPVs were 0.605, 0.548, and 0.625, while NPVs were 0.974, 0.967, and 0.917 across the three cohorts (Table 6).
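The threshold-based metrics follow directly from the confusion-matrix counts. As an illustration, the sketch below recomputes them for the training cohort from the reported TP/TN/FN/FP values; the function and key names are illustrative rather than taken from the authors' code.

```python
def diagnostic_metrics(tp, tn, fn, fp):
    """Standard 2x2 diagnostic accuracy metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "youden_index": sensitivity + specificity - 1.0,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "lr_positive": sensitivity / (1.0 - specificity),
        "lr_negative": (1.0 - sensitivity) / specificity,
    }


# Training cohort counts as reported: TP/TN/FN/FP = 69/264/7/45 (n = 385).
training = diagnostic_metrics(tp=69, tn=264, fn=7, fp=45)
for name, value in training.items():
    print(f"{name}: {value:.3f}")
```

For the training cohort this yields a sensitivity of 0.908, specificity of 0.854, PPV of 0.605, NPV of 0.974, LR^+^ of 6.23, and LR^−^ of 0.11, in agreement with the reported values.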

Furthermore, DeLong’s test showed no statistically significant differences in AUCs between the training and validation cohorts (Z = 0.616, p = 0.538), between the validation and testing cohorts (Z = 1.409, p = 0.160), or between the training and testing cohorts (Z = 1.620, p = 0.205). These findings suggest that discrimination did not differ detectably across datasets, although the point estimate was lower in the external testing cohort.

3.6.2. Calibration Performance

The calibration of the nomogram model was evaluated using both the Brier score and calibration plots in the training, validation, and test datasets. The Brier scores were 0.060 for the training cohort, 0.063 for the internal validation cohort, and 0.107 for the external test cohort, all of which were well below the conventional threshold of 0.25, indicating good overall prediction accuracy and agreement between predicted and observed probabilities. In addition to graphical assessment, numeric calibration measures were provided for all cohorts: the CITL (intercept) was 0.000 (95% CI −0.414 to 0.414) in the training cohort, 0.023 (95% CI −0.587 to 0.634) in the internal validation cohort, and 0.793 (95% CI 0.242 to 1.344) in the external testing cohort; the corresponding calibration slopes were 1.000 (95% CI 0.754 to 1.246), 0.975 (95% CI 0.606 to 1.343), and 0.595 (95% CI 0.396 to 0.795), respectively.
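For orientation, the Brier score is the mean squared difference between predicted probabilities and observed binary outcomes, and the 0.25 reference value is what a constant prediction of 0.5 produces regardless of prevalence. The sketch below illustrates both points; the small outcome vector and predictions are hypothetical, while the prevalence figure uses the training-cohort counts (76 PAH cases among 385 patients, i.e., TP + FN from Section 3.6.1).

```python
def brier_score(probabilities, outcomes):
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    if len(probabilities) != len(outcomes):
        raise ValueError("probabilities and outcomes must have equal length")
    squared_errors = [(p - y) ** 2 for p, y in zip(probabilities, outcomes)]
    return sum(squared_errors) / len(outcomes)


def constant_prediction_brier(constant, prevalence):
    """Expected Brier score when every patient receives the same prediction."""
    return prevalence * (1.0 - constant) ** 2 + (1.0 - prevalence) * constant ** 2


# Hypothetical outcomes and predictions, for illustration only.
outcomes = [1, 0, 0, 1, 0]
example_brier = brier_score([0.9, 0.1, 0.2, 0.7, 0.05], outcomes)

# Always predicting 0.5 gives 0.25 whatever the outcomes are.
coin_flip_brier = brier_score([0.5] * len(outcomes), outcomes)

# Always predicting the training-cohort prevalence (76/385).
prevalence = 76 / 385
prevalence_only_brier = constant_prediction_brier(prevalence, prevalence)
print(round(example_brier, 4), coin_flip_brier, round(prevalence_only_brier, 3))
```

A prevalence-only prediction in the training cohort would score about 0.158, which gives a second, stricter point of comparison for the reported Brier scores of 0.060 to 0.107.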

The calibration plots demonstrated that the predicted probabilities of PAH generally aligned with the actual observed outcomes across all three datasets (Figure 5). In the training cohort, the calibration curve closely overlapped the 45-degree reference line, suggesting minimal deviation and strong internal consistency. The internal validation cohort demonstrated a similar pattern. In the external test cohort, the calibration curve remained acceptable overall but showed modest deviation at higher predicted probabilities, consistent with the numeric calibration results (positive CITL and a slope < 1), indicating some degree of miscalibration when the model was transported to an independent cohort.

3.6.3. Bootstrap Internal Validation

To quantify potential overfitting and estimate model optimism, we performed bootstrap internal validation in the training cohort using 1000 resamples. The optimism-corrected AUC was 0.948, and the optimism-corrected calibration intercept and slope were −0.057 and 0.946, respectively (Supplementary Material S1 Table S4). These findings indicate limited optimism and provide a quantitative assessment of model overfitting.

3.6.4. Decision Curve Analysis

As a supplementary analysis, DCA was performed to explore the potential net benefit of using the nomogram across a range of threshold probabilities in the training, internal validation, and external testing cohorts. As shown in Figure 6, the nomogram exhibited a consistently higher net benefit than the “treat-all” or “treat-none” strategies within a wide range of clinically relevant threshold probabilities (approximately 0.1–0.8). The “All” curve represents the hypothetical scenario in which all CTD-ILD patients are assumed to develop PAH and therefore receive diagnostic evaluation or intervention, whereas the “None” curve assumes that no patient develops PAH and no intervention is performed.
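Net benefit at a threshold probability p_t is conventionally computed as TP/n − (FP/n) × p_t/(1 − p_t), with the "treat-none" strategy fixed at zero. The sketch below applies this standard formula at a single threshold, using the training-cohort counts and Youden threshold (0.144) reported in Section 3.6.1; it illustrates the calculation for one point on the curve and is not a reproduction of the full decision curves in Figure 6. Function names are illustrative.

```python
def net_benefit(tp, fp, n, threshold):
    """Net benefit of a decision rule at a given threshold probability."""
    weight = threshold / (1.0 - threshold)   # harm-to-benefit weighting
    return tp / n - (fp / n) * weight


def net_benefit_treat_all(n_events, n, threshold):
    """Treat-all: every event is a true positive, every non-event a false positive."""
    return net_benefit(tp=n_events, fp=n - n_events, n=n, threshold=threshold)


# Training cohort: TP = 69, FP = 45, n = 385, with 76 PAH cases (TP + FN).
threshold = 0.144
model_nb = net_benefit(tp=69, fp=45, n=385, threshold=threshold)
treat_all_nb = net_benefit_treat_all(n_events=76, n=385, threshold=threshold)
treat_none_nb = 0.0
print(round(model_nb, 4), round(treat_all_nb, 4), treat_none_nb)
```

At this threshold the model's net benefit (about 0.160) exceeds that of treating all patients (about 0.062) and of treating none (0).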

In the training cohort, the net benefit curve of the model remained clearly above both reference lines, suggesting potential clinical usefulness across a range of thresholds. A similar trend was observed in the internal validation cohort, confirming the model’s robustness and reproducibility. When applied to the external testing cohort, the model maintained the greatest net clinical benefit across most threshold probabilities, suggesting generalizability and clinical applicability for individualized PAH risk prediction.

Collectively, these findings demonstrate that the proposed nomogram can effectively assist clinical decision-making by identifying CTD-ILD patients at higher risk of PAH who may benefit from further diagnostic evaluation and timely management.

3.6.5. Clinical Impact Curve Analysis

Clinical impact curve analysis (CIC) was used to further evaluate the clinical utility and predictive reliability of the nomogram model across the training, internal validation, and external test cohorts. In all three datasets, the red curve representing the number of individuals classified as high risk by the model and the black curve representing the actual number of true positive cases were closely aligned when the threshold probability exceeded 60%. These results imply that the nomogram performs well in differentiating genuine high-risk patients at elevated probability thresholds, underscoring its potential value for clinical decision-making and risk stratification (Figure 7).

In the training cohort, the CIC demonstrated that the predicted number of high-risk individuals closely matched the observed number of PAH cases, indicating excellent internal consistency and minimal overestimation of risk. The internal validation cohort exhibited a similarly favorable pattern, with the true-positive curve largely overlapping the predicted high-risk curve, confirming stable model performance in unseen data. In the external test cohort, the curves maintained close proximity across clinically relevant thresholds, suggesting that the model achieved good external generalizability and practical applicability for identifying patients at high risk of PAH.

Together, these findings highlight that the nomogram model provides high clinical benefit and reliable risk stratification, supporting its potential value as a decision-support tool in managing CTD-ILD patients.

4. Discussion

This study developed and validated a practical nomogram model for predicting pulmonary arterial hypertension (PAH) in patients with connective tissue disease-associated interstitial lung disease (CTD-ILD). Using a training cohort, an internal validation cohort, and an external testing cohort, we identified five predictors (respiratory rate, DLCO% predicted, TLUS score, RBC count, and BNP) that were independently associated with PAH. The model demonstrated strong discriminative ability, with AUCs of 0.952, 0.935, and 0.874 in the training, validation, and testing cohorts, respectively. Calibration analysis showed good agreement between predicted and observed probabilities in the training and internal validation cohorts, with some miscalibration in the external testing cohort (CITL 0.793, calibration slope 0.595), and both decision curve analysis (DCA) and clinical impact curves (CIC) suggested potential clinical usefulness of the model across threshold probabilities. Collectively, these results indicate that the proposed nomogram can assist in the individualized risk assessment of PAH among CTD-ILD patients.

Several of the identified predictors are biologically and clinically consistent with the known pathophysiology of CTD-ILD-related pulmonary vascular disease. DLCO% predicted emerged as one of the strongest inverse predictors of PAH (lower values corresponding to higher risk), consistent with previous studies showing that impaired diffusing capacity reflects early pulmonary vascular remodeling and reduced alveolar–capillary surface area [21,22]. Declining DLCO% predicted often precedes overt pulmonary arterial hypertension and can serve as a sensitive functional marker for early identification and risk stratification [22]. Elevated BNP levels were also strongly associated with PAH, reflecting increased right ventricular wall stress and myocardial strain secondary to elevated pulmonary artery pressure. This finding aligns with prior research demonstrating BNP as a reliable biomarker for right heart dysfunction in systemic sclerosis and other connective tissue diseases [23]. Notably, sex-related differences in PAH phenotypes, including BNP behavior and right-heart indices, have been reported, which may influence biomarker interpretation across heterogeneous cohorts [24]. Additionally, a higher respiratory rate was independently related to PAH, possibly reflecting a compensatory mechanism for impaired gas exchange and reduced pulmonary perfusion [25]. Elevated RBC counts may reflect chronic hypoxemia-driven compensatory erythrocytosis in CTD-ILD, which can increase blood viscosity, raise pulmonary vascular resistance, and thereby potentially contribute to a higher pulmonary vascular burden and increased risk of PAH [26].

A key finding of this study is that the TLUS score, which quantifies pleural line irregularities and B-line burden, was independently associated with PAH risk. This finding is biologically plausible and aligns with accumulating evidence that TLUS captures the extent of interstitial involvement and correlates with structural disease on HRCT and with gas-exchange impairment [27]. Mechanistically, higher TLUS scores likely reflect more extensive interstitial fibrosis and reduced lung compliance, which increase ventilatory drive and contribute to elevated pulmonary vascular resistance through hypoxic pulmonary vasoconstriction and vascular remodeling [12,28]. Consistent with this pathway, DLCO reduction, an integrated readout of alveolar–capillary membrane integrity and pulmonary vascular involvement, tracks with PAH severity in CTD-ILD cohorts [29]. Taken together, our results support TLUS as a complementary, bedside marker of disease burden that adds clinically meaningful information when combined with functional and biomarker measures.

From a clinical perspective, our model is intended as a CTD-ILD-specific risk stratification tool rather than a universal predictor for all CTD phenotypes. Pulmonary hypertension can occur in CTD even in the absence of ILD; because lung ultrasound primarily reflects interstitial lung involvement, extrapolation to CTD patients without ILD should be avoided. In addition, the nomogram is not designed to replace echocardiography or right heart catheterization, but to help identify patients who may benefit from closer cardiopulmonary evaluation, surveillance, or definitive testing when clinically indicated [30,31]. Overall, the present study provides a CTD-ILD–focused, noninvasive prediction model that integrates functional (DLCO, respiratory rate), imaging (TLUS), and laboratory (BNP, RBC) measures to capture both parenchymal and vascular involvement. In routine practice, the nomogram can support individualized risk estimation and help triage patients for further evaluation or closer monitoring, with consistent performance observed across internal validation and external testing cohorts.

To enhance transparency, we conducted additional robustness analyses, including LASSO with cross-validation and bootstrap internal validation (B = 1000), and reported calibration-in-the-large and calibration slope (with 95% CIs) alongside Brier scores and calibration plots. In the external testing cohort, the calibration slope was < 1, suggesting that risk predictions were slightly too extreme (overestimated at the high end and underestimated at the low end); this pattern is commonly seen with differences in case-mix or measurement characteristics and may also indicate mild overfitting. Therefore, simple recalibration of the intercept (and, if needed, the slope) may improve agreement when applying the model in new centers without changing the predictor set.
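The recalibration idea mentioned above can be illustrated with a small sketch. It assumes the common logistic recalibration approach, in which the observed outcome is regressed on the logit of the original predicted risk, giving an intercept a and a slope b; updated risks are then sigmoid(a + b × logit(p)). The function names, the Newton-Raphson fitting routine, and the toy data are illustrative assumptions, not the authors' code. Strictly, calibration-in-the-large is estimated with the slope fixed at 1, which this joint fit does not do.

```python
import math

def logit(p):
    return math.log(p / (1.0 - p))

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_recalibration(y_true, y_prob, n_iter=50, tol=1e-10):
    """Fit logit(P(y=1)) = a + b * logit(p_hat) by Newton-Raphson.
    Returns (a, b); b is the calibration slope."""
    x = [logit(p) for p in y_prob]
    a, b = 0.0, 1.0  # start from "no recalibration needed"
    for _ in range(n_iter):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y_true):
            mu = sigmoid(a + b * xi)
            w = mu * (1.0 - mu)
            g0 += yi - mu            # score for the intercept
            g1 += (yi - mu) * xi     # score for the slope
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        da = (h11 * g0 - h01 * g1) / det
        db = (h00 * g1 - h01 * g0) / det
        a, b = a + da, b + db
        if abs(da) + abs(db) < tol:
            break
    return a, b

def recalibrate(y_prob, a, b):
    """Apply the fitted intercept and slope to the original risks."""
    return [sigmoid(a + b * logit(p)) for p in y_prob]

# Toy "new center" data for illustration only (hypothetical, not from the study)
obs = [1, 1, 1, 0, 0, 0, 0, 1, 0, 0]
pred = [0.9, 0.8, 0.7, 0.65, 0.3, 0.2, 0.1, 0.4, 0.05, 0.6]

a_hat, b_hat = fit_recalibration(obs, pred)
updated = recalibrate(pred, a_hat, b_hat)
print(f"intercept={a_hat:.3f}, slope={b_hat:.3f}")
```

Because only two parameters are re-estimated, the predictor set and their relative weights stay unchanged, which is why this kind of update is often feasible with modest local data.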

Despite its strengths, this study has several limitations. First, the retrospective multicenter design may introduce selection bias and warrants prospective validation. Second, RHC was not available for all patients; thus, some PAH cases were adjudicated using echocardiography-based criteria with supportive findings, which may cause outcome misclassification. Third, although we used a standardized 72-point TLUS protocol with good intra-/inter-observer agreement, TLUS remains semiquantitative and operator-dependent. Finally, RBC may be influenced by sex and oxygenation status, and residual confounding cannot be fully excluded; additional biomarkers and imaging parameters may further improve future models.

5. Conclusions

In summary, this study established and validated a robust, noninvasive TLUS-integrated nomogram model to predict PAH in CTD-ILD patients by combining clinical, functional, and ultrasound parameters. The model demonstrated strong discrimination and overall good calibration across the training, internal validation, and external testing cohorts.

Bibliography


1. Humbert M., Kovacs G., Hoeper M.M., Badagliacca R., Berger R.M.F., Brida M., Carlsen J., Coats A.J.S., Escribano-Subias P., Ferrari P. 2022 ESC/ERS Guidelines for the diagnosis and treatment of pulmonary hypertension. Eur. Respir. J. 2023, 61, 2200879. doi:10.1183/13993003.00879-2022. PMID: 36028254.
2. Khangoora V., Bernstein E.J., King C.S., Shlobin O.A. Connective tissue disease-associated pulmonary hypertension: A comprehensive review. Pulm. Circ. 2023, 13, e12276. doi:10.1002/pul2.12276. PMID: 38088955. PMCID: PMC10711418.
3. Kattih Z., Kim H.C., Aryal S., Nathan S.D. Review of the Diagnosis and Management of Pulmonary Hypertension Associated with Interstitial Lung Disease (ILD-PH). J. Clin. Med. 2025, 14, 2029. doi:10.3390/jcm14062029. PMID: 40142837. PMCID: PMC11942768.
4. Rodolfi S., Ong V.H., Denton C.P. Recent developments in connective tissue disease associated pulmonary arterial hypertension. Int. J. Cardiol. Congenit. Heart Dis. 2024, 16, 100513. doi:10.1016/j.ijcchd.2024.100513. PMID: 39712533. PMCID: PMC11657338.
5. Shlobin O.A., Adir Y., Barbera J.A., Cottin V., Harari S., Jutant E.M., Pepke-Zaba J., Ghofrani H.A., Channick R. Pulmonary hypertension associated with lung diseases. Eur. Respir. J. 2024, 64, 2401200. doi:10.1183/13993003.01200-2024. PMID: 39209469. PMCID: PMC11525344.
6. Cueto-Robledo G., Roldan-Valadez E., Graniel-Palafox L.E., Garcia-Cesar M., Torres-Rojas M.B., Enriquez-Garcia R., Cueto-Romero H.D., Rivera-Sotelo N., Perez-Calatayud A.A. Chronic Thromboembolic Pulmonary Hypertension (CTEPH): A Review of Another Sequel of Severe Post-Covid-19 Pneumonia. Curr. Probl. Cardiol. 2023, 48, 101187. doi:10.1016/j.cpcardiol.2022.101187. PMID: 35346727. PMCID: PMC8956357.
7. Steen V., Medsger T.A. Jr. Predictors of isolated pulmonary hypertension in patients with systemic sclerosis and limited cutaneous involvement. Arthritis Rheum. 2003, 48, 516–522. doi:10.1002/art.10775. PMID: 12571862.
8. Young A., Nagaraja V., Basilious M., Habib M., Townsend W., Gladue H., Badesch D., Gibbs J.S.R., Gopalan D., Manes A. Update of screening and diagnostic modalities for connective tissue disease-associated pulmonary arterial hypertension. Semin. Arthritis Rheum. 2019, 48, 1059–1067. doi:10.1016/j.semarthrit.2018.10.010. PMID: 30415942. PMCID: PMC7155785.