Against All Odds: Why a Lung Donor Score Does Not Add Up

Simon Schwab; Fabian Iten

PMC · DOI:10.3389/ti.2025.14937·October 17, 2025

Against All Odds: Why a Lung Donor Score Does Not Add Up

Simon Schwab, Fabian Iten

PDF

Open Access

Abstract

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases3

DCD tumor inflammation

Keywords

lung transplantrisk factorsclinical prediction modellung donor scorestatistical modeling

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOrgan Transplantation Techniques and Outcomes · Transplantation: Methods and Outcomes · Organ Donation and Transplantation

Full text

Dear Editors,

The Eurotransplant Lung Donor Score (LDS) [1] is a prediction model to assess the risk of donor lung discard due to medical reasons. A logistic regression model was fitted using data from over 5,000 donor lungs, and the resulting odds ratios were translated into a risk score ranging from 6 to 19 points, with higher scores indicating poorer donor lung quality. The LDS was found to be significantly associated with 1-year survival. The authors concluded that the LDS accurately reflects the likelihood of organ acceptance and is predictive of patient mortality. Furthermore, its implementation at the time of donor reporting may support more informed donor risk assessment and improve recipient selection. The LDS is used to study the quality of transplanted lungs in Eurotransplant [2], but it’s unclear to what extent it is actually used in clinical practice to define an extended-criteria donor lung.

However, we believe the Eurotransplant LDS has some methodological limitations. The scoring system is presented in Table 2 of the original report. Odds ratios from the logistic regression model were rounded to the nearest whole number to construct a risk score. For example, donor age between 45 and 54 years was assigned 1 point (odds ratio 1.33), a history of smoking was assigned 2 points (odds ratio 1.53), and the presence of inflammation on donor bronchoscopy was assigned 3 points (odds ratio 2.83). Finally, individual points were summed into an overall risk score. We identified four limitations in the construction of the LDS, discussed in detail below.

First, rounding odds ratios such as 1.53, 1.87, and 2.40 all to a score of 2 points is suboptimal, as it obscures important differences in risk. These values represent varying levels of association with organ discard: a donor smoking history corresponds to a 1.5-fold increase in the odds of organ discard, purulent secretions observed on bronchoscopy to a 1.9-fold increase, and a PaO_2_/FiO_2_ ratio of 301–350 mmHg to a 2.4-fold increase. Grouping them under the same score point may discard a vast amount of information, which may harm discrimination and the predictive accuracy of the model.

Second, the construction relies on adding odds ratios, a practice that is statistically not valid. For example, consider a 20-year-old donor with purulent secretions observed on bronchoscopy. The donor is assigned the level of donor age <45, resulting in an odds ratio of 1 (reference level), indicating neither a decrease nor an increase in the odds of organ discard. Then, the odds ratio for purulent secretions is nearly 2, reflecting a 2-fold increased chance of discard. Combining these two odds ratios would result in a 1 × 2 = 2-fold increase in the odds of organ discard. If these two odds ratios were combined by summing them, it would result in a 3-fold increase in the odds of organ discard, which is incorrect. Odds ratios multiply; they do not add up. This is a basic mathematical property of ratios.

Third, caution is warranted when using binary indicators to denote the presence or absence of missing data. In the current model, nearly all donor risk factors include a category specifically representing missing values, which may introduce bias and compromise the validity of the predictions [3]. A single imputation model may be considered when the missing data are low. Advanced methods, such as multiple imputations, are not always practical when applying a model in clinical practice. Predictors with substantial missingness are generally poor candidates, as they are also likely to be missing in future patients. When only a small proportion (less than 5%) of data is missing, complete case analysis may be acceptable, though it remains important to always investigate potential reasons for missingness. Strong candidate predictors are objective measures that are consistently and widely available. In organ transplantation, in particular, key clinical information is mandatory, as an organ cannot be allocated without it. Thus, it is often possible to retrieve missing data from the medical records and databases.

Fourth, the primary outcome used in the model was organ discard. As it stands, the model predicts the probability of discard rather than clinically meaningful endpoints such as graft failure or patient death. Even if organ discard is correlated with these outcomes, the primary endpoint should ideally reflect a patient-centered outcome following transplantation.

We propose the following simple solution: we calculated the log odds ratios from Table 2 of the original publication. We then multiplied these values by a factor of 4 and rounded them to whole numbers. This preserved sufficient accuracy while aligning more closely with the original score for easier comparison. Using a multiplicative factor of 5 would also work, but it would increase the maximal score to 30. This new risk score system avoids data loss due to rounding of small numbers. Moreover, adding the values on the log scale is statistically appropriate. The new risk score ranges from 0 to 24, and is shown in Table 1. Supplementary Figure 1A shows the distribution of the original and revised scores, while Supplementary Figure 1B compares them directly. The revised score, however, cannot address the issues with how missing data were handled and with the choice of the primary outcome.

Medical statisticians warned against the flawed practice of adding up ratios decades ago [4] but the problem seems to persist. A risk score for donation after cardiac death (DCD) liver transplants combined hazard ratios in a similarly incorrect manner [5]. Summing odds ratios (or hazard ratios) not only lacks any meaningful interpretation, but may also produce misleading results. For instance, an odds ratio of 0.8 from a logistic model, which indicates a 0.8-fold risk reduction, would paradoxically increase the overall risk score if added. On the log scale, however, ratios smaller than 1 have a negative sign.

The LDS continues to be used today, for instance, a recent study used it as a benchmark to compare with their newly developed risk score [6]. Had the LDS been constructed using a statistically appropriate approach, the resulting score may have offered more accurate and evidence-based weighting of clinical donor risk factors in lung transplants. We hope our approach offers a constructive refinement of the LDS that could improve its accuracy and interpretability.

Bibliography6

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Smits J Mvan der Bij W Van Raemdonck Dde Vries E Rahmel A Laufer G Defining an Extended Criteria Donor Lung: An Empirical Approach Based on the Eurotransplant Experience. Transpl Int (2011) 24:393–400. 10.1111/j.1432-2277.2010.01207.x 21155901 · doi ↗ · pubmed ↗
2Smits JM Gottlieb J Verschuuren E Evrard P Hoek R Knoop C Impact of Donor Lung Quality on Post-transplant Recipient Outcome in the Lung Allocation Score Era in Eurotransplant - a Historical Prospective Study. Transpl Int (2020) 33:544–54. 10.1111/tri.13582 31984577 PMC 7216955 · doi ↗ · pubmed ↗
3van Smeden M Groenwold RHH Moons KG. A Cautionary Note on the Use of the Missing Indicator Method for Handling Missing Data in Prediction Research. J Clin Epidemiol (2020) 125:188–90. 10.1016/j.jclinepi.2020.06.007 32565213 · doi ↗ · pubmed ↗
4Harrell F. Regression Coefficients and Scoring Rules. J Clin Epidemiol (1996) 49:819. 10.1016/0895-4356(95)00068-2 8691234 · doi ↗ · pubmed ↗
5Khorsandi SE Giorgakis E Vilca-Melendez HO’Grady J Heneghan M Aluvihare V Developing a Donation After Cardiac Death Risk Index for Adult and Pediatric Liver Transplantation. World J Transpl (2017) 7:203–12. 10.5500/wjt.v 7.i 3.203 28698837 PMC 5487310 · doi ↗ · pubmed ↗
6Cantu E Diamond J Ganjoo N Nottigham A Ramon CV Mc Curry M Scoring Donor Lungs for Graft Failure Risk: The Lung Donor Risk Index (LDRI). Am J Transpl (2024) 24:839–49. 10.1016/j.ajt.2024.01.022 38266712 PMC 11939009 · doi ↗ · pubmed ↗