# Domain-Aware Interpretable Machine Learning Model for Predicting Postoperative Hospital Length of Stay from Perioperative Data: A Retrospective Observational Cohort Study

**Authors:** Iqram Hussain, Joseph R. Scarpa, Richard Boyer

PMC · DOI: 10.3390/bioengineering13020147 · Bioengineering · 2026-01-27

## TL;DR

This study developed an interpretable machine learning model to predict hospital length of stay after surgery, identifying key factors like operation duration and lab values that influence recovery time.

## Contribution

The novel contribution is an interpretable machine learning framework that integrates multimodal perioperative data to predict and explain postoperative hospital length of stay.

## Key findings

- The model achieved R2 = 0.61 and MAE ≈ 1.34 days on the holdout set.
- Operative duration, diagnostic complexity, and intraoperative hemodynamic variability were the strongest predictors of postoperative stay.
- Lower albumin levels and complex procedures were linked to prolonged hospitalization.

## Abstract

Background and Objective: Postoperative hospital length of stay (LOS) reflects surgical recovery and resource demand but remains difficult to predict due to heterogeneous perioperative trajectories. We aimed to develop and validate an interpretable machine learning framework that integrates multimodal perioperative data to accurately predict LOS and uncover clinically meaningful drivers of prolonged hospitalization. Methods: We studied 97,937 adult surgical cases from a large perioperative registry. Routinely collected perioperative data included patient demographics, comorbid conditions, preoperative laboratory values, intraoperative physiologic summaries, and procedural characteristics. Length of stay was modeled using a supervised regression approach with internal cross-validation and independent holdout evaluation. Model performance was assessed at both the cohort and individual levels, and explanatory analyses were performed to quantify the contribution of clinically defined perioperative domains. Results: The model achieved R2 = 0.61 and MAE ≈ 1.34 days on the holdout set, with nearly identical cross-validation performance (R2 = 0.60, MAE ≈ 1.34 days). Operative duration, diagnostic complexity, intraoperative hemodynamic variability, and preoperative laboratory indices—particularly albumin and hematocrit—emerged as the strongest determinants of postoperative stay. Patients with shorter recoveries typically had brief operations, stable physiology, and normal laboratory profiles, whereas prolonged hospitalization was linked to complex procedures, malignant or respiratory diagnoses, and lower albumin levels. Conclusions: Interpretable machine learning enables accurate and generalizable estimation of postoperative LOS while revealing clinically actionable perioperative domains. Such frameworks may facilitate more efficient perioperative planning, improved allocation of hospital resources, and personalized recovery strategies.

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}, CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, ALPP (alkaline phosphatase, placental) [NCBI Gene 250] {aka ALP, PALP, PLAP, PLAP-1}, ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}
- **Diseases:** CTS (MESH:D002349), postoperative (MESH:D019106), thromboembolic (MESH:D013923), infection (MESH:D007239), death (MESH:D003643), musculoskeletal, and respiratory disorders (MESH:D009139), blood loss (MESH:D016063), neoplasms (MESH:D009369), LOS (MESH:D007870), respiratory diseases (MESH:D012140), inflammation (MESH:D007249), injury to (MESH:D014947)
- **Chemicals:** creatinine (MESH:D003404), CO2 (MESH:D002245), chloride (MESH:D002712), ASA (MESH:D001241), oxygen (MESH:D010100)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12938465/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12938465/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12938465/full.md

---
Source: https://tomesphere.com/paper/PMC12938465