# Inferring tumor immune microenvironment -related risk states from pretreatment H&E pathomics and clinical biomarkers to predict checkpoint inhibitor pneumonitis in advanced NSCLC: a multicenter multimodal study

**Authors:** Lei Yuan, Qi Wang, Fei Sun, Wenlong Yang, Jie Lei, Juan Liu, Yiwei Fan, Yibo Shan, Yi Lu, Yaojing Zhang, Yilun Wang, Jianwei Zhu, Lintao Guo, Wenxuan Chen, Shichun Lu, Hongcan Shi

PMC · DOI: 10.3389/fimmu.2026.1792179 · Frontiers in Immunology · 2026-02-19

## TL;DR

This study uses AI-driven analysis of H&E slides and clinical data to predict the risk of checkpoint inhibitor pneumonitis in lung cancer patients before treatment.

## Contribution

A novel multimodal model combining pathomics and clinical biomarkers to predict pneumonitis risk in NSCLC patients.

## Key findings

- The multimodal fusion model achieved the highest AUC scores (up to 0.930) for predicting CIP risk.
- Pathomics models outperformed clinical models in predicting pneumonitis risk.
- Integration of H&E pathomics and clinical data enables accurate pretreatment risk stratification.

## Abstract

Checkpoint inhibitor pneumonitis (CIP) is a rare but potentially fatal immune-related adverse event (irAE) that can interrupt immune checkpoint blockade in non-small cell lung cancer (NSCLC). With no validated pretreatment biomarkers and a diagnosis largely made by exclusion, upfront risk stratification is required. Recent advances in artificial intelligence (AI)-driven pathomics have made it feasible to infer tumor immune microenvironment (TIME)-relevant risk states in patients with NSCLC. Accordingly, we leveraged hematoxylin and eosin(H&E)-based digital pathomics combined with clinical variables to interrogate the TIME in patients who developed CIP and to enable pretreatment and early prediction of CIP.

In this retrospective study, 346 eligible patients from three hospitals were screened consecutively between January 2022 and January 2025. Patients were divided into CIP and non-CIP groups according to whether CIP occurred at a prespecified observation endpoint. We first developed a pathomics model that employed convolutional neural networks (CNNs) combined with multi-instance learning (MIL) to generate predictions at both the patch and whole slide image (WSI) levels on H&E-stained slides. Separately, we constructed a clinical model using logistic regression (LR) to process the structured clinical data accompanying each case. Subsequently, pathological and clinical information were integrated, where modeling was advanced from modality-specific feature learning to cross-modal representation learning, and final predictive modeling was completed. The predictive performance of different models was evaluated using the area under the Receiver Operating Characteristic (ROC) curve and benchmarked against unimodal models and standard ensemble methods.

When the models were evaluated across both internal validation and external test datasets, the pathomics model demonstrated noticeably stronger performance than the clinical approach, achieving area under the curve (AUC) scores of 0.916, 0.875(test 1), and 0.843(test 2), respectively, while the clinical model posted more modest results of 0.880, 0.569(test 1), and 0.594(test 2). The most significant outcome, however, emerged from the multimodal fusion model, which produced the strongest results of all, with performance metrics of 0.930, 0.919(test 1), and 0.905(test 2) in the validation and test phases, respectively.

Pretreatment H&E-derived pathomics, integrated with baseline clinical biomarkers, enable accurate prediction of CIP risk in locally advanced or metastatic NSCLC. This framework supports proactive surveillance and individualized immune checkpoint inhibitor (ICI) strategies and provides a scalable route to decode TIME-relevant states from routine pathology.

## Linked entities

- **Diseases:** non-small cell lung cancer (MONDO:0005233)

## Full-text entities

- **Genes:** CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}, CTLA4 (cytotoxic T-lymphocyte associated protein 4) [NCBI Gene 1493] {aka ALPS5, CD, CD152, CELIAC3, CTLA-4, GRD4}, ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}, CD274 (CD274 molecule) [NCBI Gene 29126] {aka ADMIO5, B7-H, B7H1, PD-L1, PDCD1L1, PDCD1LG1}, PDCD1 (programmed cell death 1) [NCBI Gene 5133] {aka ADMIO4, AIMTBS, CD279, PD-1, PD1, SLEB2}, GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}
- **Diseases:** HIS (MESH:D003428), respiratory symptoms (MESH:D012818), death (MESH:D003643), infection (MESH:D007239), SGD (MESH:D000141), cough (MESH:D003371), hepatobiliary dysfunction (MESH:D004066), LYMPH (MESH:D000072717), TNM (MESH:D008207), heart failure (MESH:D006333), EOS (MESH:C538157), interstitial pneumonitis (MESH:D017563), pulmonary fibrosis (MESH:D011658), stage III (MESH:D062706), WSI (MESH:C564543), inflammation (MESH:D007249), pulmonary infection (MESH:D012141), H&amp;E (MESH:D016751), lung cancer (MESH:D008175), dyspnea (MESH:D004417), Tumor (MESH:D009369), adenocarcinoma (MESH:D000230), lung disease (MESH:D008171), lung injury (MESH:D055370), respiratory failure (MESH:D012131), chest pain (MESH:D002637), Fatigue (MESH:D005221), chronic obstructive pulmonary disease (MESH:D029424), CIP (MESH:D011014), stage IIIA/IIIB (MESH:C566890), pulmonary embolism (MESH:D011655), fever (MESH:D005334), NSCLC (MESH:D002289)
- **Chemicals:** pemetrexed (MESH:D000068437), DB (-), H&amp;E (MESH:D006371), hematoxylin (MESH:D006416), Pembrolizumab (MESH:C582435), Sintilimab (MESH:C000632826), Durvalumab (MESH:C000613593), eosin (MESH:D004801), Nivolumab (MESH:D000077594), platinum (MESH:D010984), metal (MESH:D008670), Bilirubin (MESH:D001663), paclitaxel (MESH:D017239), Tislelizumab (MESH:C000707970)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12960525/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12960525/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC12960525/full.md

---
Source: https://tomesphere.com/paper/PMC12960525