# Urine-Based cfDNA Ensemble Modeling for Early Detection of Bladder Cancer Using Whole-Genome Methylation Sequencing

**Authors:** Taehoon Kim, Dongju Shin, Hyun Kyu Ahn, Young Joon Moon, Duhee Bang, Kwang Hyun Kim

PMC · DOI: 10.3390/cancers18050767 · 2026-02-27

## TL;DR

A new non-invasive urine test using DNA methylation and copy number variations can detect early-stage bladder cancer with high accuracy.

## Contribution

A urine-based ensemble model combining methylation and CNV data achieves high sensitivity for non-muscle invasive bladder cancer detection.

## Key findings

- Urine samples capture stronger and more tissue-representative cancer signals than plasma.
- The model achieved 91.9% sensitivity and 80% specificity for bladder cancer detection.
- The model successfully identified mutation-negative cases, complementing mutation-based approaches.

## Abstract

Early detection of bladder cancer is challenging due to the limited release of tumor-derived DNA into body fluids. Using whole-genome methylation sequencing, we compared urine and plasma samples and found that urine captures stronger and more tissue-representative cancer signals. We developed a urine-based ensemble model integrating DNA methylation and copy number variations, achieving 91.9% sensitivity in detecting non-muscle invasive bladder cancer. Importantly, the model successfully identified mutation-negative cases, highlighting its complementary value to mutation-based approaches and its potential as a non-invasive tool for bladder cancer detection.

Early detection of bladder cancer poses a major challenge for liquid biopsy due to limited tumor burden and low abundance of tumor-derived DNA. In such low-signal settings, detection sensitivity critically depends on both biofluid selection and effective integration of weak, distributed molecular signals. We analyzed Enzymatic Methyl-seq (EM-seq) data on 41 matched urine–plasma pairs, which demonstrated that urine samples exhibited significantly higher tumor fractions and greater concordance with tissue methylation profiles than plasma. Based on this observation, we developed a urine-based bladder cancer detection framework using EM-seq. We profiled 143 urine samples (68 bladder cancer and 75 healthy controls) and 14 bladder cancer tissues. Methylation markers (113,052 regions) were identified by comparing cancer tissues (n = 14) with urine from healthy individuals (n = 14). Using XGBoost, possible features and their combinations were evaluated, with the combination of methylation and copy number variations (CNV) yielding the best performance as the final ensemble model. When evaluated on an independent test set, the model achieved 91.9% sensitivity at 80% specificity, with an area under the curve (AUC) of 0.932 for bladder cancer detection and 0.928 for non-muscle invasive bladder cancer (NMIBC) detection. Notably, the model successfully detected four of seven mutation-negative cases, demonstrating complementary value to mutation-based approaches.

## Linked entities

- **Diseases:** bladder cancer (MONDO:0004986)

## Full-text entities

- **Diseases:** Bladder Cancer (MESH:D001749), NMIBC (MESH:D000093284), cancer (MESH:D009369)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12984421/full.md

---
Source: https://tomesphere.com/paper/PMC12984421