# EM-DeepSD: A Deep Neural Network Model Based on Cell-Free DNA End-Motif Signal Decomposition for Cancer Diagnosis

**Authors:** Zhi-Yang Zhao, Chang-Ling Huang, Tong-Min Wang, Shi-Hao Zhou, Lu Pei, Wen-Hui Jia, Wei-Hua Jia

PMC · DOI: 10.3390/diagnostics15091156 · Diagnostics · 2025-05-01

## TL;DR

This paper introduces EM-DeepSD, a deep learning model that improves cancer diagnosis by analyzing cell-free DNA end-motifs across different sequencing methods.

## Contribution

The novel EM-DeepSD framework uses signal decomposition of cfDNA end-motifs to enhance cancer diagnosis accuracy across sequencing modalities.

## Key findings

- EM-DeepSSA outperformed benchmark methods in internal validation with an AUC of 0.920.
- The model achieved AUCs of 0.933 and 0.956 on two external testing sets using different sequencing techniques.
- EM-DeepSD demonstrates adaptability and high performance across various cfDNA sequencing modalities.

## Abstract

Background and Objectives: The accurate discrimination between patients with and without cancer using their cell-free DNA (cfDNA) is crucial for early cancer diagnosis. The end-motifs of cfDNA serve as significant cancer biomarkers, offering compelling prospects for cancer diagnosis. This study proposes EM-DeepSD, a signal decomposition deep learning framework based on cfDNA end-motifs, which is aimed at improving the accuracy of cancer diagnosis and adapting to different sequencing modalities. Materials and Methods: This study included 146 patients diagnosed with cancer and 122 non-cancer controls. EM-DeepSD comprises three core modules. Initially, it utilizes a signal decomposition module to decompose and reconstruct the input end-motif profiles, thereby generating multiple regular subsequences that optimize the subsequent modeling process. Subsequently, both a machine learning module and a deep learning module are employed to improve the accuracy of cancer diagnosis. Furthermore, this paper compares the performance of EM-DeepSD with that of existing benchmarked methods to demonstrate its superiority. Based on the EM-DeepSD framework, we developed the EM-DeepSSA model and compared it with two benchmarked methods across different cfDNA sequencing datasets. Results: In the internal validation set, EM-DeepSSA outperformed the two benchmark methods for cancer diagnosis (area under the curve (AUC), 0.920; adjusted p value < 0.05). Meanwhile, EM-DeepSSA also exhibited the best performance on two independent external testing sets that were subjected to 5-hydroxymethylcytosine sequencing (5hmCS) and broad-range cell-free DNA sequencing (BR-cfDNA-Seq), respectively (test set-1: AUC = 0.933; test set-2: AUC = 0.956; adjusted p value < 0.05). Conclusions: In summary, we present a new framework which can achieve high classification performance in cancer diagnosis and which is applicable to different sequencing modalities.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** Cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12071254/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12071254/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC12071254/full.md

---
Source: https://tomesphere.com/paper/PMC12071254