# Genetic variations and recurrence in stage III Korean colorectal cancer: Insights from tumor-only mutation analysis

**Authors:** Hajin Jeon, Jong Lyul Lee, Hyeran Shim, Soobok Joe, Iksu Byeon, Chan Wook Kim, Seok-Byung Lim, In Ja Park, Yong Sik Yoon, Hoang Bao Khanh Chu, Young-Joon Kim, Chang Sik Yu, Jin Ok Yang, Nejat Mahdieh, Nejat Mahdieh, Nejat Mahdieh

PMC · DOI: 10.1371/journal.pone.0323302 · PLOS One · 2025-05-23

## TL;DR

This study identifies genetic variations linked to recurrence in stage III Korean colorectal cancer using tumor-only sequencing data and highlights the importance of robust filtering methods.

## Contribution

The study introduces a robust tumor-only mutation analysis approach and develops a recurrence prediction model for Korean CRC patients.

## Key findings

- 221 significant mutations across 195 genes were identified, with distinct distributions between recurrence and non-recurrence groups.
- T-category, N-category, and preoperative CEA levels were clinically correlated with CRC recurrence.
- A recurrence prediction model achieved an AUC of 0.77 using PyCaret.

## Abstract

Colorectal cancer (CRC) has the second highest incidence rate among all cancers in Korea, with approximately 30% of patients with regional CRC experiencing recurrence. Understanding the genetic drivers of recurrence is essential for early detection and targeted treatment. Therefore, many studies have focused on genetic analysis using tumor-normal matched samples, as this approach provides more comprehensive insights. However, tumor-only samples are far more common in clinical practice because of the difficulty in obtaining normal tissues, making developing robust methods for analyzing tumor-only data a pressing need. This study aimed to investigate the genetic variations associated with CRC recurrence using tumor-only whole-exome sequencing data from 200 Korean patients with stage III CRC. By applying stringent filtering using public databases including Genome Aggregation Database (gnomAD), Exome Aggregation Consortium (ExAC), Single Nucleotide Polymorphism Database (dbSNP), 1000 Genomes Project (1000G), Korean Variant Archive 2 (KOVA2), and Korean Reference Genome Database (KRGDB), we identified 221 statistically significant mutations across 195 genes with distinct distributions between the recurrence and non-recurrence groups. Furthermore, statistical analysis of the clinical data revealed that the T-category, N-category, and preoperative carcinoembryonic antigen levels were correlated with CRC recurrence. Moreover, we identified nine networks through protein-protein interaction analysis and identified networks with high feature importance. We also developed a CRC recurrence prediction model using PyCaret, which achieved an area under the curve (AUC) of 0.77. Our findings highlight the importance of robust variant filtering in tumor-only sample analyses and provide insights into the genetic landscape of CRC recurrence in the Korean population.

## Linked entities

- **Diseases:** colorectal cancer (MONDO:0005575), CRC (MONDO:0005575)

## Full-text entities

- **Diseases:** stage III (MESH:D062706), CRC (MESH:D015179), cancers (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12101642/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12101642/full.md

## References

74 references — full list in the complete paper: https://tomesphere.com/paper/PMC12101642/full.md

---
Source: https://tomesphere.com/paper/PMC12101642