# Identification of prognostic markers related to homologous recombination deficiency in cholangiocarcinoma using CoxBoost and LASSO machine learning techniques

**Authors:** Yan Liu, Cheng Zhou, Tianhao Shen, Xue Yu, Qiuying Li, Tinghui Jiang, Wei Li, Yongqiang Zhu

PMC · DOI: 10.3389/fimmu.2026.1615657 · Frontiers in Immunology · 2026-01-21

## TL;DR

This study identifies six genes linked to poor outcomes in cholangiocarcinoma using machine learning, offering new biomarkers for personalized treatment.

## Contribution

Novel HRD-based prognostic model using CoxBoost and LASSO for cholangiocarcinoma with six identified candidate genes.

## Key findings

- High HRD scores correlate with worse survival and progression in cholangiocarcinoma.
- CoxBoost and LASSO algorithms outperformed other methods in building the prognostic model.
- Six genes (ANXA2P1, BBOX1, KLHL33, MN1, OR51A4, TRDN) show significant differential expression.

## Abstract

Cholangiocarcinoma (CHOL) is a highly aggressive malignancy with a poor prognosis. Homologous recombination deficiency (HRD) is associated with genomic instability and cancer progression, making it a potential therapeutic target. The aim of this study is to develop novel potential prognostic biomarkers and construct an HRD-based prognostic risk prediction model for CHOL to enhance clinical precision medicine.

We analyzed HRD across cancers using multiple datasets including TCGA-CHOL, TCGA-LIHC, GDC TARGET-OS, and IMvigor210. HRD scores were calculated using data from Thorsson et al. and the maftools R package was used for mutation data visualization and tumor mutational burden (TMB) calculation. Differential gene expression analysis identified HRD-related genes, validated in tumor and adjacent non-tumor tissues using RT-PCR. 10 machine-learning algorithms including RSF, LASSO, GBM, Survival-SVM, SuperPC, Ridge, plsRcox, CoxBoost, Stepwise Cox, Enet were selected to construct a prognostic model and validated in the E-MTAB-6389 and GSE107943. Among them, RSF, LASSO, CoxBoost and Stepwise Cox have the functions of dimension reduction and variable screening.

Comparative analysis demonstrated significant associations between HRD scores and genomic instability markers. High HRD scores independently predicted poorer overall survival (log-rank p = 0.043) and progression-free interval (log-rank p = 0.028). Immune infiltration analysis revealed higher levels of active B cells and regulatory T cells in the low-risk group, suggesting differential immune landscapes between risk groups. We identified the CoxBoost and LASSO algorithms as the optimal combination for creating a CoxBoost+ LASSO prognostic model. Using this model, we identified six genes (ANXA2P1, BBOX1, KLHL33, MN1, OR51A4, and TRDN) with significant differential expression.

Our HRD-based prediction model offers a reliable tool for CHOL prognosis, suggesting new potential for six candidate genes as prognostic biomarkers. It highlights potential therapeutic targets and drug sensitivities, providing new insights into personalized treatment strategies for CHOL management.

## Linked entities

- **Genes:** ANXA2P1 (annexin A2 pseudogene 1) [NCBI Gene 303], BBOX1 (gamma-butyrobetaine hydroxylase 1) [NCBI Gene 8424], KLHL33 (kelch like family member 33) [NCBI Gene 123103], MN1 (MN1 proto-oncogene, transcriptional regulator) [NCBI Gene 4330], OR51A4 (olfactory receptor family 51 subfamily A member 4) [NCBI Gene 401666], TRDN (triadin) [NCBI Gene 10345]
- **Diseases:** cholangiocarcinoma (MONDO:0019087)

## Full-text entities

- **Genes:** BBOX1 (gamma-butyrobetaine hydroxylase 1) [NCBI Gene 8424] {aka BBH, BBOX, G-BBH, gamma-BBH}, OR51A4 (olfactory receptor family 51 subfamily A member 4) [NCBI Gene 401666], MN1 (MN1 proto-oncogene, transcriptional regulator) [NCBI Gene 4330] {aka CEBALID, MGCR, MGCR1, MGCR1-PEN, dJ353E16.2}, KLHL33 (kelch like family member 33) [NCBI Gene 123103], TRDN (triadin) [NCBI Gene 10345] {aka CARDAR, CPVT5, TDN, TRISK}, ANXA2P1 (annexin A2 pseudogene 1) [NCBI Gene 303] {aka ANX2L1, ANX2P1, LPC2A}
- **Diseases:** CHOL (MESH:D018281), HRD (MESH:C535296), cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12867883/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12867883/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC12867883/full.md

---
Source: https://tomesphere.com/paper/PMC12867883