# Unveiling Novel Genetic Mutations and Prognostic Indicators in Breast Carcinoma: An Analysis of The Cancer Genome Atlas (TCGA) Data

**Authors:** Raman Butta, Shristi Butta

PMC · DOI: 10.7759/cureus.98637 · Cureus · 2025-12-07

## TL;DR

This study identifies new genetic mutations and prognostic indicators in breast cancer using TCGA data, highlighting potential targets for treatment.

## Contribution

The study reveals novel genes like SETDB1, ARMCX5, and SLCO6A1 with prognostic significance in breast carcinoma.

## Key findings

- SETDB1, USP37, NDUFS1, TRPM4, and MYO18A are associated with poor prognosis in breast cancer.
- ARMCX5 and SLCO6A1 are enriched in high-grade tumors.
- The findings suggest a role for metabolic and epigenetic genes in breast cancer progression.

## Abstract

Background

Breast carcinoma is one of the leading causes of cancer-associated mortality in women worldwide. Although several advances have been made in molecularly classifying breast cancers, treatment resistance continues to limit the overall survival. The Cancer Genome Atlas (TCGA) has unraveled diverse genomic alterations in breast carcinoma. However, some potential biomarkers still remain unexplored, like SETDB1, a histone methyltransferase involved in epigenetic silencing of tumor suppressor genes. ARMCX5 and SLCO6A1 are also some of the unexplored genes that could have a potential role in drug resistance.

Materials and methods

The Mutation Annotation Format (MAF) data set from the Cancer Genome Atlas Breast Cancer (TCGA-BRCA) cohort was analyzed using the Maftools, Survival, Mclust, and Survminer R packages. Oncodrive driver analysis and protein family (PFAM) domain mapping were performed. A total of 845 cases of breast carcinoma with complete survival data were retrieved, of which mutation data for 800 cases were available. Comprehensive mutation analysis was also done to unveil unexplored genes. Survival data of 845 cases were integrated for Kaplan-Meier and Cox proportional hazard analysis to ascertain the prognostic significance of an array of genes. Oncogenic signaling pathway mapping was done to determine the clinical enrichment of the genes associated with breast carcinoma. Genes associated with clinical enrichment, clustering of somatic mutations, and prognosis were subjected to further analysis.

Results

Besides the established molecular drivers like PIK3CA and TP53, we found several novel and understudied genes with potential prognostic and oncogenic significance. SETDB1 (p < 0.0001), USP37 (p < 0.0001), NDUFS1 (p = 0.025), TRPM4 (p < 0.0001), and MYO18A (p < 0.0001) were associated with poor prognosis. ARMCX5 (p < 0.0001) and SLCO6A1 (p < 0.0001) were enriched in high-grade tumors.

Conclusion

The TCGA-BRCA cohort analysis emphasizes a potential interplay of metabolic genes like NDUFS1, TRPM4, ARMCX5, SLCO6A1, and epigenetic axis genes like SETDB1 and USP37 in the oncogenesis and prognosis of breast carcinomas. These observations could open potential avenues for exploring novel therapeutics in aggressive breast carcinomas.

## Linked entities

- **Genes:** SETDB1 (SET domain bifurcated histone lysine methyltransferase 1) [NCBI Gene 9869], ARMCX5 (armadillo repeat containing X-linked 5) [NCBI Gene 64860], SLCO6A1 (solute carrier organic anion transporter family member 6A1) [NCBI Gene 133482], USP37 (ubiquitin specific peptidase 37) [NCBI Gene 57695], NDUFS1 (NADH:ubiquinone oxidoreductase core subunit S1) [NCBI Gene 4719], TRPM4 (transient receptor potential cation channel subfamily M member 4) [NCBI Gene 54795], MYO18A (myosin XVIIIA) [NCBI Gene 399687], PIK3CA (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha) [NCBI Gene 5290], TP53 (tumor protein p53) [NCBI Gene 7157]
- **Diseases:** breast carcinoma (MONDO:0004989)

## Full-text entities

- **Genes:** TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, ARMCX5 (armadillo repeat containing X-linked 5) [NCBI Gene 64860] {aka GASP5}, USP37 (ubiquitin specific peptidase 37) [NCBI Gene 57695], TRPM4 (transient receptor potential cation channel subfamily M member 4) [NCBI Gene 54795] {aka EKVP6, LTrpC4, PFHB1B, TRPM4B, hTRPM4}, NDUFS1 (NADH:ubiquinone oxidoreductase core subunit S1) [NCBI Gene 4719] {aka CI-75Kd, CI-75k, MC1DN5, PRO1304}, PIK3CA (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha) [NCBI Gene 5290] {aka CCM4, CLAPO, CLOVE, CWS5, HMH, MCAP}, MYO18A (myosin XVIIIA) [NCBI Gene 399687] {aka MAJN, MYSPDZ, SP-R210, SPR210, TIAF1}, SLCO6A1 (solute carrier organic anion transporter family member 6A1) [NCBI Gene 133482] {aka CT48, GST, OATP-I, OATP6A1, OATPY}, SETDB1 (SET domain bifurcated histone lysine methyltransferase 1) [NCBI Gene 9869] {aka ESET, H3-K9-HMTase4, KG1T, KMT1E, TDRD21}, BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}
- **Diseases:** Breast Cancer (MESH:D001943), Cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12772933/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12772933/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/PMC12772933/full.md

---
Source: https://tomesphere.com/paper/PMC12772933