# Exploring the biological functions of PCOS: identifying hub androgen-related genes through bioinformatics

**Authors:** Xinling He, Lan Su, Ji Yin, Yuanyuan Lai, Han Yang, Zheng Yu, Xiaoyan Zheng, Jia Liu, Jie Yang

PMC · DOI: 10.3389/fmed.2026.1693216 · 2026-03-19

## TL;DR

This study identifies key androgen-related genes in PCOS using bioinformatics methods and validates them experimentally, offering new insights for diagnosis and treatment.

## Contribution

The study integrates multiple algorithms to identify four hub androgen-related genes in PCOS and validates them in mouse models.

## Key findings

- Four hub androgen-related genes (ALDH1A1, DHRS9, PRKCB, SGPL1) were identified through integration of LASSO, RF, and PPI methods.
- RT-qPCR validation confirmed altered expression of these genes in PCOS mouse ovarian tissues.
- ARG molecular subtypes were classified, showing distinct immune infiltration and gene expression patterns.

## Abstract

Polycystic ovary syndrome (PCOS) is a prevalent reproductive endocrine disorder in women. While the role of androgens in PCOS is well-recognized, the underlying mechanisms warrant further investigation. In this study, we identified potential hub androgen-related genes (ARGs) in PCOS and established diagnostic and classification models to find novel biomarkers for PCOS therapy. Five datasets (GSE34526, GSE80432, GSE95728, GSE124226, and GSE137684) were retrieved from GEO, followed by data normalization and batch effect removal. To identify hub genes, a PPI network was constructed based on differentially expressed ARGs. Subsequently, we employed the least absolute shrinkage and selection operator (LASSO) regression analysis and random forest (RF) algorithm to screen hub ARGs. Besides, hub ARGs associated with PCOS were determined by integrating the results of the three algorithms. Additionally, a nomogram was constructed using these hub ARGs to predict the risk of PCOS development. We also investigated the classification of ARG molecular subtypes and assessed immune characteristics and gene expression profiles in different subtypes. Lastly, RT-qPCR was utilized to validate the reliability of the hub genes. A total of 91 ARGs were retrieved from the GSEA website. This study included 26 healthy and 34 PCOS samples. Using the LASSO identified 13 key ARGs, RF identified 10 crucial ARGs, and PPI identified 19 pivotal ARGs. Integration of three methods identified four hub ARGs (ALDH1A1, DHRS9, PRKCB, and SGPL1). And a nomogram was constructed to predict the risk of PCOS occurrence. Notably, we validated the expression levels of the 4 hub ARGs in ovarian tissues from PCOS mice using RT-qPCR. The results showed that the expression levels of DHRS9, SGPL1, and ALDH1A1 were significantly downregulated, while PRKCB was significantly upregulated, which was consistent with our data analysis findings. Furthermore, samples were divided into two distinct ARG patterns and further explored the relationship between immune cell infiltration and these patterns. ARG scores were significantly higher in cluster A or gene cluster A compared to cluster B or gene cluster B. Finally, we evaluated the expression levels of PCOS-related genes in distinct clusters. In summary, our results may further elucidate the mechanisms of PCOS pathogenesis and offer novel ideas for PCOS diagnosis and treatment.

## Linked entities

- **Genes:** ALDH1A1 (aldehyde dehydrogenase 1 family member A1) [NCBI Gene 216], DHRS9 (dehydrogenase/reductase 9) [NCBI Gene 10170], PRKCB (protein kinase C beta) [NCBI Gene 5579], SGPL1 (sphingosine-1-phosphate lyase 1) [NCBI Gene 8879]
- **Diseases:** Polycystic ovary syndrome (MONDO:0008487), PCOS (MONDO:0008487)
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** ALDH1A1 (aldehyde dehydrogenase 1 family member A1) [NCBI Gene 216] {aka ALDC, ALDH-E1, ALDH1, ALDH11, HEL-9, HEL-S-53e}, SGPL1 (sphingosine-1-phosphate lyase 1) [NCBI Gene 8879] {aka NPHS14, RENI, S1PL, SPL}, ABL2 (ABL proto-oncogene 2, non-receptor tyrosine kinase) [NCBI Gene 27] {aka ABLL, ARG}, DHRS9 (dehydrogenase/reductase 9) [NCBI Gene 10170] {aka 3-alpha-HSD, 3ALPHA-HSD, RDH-TBE, RDH15, RDHL, RDHTBE}, PRKCB (protein kinase C beta) [NCBI Gene 5579] {aka PKC-beta, PKCB, PKCI(2), PKCbeta, PRKCB1, PRKCB2}
- **Diseases:** PCOS (MESH:D011085), reproductive endocrine disorder (MESH:D004700)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13043399/full.md

---
Source: https://tomesphere.com/paper/PMC13043399