Integrative Bioinformatics Analysis Reveals Key Regulatory Genes and Therapeutic Targets in Ulcerative Colitis Pathogenesis
Sheikh Atikur Rahman, Mst. Tania Khatun, Mahendra Singh, Viplov Kumar Biswas, Forkanul Hoque, Nurun Nesa Zaman, Anzana Parvin, Mohammad Khaja Mafij Uddin, Md. Mominul Islam Sheikh, Most Morium Begum, Rakesh Arya, Hossain Md. Faruquee

TL;DR
This study identifies key genes and potential treatments for ulcerative colitis using bioinformatics analysis of gene expression data.
Contribution
The study discovers novel biomarkers and therapeutic candidates for ulcerative colitis through integrative bioinformatics analysis.
Findings
Key hub genes like TLR2, IFNG, and CD163 are implicated in UC pathogenesis and immune responses.
MicroRNAs such as hsa-miR-34a-5p and hsa-miR-335-5p regulate gene expression in UC.
Apremilast and Golotimod are predicted as promising therapeutic compounds for UC.
Abstract
Background: Ulcerative colitis (UC), a chronic and relapsing form of inflammatory bowel disease (IBD), arises from a multifactorial interplay of genetic predisposition, immune dysregulation, and environmental triggers. Despite advances in understanding UC pathogenesis, the identification of reliable biomarkers and key regulatory genes remains essential for unraveling disease mechanisms. Such insights are crucial for improving diagnostic precision and developing personalized therapeutic strategies. Methods: In this study, gene expression profiles from publicly available microarray and RNA-sequencing datasets were systematically analyzed using advanced bioinformatics tools. Differentially expressed genes (DEGs) were identified through statistical comparisons, and functional enrichment analyses were performed to explore their biological relevance. A total of 141 overlapping DEGs were…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8| Category | Validated UC/Treatment Markers (Previous Studies) | Functional Role in UC | Hub Genes Identified in This Study | Functional Role in UC (This Study) |
|---|---|---|---|---|
|
| Calprotectin (S100A8/A9) [ | Clinical biomarkers widely used for inflammation and disease monitoring | CD163 | Scavenger receptor on macrophages; associated with anti-inflammatory modulation, fibrosis, and impaired tissue repair [ |
|
| HNF4A, ICAM1, RNF186 [ | Regulate epithelial barrier integrity, immune modulation, and genetic susceptibility to UC | TLR2 | Recognizes microbial ligands; initiates innate immune responses; dysregulation activates NF-κB and exacerbates mucosal inflammation [ |
|
| Oncostatin M (OSM) [ | Pro-inflammatory mediators; OSM predicts anti-TNF therapy resistance | IFNG | Key cytokine driving macrophage activation, T-cell infiltration, and chronic inflammation in UC [ |
|
| Anti-TNF response markers (e.g., OSM, CRP) [ | Guide biologic therapy decisions | Apremilast, Golotimod (predicted drug–gene interactions) | Target IFNG and TLR2; potential drug repurposing candidates with favorable ADME/T properties [ |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInflammatory Bowel Disease · Bioinformatics and Genomic Networks · GDF15 and Related Biomarkers
1. Introduction
Ulcerative colitis (UC) is a long-term inflammatory disorder of the large intestine, primarily impacting the colon and rectum [1]. It typically originates in the rectum and extends proximally through the colon, either partially or completely [2]. UC is a persistent, relapsing condition of unknown origin and has been increasingly reported worldwide [3]. Its complex pathogenesis involves a multifactorial interplay of environmental, microbial, and genetic factors that disrupt mucosal homeostasis and trigger inappropriate immune responses [4]. UC substantially impairs quality of life, often resulting in complications such as diarrhea, bloody mucous-laden stools, tenesmus, fecal urgency, incontinence, fatigue, and abdominal pain [5]. It is also a significant precursor to colorectal cancer (CRC), with risk increasing proportionally to the duration of chronic intestinal inflammation [6,7]. Some studies suggest an exponential rise in annual CRC risk. UC accounts for approximately 15% of IBD-related mortality, with an incidence rate 1.5 to 2.4 times higher than in the general population [8,9]. Over time, the global incidence of UC has risen significantly, and by 2035, UC is projected to affect over two million individuals in Asia [10].
Although multiple therapeutic options exist, up to 15% of patients exhibit treatment resistance or develop colitis with dysplastic changes requiring surgical intervention [11]. In cases of acute severe ulcerative colitis (ASUC), oral or intravenous drug delivery may be compromised by systemic side effects and poor site-specific drug availability, which may restrict therapeutic effectiveness [12,13]. Differentiating UC from other conditions can be difficult, as diagnosis requires a thorough assessment of clinical symptoms, endoscopic findings, and histological features. Due to its uncertain etiology and lack of precise diagnostic markers, identifying predictive biomarkers remains critical for timely diagnosis and targeted therapy.
Currently, no approved biomarkers or commercial assays are available for early UC detection in clinical settings. However, bioinformatics approaches have been extensively used to explore disease-related molecular mechanisms and uncover novel biomarkers. Recent advances, including RNA sequencing and mucosal gene expression profiling, have yielded several candidate genes. Notable among these are Cadherin 11, hepatocyte nuclear factor 4 alpha (HNF4α), intercellular adhesion molecule 1 (ICAM1), and ring finger protein 186 (RNF186) [14]. These genes are linked to essential cellular functions, including immune modulation, epithelial repair, barrier integrity, microbial defense, autophagy, cell proliferation, and apoptosis [15,16]. Still, the specificity and sensitivity of current markers remain suboptimal, often overlapping with other inflammatory conditions. Therefore, discovering new biomarkers is essential for early detection and improved therapeutic outcomes.
This study aims to computationally identify shared genomic biomarkers as potential drug targets for UC by analyzing their functions, regulatory pathways, and upstream mediators, including transcription factors and microRNAs. Additionally, candidate therapeutic compounds were evaluated through molecular docking to validate their binding affinities and interactions with key hub targets.
2. Materials and Methods
2.1. Data Collection for Gene Expression Profiles
The gene expression profiles (RNA-seq data) were retrieved from the Gene Expression Omnibus (GEO) database using GREIN (http://www.ilincs.org/apps/grein/?gse, accessed on 11 October 2024) with the search term “ulcerative colitis” [17]. The selected datasets met the following inclusion criteria to ensure analytical relevance: (a) Homo sapiens; (b) mRNA expression profiling; (c) study design including both experimental (UC) and control groups; (d) a minimum of 10 samples per dataset; and (e) exclusion of cell line data. Based on these criteria, three GEO datasets-GSE135223, GSE112057, and GSE83687 were selected for comparative analysis of UC and normal tissue gene expression. A schematic overview of the study design is presented in Figure 1.
2.2. Screening and Identification of Differentially Expressed Genes (DEGs)
The raw expression values from each GEO dataset were preprocessed and normalized by the original submitters using the respective GEO platform pipelines, such as RMA normalization for microarray data and counts per million (CPM)/TPM normalization for RNA-seq data [18]. To ensure consistency across datasets, the expression values were log2-transformed (where applicable) and further median-centered to minimize batch effects. The normalized data were then analyzed in Excel 2016 to screen for DEGs between the UC and control groups, with genes filtered using the criteria |logFC| > 1 and p < 0.05. Genes with logFC > 1 were considered upregulated, while those with logFC < –1 were considered downregulated. Finally, the online tool jvenn was applied to identify DEGs common across all three datasets [19].
2.3. Protein–Protein Interaction (PPI) Network Analysis
The functional interactions between DEGs were assessed using a PPI network. The PPI network was constructed using the Search Tool for the Retrieval of Interacting Genes (STRING) [20]. Protein interaction data were visualized and analyzed using Cytoscape (version 3.9.1). In the PPI network, the nodes represent proteins, while the edges indicate interactions between them. In the network, nodes represent proteins and edges indicate interactions. Hub genes were identified based on their connectivity, with higher values indicating stronger centrality within the network.
2.4. Identification of Hub Genes
Hub genes were identified using the cytoHubba plugin in Cytoscape [21]. Twelve algorithms were applied to assess node importance, including MCC, Closeness, Radiality, Betweenness, and Stress centralities [22]. The top 10 genes from each method were selected based on scores, indicating their potential biological relevance and prioritization for further study.
2.5. Gene Enrichment Analysis
Functional enrichment was conducted using Database for Annotation, Visualization, and Integrated Discovery (DAVID, https://davidbioinformatics.nih.gov/tools.jsp, accessed on 23 January 2025), which provides comprehensive annotation tools [23]. Enhanced p-value enrichment was applied to ensure statistical significance and improve analysis accuracy. Enrichment was evaluated for biological processes, cellular components, and molecular functions. The top 10 terms in each category were selected, and the top 5 pathways were identified using KEGG, WikiPathways, and Reactome. Enrichment results were visualized using bubble plots generated via SRplot (https://www.bioinformatics.com.cn/en, accessed on 29 january 2025).
2.6. Gene–miRNA–TF Interaction Network
Regulatory interactions were analyzed to explore how transcription factors (TFs) and microRNAs (miRNAs) influence gene expression [24,25,26]. Gene regulatory network (GRN) analysis was performed using NetworkAnalyst (https://www.networkanalyst.ca/, accessed on 22 February 2025) and integrated data from three databases [27]. miRNA–gene interactions were sourced from miRTarBase v9.0 and TarBase v9.0, while TF–gene relationships were predicted based on ENCODE and JASPAR binding profiles.
2.7. Gene–Disease Association Analysis
Gene–disease associations were investigated using DisGeNET (https://www.disgenet.org/, accessed on 24 February 2025), which integrates information from multiple sources related to human diseases [28]. Network analysis helped identify diseases and long-term complications linked to shared DEGs.
2.8. Protein–Drug Interaction Analysis
Protein–drug interactions were analyzed using the DrugBank (version 5.0) database [29]. NetworkAnalyst was used to visualize and filter potential drug candidates targeting key DEGs for UC therapy.
2.9. Molecular Docking
Molecular docking was performed using AutoDock Vina via PyRx. The conjugate gradient algorithm was applied to optimize potential ligands in 200 steps using the Universal Force Field (UFF) [30]. The docking grid was centered at (x = 11.5647583557, y = −15.0323563498, z = −14.9472), with dimensions of X: 134.951379941 Å, Y: 124.914233058 Å, and Z: 63.7435282516Å. Ligands selected for further analysis exhibited higher binding affinities (kcal/mol) than the control ligand, with more negative values indicating stronger binding interactions. Only ligands with a zero root mean square deviation (RMSD) were considered. Finally, molecular interactions and structural visualizations were analyzed using Discovery Studio.
2.10. ADME/T Prediction
ADME/T (Absorption, Distribution, Metabolism, Excretion, and Toxicity) prediction is a crucial filtering step in computational drug design [31]. Computational methods offer a cost-effective and time-efficient alternative to labor-intensive experimental evaluations, as demonstrated in recent studies. In this study, SwissADME was employed to assess pharmacokinetic properties, including drug absorption, metabolism, distribution, and excretion [32]. Additionally, toxicity assessments were conducted using the pkCSM server [33]. Molecular structures were represented using Structure Data File (SDF) strings and the Simplified Molecular Input Line Entry System (SMILES) for accurate computational analysis.
3. Results
3.1. Identification of DEGs
A total of 2340, 1526, and 2759 significant DEGs were identified in the GSE135223, GSE112057, and GSE83687 datasets, respectively. Among these, 1,279, 905, and 1512 genes were upregulated, while 1,061, 621, and 1247 were downregulated (Table 1) and Supplementary Table S1. GSE135223 exhibited a wider range of gene expression changes, indicating notable transcriptional alterations (Figure 2A). In contrast, GSE112057 displayed a moderate distribution with a higher proportion of upregulated genes (Figure 2B), and GSE83687 showed a more compact distribution, suggesting less pronounced fold changes (Figure 2C). Most DEGs fell within the log_10_(mean expression) range of 1 to 4, indicating moderate to high expression levels.
3.2. Common Gene Identification and PPI Construction
The Venn diagram (Figure 3A) shows the overlap of DEGs among three datasets (GSE135223, GSE112057, and GSE83687), identifying 141 shared DEGs that may serve as key contributors to UC. The final PPI network consisted of 105 nodes and 366 edges, constructed using STRING interactions at a confidence threshold of 0.7 (Figure 3B) identifies IFNG, TLR2, and CD163 as central hub genes, with strong interactions (orange nodes) involving other significant genes such as PPARG, ARG1, CCL4, and CXCL9. These genes are closely associated with immune regulation, inflammatory signaling, and UC-related disease mechanisms, underscoring their potential as biomarkers and therapeutic targets.
3.3. Identification of Hub Genes
We analyzed the PPI network derived from STRING using our gene sets and visualized it in Cytoscape to explore common adhesion pathways and DEG interactions. The network consisted of 366 edges and 105 nodes based on STRING interactions at a confidence threshold of 0.7 (Figure 3B). To ensure robustness of the network and avoid centrality bias, we evaluated hub gene stability across 12 cytoHubba ranking methods. TLR2, IFNG, and CD163 consistently ranked among the top hubs across five key centrality measures (MCC, Closeness, Radiality, Betweenness, and Stress) (Figure 4F). Varying the STRING confidence threshold between 0.4 (medium confidence) and 0.9 (high confidence) showed that these core hubs remained stable despite changes in network density. Additionally, permutation testing with 1000 randomized networks demonstrated that the observed network topology was significantly different from random (p < 0.001, Kolmogorov–Smirnov test), confirming the biological relevance of the identified hubs (Figure 4A–E).
3.4. Gene Ontology (GO) and Pathways Enrichment Analysis
The DAVID database was used to perform gene ontology and pathway enrichment analysis, helping determine the biological significance of pathways associated with DEGs. Gene ontology assesses gene functions and attributes by integrating comprehensive computational knowledge resources [34]. An ontology serves as a structured framework of biological knowledge that supports biological modeling and annotation, commonly applied in biomedical research.
For GO analysis, three major domains were considered: molecular function, cellular component, and biological process [35]. The most frequently enriched GO terms included signal transduction, inflammatory response, membrane, extracellular region, carbohydrate binding, and transmembrane activity. Bubble plots illustrate the enrichment results across each category (Figure 5A–C). Additionally, several significant molecular pathways were identified, including the phagosome pathway and PI3K-Akt signaling pathway (KEGG), the innate immune system pathway (Reactome), and the focal adhesion PI3K-Akt-mTOR signaling pathway (WikiPathways), as shown in Figure 5D. Full GO annotation details and pathway enrichment results are provided in Table 2 and Table 3. Also, Full data is available in Supplementary Tables S2 and S3.
3.5. Identification of Gene Regulators
Gene–miRNA interactions were analyzed using the NetworkAnalyst platform, incorporating data from the miRTarBase v9.0 database. This revealed 42 miRNAs linked to 13 seed genes through 61 edges (betweenness = 20), with hsa-miR-34a-5p, hsa-miR-335-5p, and hsa-miR-24-3p emerging as the top regulators. Analysis using the TarBase v9.0 database identified 306 miRNAs associated with the same 13 genes via 509 edges, with hsa-miR-34a-5p, hsa-miR-23a-5p, and hsa-miR-26a-5p showing the strongest influence. Notably, hsa-miR-34a-5p was common to both datasets (Figure 6A,B).
For transcription factor (TF) analysis, ENCODE and JASPAR databases were used. ENCODE identified 142 TFs interacting with 13 genes across 216 edges (betweenness = 40), with GABPA, SUPT5H, and NR2F1 being the most prominent. JASPAR revealed 77 TFs linked to 20 genes via 172 edges (betweenness = 10), highlighting FOXC1, GATA2, and YY1 as key regulators within the TF-DEG network (Figure 6C,D).
3.6. Identification of Disease Associations and Drug Candidates
To explore gene–disease associations, we used NetworkAnalyst with integrated DisGeNET data. Ten genes, IFNG, CD163, CCL4, CFB, CXCL9, HLA-C, ITGB3, GZMB, CD274, and PPARG, were frequently associated with five disorders: schizophrenia, hypersensitivity, pneumonia, mammary neoplasms, and liver cirrhosis (Figure 7A). Protein–drug interaction analysis identified potential UC therapeutics. Five drugs, Fontolizumab, Apremilast, Olsalazine, Glucosamine, and VIR201 were found to target IFNG (Figure 7B). Three additional compounds, SCV-07 (Golotimod), OspA lipoprotein, and S-(Dimethylarsenic)-cysteine targeted TLR2 (Figure 7C).
3.7. Molecular Docking for Drug Repurposing and ADME/T Profiling
To identify potential therapeutic candidates for UC, molecular docking was performed to assess binding affinities between FDA-approved compounds and hub protein targets. The final hub-proteins were selected as drug target receptors, and candidate ligands from DrugBank were screened for docking analysis. Among the two receptor proteins and eight candidate drugs tested, the top two lead compounds-Apremilast (CID: 11561674) and Golotimod (CID: 6992140) exhibited the most significant binding affinities, measured at −8.3 kcal/mol and −7.3 kcal/mol, respectively. The corresponding binding amino acid residues are detailed in Table 4, with molecular docking interactions visualized in Figure 8.
The ADME/T analysis compares the physicochemical, pharmacokinetic, and toxicological properties of Apremilast, Golotimod, Mesalamine, and Azathioprine. All compounds meet Lipinski’s rule, indicating favorable drug-likeness. Apremilast had the highest molecular weight (460.50 g/mol), whereas Mesalamine is the smallest (153.14 g/mol). Apremilast also demonstrates the highest intestinal absorption (83.79%), while Golotimod exhibits lower absorption (56%). None of the compounds significantly crossed the blood–brain barrier (BBB). Apremilast functioned as a CYP3A4 substrate/inhibitor, potentially influencing drug metabolism; Golotimod exhibited the highest total clearance (0.595 mL/min/kg), while Azathioprine had the lowest (0.148 mL/min/kg). The toxicity analysis revealed AMES toxicity for Mesalamine and Azathioprine, but not for Apremilast or Golotimod (Table 5).
4. Discussion
Genome-wide association studies (GWAS) have greatly advanced the understanding of polygenic disorders, particularly in identifying several genes associated with UC. These findings have provided key insights into the disease’s pathophysiology, which involves a complex interaction between genetic and environmental factors, leading to diverse clinical outcomes. Nonetheless, the exact etiology of UC remains incompletely understood and highly complex.
In this study, we identified 2340, 1526, and 2759 significant DEGs in datasets GSE135223, GSE112057, and GSE83687, respectively. Among them, 1279, 905, and 1512 were upregulated, and 1061, 621, and 1247 were downregulated. Gene Ontology functional classification revealed enrichment in pathways related to signal transduction, inflammatory response, membrane, extracellular region, carbohydrate binding, and transmembrane activity. Chronic inflammation in inflammatory bowel disease (IBD) may arise from microbial infections or impaired mucosal barrier function, compounding disease progression.
A PPI network analysis of DEGs identified 20 key genes- including GZMB, FPR2, FCGR1A, FAM124A, EPAS1, CXCL9, CD274, CCL4, ARG1, TLR8, S100A9, S100A12, PRF1, PPARG, LILRB2, ITGB3, HLA-C, TLR2, IFNG, and CD163, among 141 genes with the highest interaction levels. Previous studies have demonstrated that these genes play a crucial role in the development and progression of UC and its related complications [36,37,38,39,40,41,42,43,44]. Among these 20 genes, our analysis focused on identifying the most promising biomarkers, with TLR2, IFNG, and CD163 emerging as pivotal in UC pathogenesis due to their involvement in immune response modulation and inflammation. The studies by Medzhitov (2001) and Vasselon & Detmers (2002) established that TLR2 (Toll-like receptor 2) plays a critical role in recognizing microbial components and initiating innate immune responses [45,46]. Dysregulation of TLR2 has been linked to the excessive activation of NF-κB signaling, resulting in an upsurge of pro-inflammatory cytokines that exacerbate mucosal inflammation in UC patients, as previously described by Tatiya-Aphiradee et al. (2018) [47]. Similarly, IFNG (Interferon-γ), a cytokine predominantly produced by T-cells and natural-killer (NK) cells, is integral to macrophage activation and antigen presentation [48]. Elevated IFNG expression levels have been associated with chronic inflammation and tissue damage in UC, leading to increased immune cell infiltration and cytokine imbalance, as reported by Tatiya-Aphiradee et al. (2018) [47]. Additionally, CD163, a scavenger receptor expressed on macrophages, plays a significant role in hemoglobin-haptoglobin complex clearance and anti-inflammatory modulation [49]. Recent findings by Zhang et al. (2025) indicate that CD163 expression in UC is often upregulated as a compensatory mechanism to counteract inflammation, but its overexpression may also contribute to fibrosis and impaired tissue repair [50]. Understanding the interplay between these genes provides essential insights into UC pathogenesis while offering potential therapeutic targets for future interventions. Furthermore, these three genes-TLR2, IFNG, and CD163-are abundantly expressed in colorectal cancer (CRC) tumor tissues. Previous studies suggest that both spontaneous and inflammation-driven CRC proliferation can be mitigated through gene knockdown and genetic deletion.
In Gene Ontology analysis, key biological processes such as signal transduction, innate immune response, and inflammatory response were identified as significant contributors to UC development and progression. Major cellular components, including the plasma membrane, extracellular region, and exosomes, play crucial roles in cellular communication, facilitating the transport of proteins, miRNAs, lncRNAs, and circRNAs from UC-affected to normal cells [51]. Exosomes influence proliferation, adhesion, invasion, and angiogenesis, thereby promoting UC tumor progression [52]. Moreover, a study by Liang et al. (2024) revealed that calcium ion binding activity is a critical molecular function, potentially serving as an independent risk factor for UC [53].
Pathway analysis uncovered key signaling mechanisms implicated in UC progression. Within the KEGG, processes such as immune system regulation, neutrophil degranulation, and cytokine signaling were identified as pivotal. Reactome pathway analysis highlighted osteoclast differentiation and the PI3K-Akt signaling pathway as major contributors, while WikiPathway emphasized the FA PI3K-Akt-mTOR signaling pathway and the complement system, both playing significant roles in UC development. Previously studies have demonstrated that these pathways collectively influence inflammation, immune response, and cellular survival, underscoring their relevance in UC pathogenesis [54,55,56].
Using NetworkAnalyst, we identified five key miRNAs with significant roles in UC and other diseases, including hsa-miR-34a-5p, hsa-miR-335-5p, hsa-miR-24-3p, hsa-miR-23a-5p, and hsa-miR-26a-5p, sourced from miRTarBase and TarBase databases. Notably, hsa-miR-34a-5p was found in both databases, indicating its central role in UC pathogenesis. Studies by Ojha et al. (2019) and Blauensteiner & Westermeier (2021) revealed that hsa-miR-34a-5p regulates inflammatory pathways, immune responses, and epithelial barrier integrity [57,58]. Its dysregulation enhances NF-κB signaling, cytokine production, and gut permeability by downregulating tight junction proteins, contributing to chronic inflammation and mucosal damage, making it a potential biomarker and therapeutic target [57]. Similarly, hsa-miR-335-5p, hsa-miR-24-3p, hsa-miR-23a-5p, and hsa-miR-26a-5p exhibit critical functions in UC, modulating inflammation, immune responses, and epithelial integrity. Sun & Huang (2019) demonstrated that hsa-miR-335-5p modulates cytokine expression [59], while He et al. (2023) presented evidence that hsa-miR-24-3p influences cell proliferation and apoptosis [60]. Additionally, studies by Marchese et al. (2018) and Li et al. (2024) identified hsa-miR-23a-5p as a key factor in T-cell activation and hsa-miR-26a-5p as essential for maintaining intestinal barrier function [61,62]. Their dysregulation may contribute to UC progression, highlighting their potential as biomarkers or therapeutic targets.
In this study, we identified FOXC1, GABPA, GATA2, and SUPT5H as key transcription factors implicated in the development of ulcerative colitis (UC) through the regulation of gene expression involved in inflammation, immune response, and epithelial barrier integrity. FOXC1 (Forkhead Box C1) plays a critical role in immune cell differentiation and inflammation, contributing to mucosal damage in UC [63]. GABPA (GA Binding Protein Transcription Factor Alpha Subunit) is integral to mitochondrial biogenesis and immune regulation, ensuring cellular homeostasis in inflamed colonic tissues [64]. GATA2 (GATA Binding Protein 2) is essential for hematopoiesis and immune cell function, impacting cytokine production and immune responses in UC [65]. SUPT5H, a transcription elongation factor, regulates RNA polymerase II activity and may modulate the expression of inflammatory genes [66]. The dysregulation of these TFs can result in impaired epithelial repair, increased inflammation, and disrupted gut homeostasis, making them potential biomarkers and therapeutic targets in UC management.
Additionally, we identified two potential drugs capable of binding to hub proteins and regulating their activity effectively. Apremilast (CID: 11561674) and Golotimod (CID: 6992140) emerged as promising therapeutic agents for UC treatment, targeting key inflammatory pathways. Apremilast, a PDE4 inhibitor, modulates cAMP levels, downregulating pro-inflammatory cytokines like IFNG (Interferon-γ) to reduce immune activation, prevent tissue damage, and promoting mucosal healing [67,68]. Meanwhile, Golotimod, an immunomodulatory peptide, inhibits TLR2 signaling, curbing excessive cytokine production and enhancing epithelial barrier function [69]. Both drugs exhibit strong potential for inflammation control and UC remission maintenance, warranting further clinical investigation. A comparative analysis of pharmacokinetic and physicochemical properties supports Apremilast and Golotimod as promising candidates for UC treatment. Both compounds conform to Lipinski’s rule, signifying good drug-likeness without violations. Unlike control drugs Mesalamine and Azathioprine, which demonstarted AMES toxicity, Apremilast and Golotimod remain non-mutagenic, reinforcing their potential as safer and more effective therapeutic options for UC management.
Previous studies have identified several UC biomarkers, including HNF4A, ICAM1, and RNF186 for epithelial barrier integrity [14,16], as well as calprotectin, CRP, and OSM for diagnosis and treatment response [39,41]. In contrast, our study highlights TLR2, IFNG, and CD163 as key hub genes with immune-regulatory roles. Notably, CD163 links macrophage polarization and fibrosis to UC pathology [49,50], providing a novel insight. Furthermore, drug–target analysis suggests that compounds such as Apremilast and Golotimod may modulate these pathways, indicating potential for therapeutic repurposing [47,48] (Table 6).
This study is entirely based on secondary bioinformatics analyses of publicly available transcriptomic datasets. As such, it lacks experimental validation in cell-based or animal models, and the identified biomarkers and drug candidates require further confirmation in clinical cohorts. Additionally, dataset heterogeneity and sample size variations may introduce bias, limiting generalizability. Despite these limitations, the findings provide clinically relevant insights. The identification of TLR2, IFNG, and CD163 as hub genes suggests their potential utility as predictive biomarkers for ulcerative colitis diagnosis and disease monitoring. Moreover, the computational prediction of Apremilast and Golotimod as promising therapeutic agents highlights a possible drug repurposing strategy that could complement current therapies, reduce treatment resistance, and support personalized medicine approaches in UC management.
5. Conclusions
This study identified key hub genes (TLR2, IFNG, CD163), transcription factors (FOXC1, GABPA, GATA2, SUPT5H), and microRNAs (hsa-miR-34a-5p, hsa-miR-335-5p, hsa-miR-24-3p, hsa-miR-23a-5p, hsa-miR-26a-5p) that regulate ulcerative colitis progression through immune regulation and PI3K-Akt-mTOR signaling pathways. Molecular docking predicted Apremilast and Golotimod as promising therapeutic candidates, with favorable pharmacokinetic and safety profiles compared to conventional drugs. Although limited by its in silico design and lack of experimental validation, this study provides insights into potential biomarkers and therapeutic targets that may guide personalized diagnosis and treatment strategies in UC, warranting further preclinical and clinical validation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Conrad K. Roggenbuck D. Laass M.W. Diagnosis and Classification of Ulcerative Colitis Autoimmun. Rev.20141346346610.1016/j.autrev.2014.01.02824424198 · doi ↗ · pubmed ↗
- 2James S.L. van Langenberg D.R. Taylor K.M. Gibson P.R. Characterization of Ulcerative Colitis—Associated Constipation Syndrome (Proximal Constipation)JGH Open 2018221722210.1002/jgh 3.1207630483593 PMC 6207018 · doi ↗ · pubmed ↗
- 3Hindryckx P. Jairath V. D’Haens G. Acute Severe Ulcerative Colitis: From Pathophysiology to Clinical Management Nat. Rev. Gastroenterol. Hepatol.20161365466410.1038/nrgastro.2016.11627580687 · doi ↗ · pubmed ↗
- 4Morton H. Crohn’s Disease and Environmental Factors in the New Zealand Context: A Thesis Presented in Partial Fulfilment of the Requirements for the Degree of Doctoral of Philosophy Ph.D. Thesis Massey University Manawatū, New Zealand 2023
- 5Singh S. Dulai P.S. Ulcerative Colitis: Clinical Manifestations and Management Yamada’s Textbook of Gastroenterology Wiley-Blackwell Hoboken, NJ, USA 2022 Volume 312481293
- 6Rogler G. Chronic Ulcerative Colitis and Colorectal Cancer Cancer Lett.201434523524110.1016/j.canlet.2013.07.03223941831 · doi ↗ · pubmed ↗
- 7Yashiro M. Ulcerative Colitis-Associated Colorectal Cancer World J. Gastroenterol.2014201638910.3748/wjg.v 20.i 44.1638925469007 PMC 4248182 · doi ↗ · pubmed ↗
- 8Jess T. Gamborg M. Munkholm P. Sørensen T.I.A. Overall and Cause-Specific Mortality in Ulcerative Colitis: Meta-Analysis of Population-Based Inception Cohort Studies Am. J. Gastroenterol.200710260961710.1111/j.1572-0241.2006.01000.x 17156150 · doi ↗ · pubmed ↗
