Identification of positive cofactor 4 as a diagnostic and prognostic biomarker associated with immune infiltration in hepatocellular carcinoma
Liangliang Bai, Guan Liu, Gang Dou, Xiaojun He, Chenyu Gong, Hongbin Zhang, Kai Tan, Xilin Du

TL;DR
This study identifies PC4 as a potential biomarker for diagnosing and predicting outcomes in liver cancer patients, and links it to immune cell activity and treatment response.
Contribution
The study introduces PC4 as a novel diagnostic and prognostic biomarker in hepatocellular carcinoma and constructs an eight-gene risk signature for clinical use.
Findings
PC4 levels are significantly higher in HCC and correlate with worse prognosis and advanced cancer stages.
PC4 can distinguish tumor from normal tissue with high accuracy (AUC of 0.965).
An eight-gene risk signature was developed and validated for predicting HCC patient outcomes.
Abstract
Human positive cofactor 4 (PC4) is associated with the development and therapeutic resistance of several malignancies. However, the role of PC4 in hepatocellular carcinoma (HCC) remains obscure. The expression status of PC4 was explored in Gene Expression Omnibus and The Cancer Genome Atlas datasets. Subsequently, the prognostic and diagnostic significance of PC4 in HCC patients was analyzed. Functional enrichment analyses were conducted to explore biological functions and potential mechanisms. The CIBERSORT algorithm was used for immune infiltration analysis. The risk signature was constructed by LASSO-Cox regression and was validated with the International Cancer Genome Consortium dataset. Quantitative real-time polymerase chain reaction was used to verify the expression levels of all genes. Tumor Immune Dysfunction and Exclusion analysis evaluated immunotherapy response. Finally,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroptosis and cancer prognosis · RNA modifications and cancer · Cancer, Hypoxia, and Metabolism
Introduction
1
Primary liver cancer (PLC) is a common malignancy with high annual incidence and mortality. According to the latest cancer statistics, PLC accounted for 900,000 new cases and 830,000 deaths globally in 2020, posing a severe threat to human life and health [1]. Hepatocellular carcinoma (HCC), the most common histopathological type of PLC, is highly invasive and insidious, leading to a high number of patients with advanced-stage disease at diagnosis who miss the optimal timing for radical surgical treatment [2]. In addition, patients who undergo radical surgical treatment have a recurrence of nearly 70% during the first five postoperative years [3]. The past few years have witnessed unprecedented advances in research on therapeutic drugs, targeted therapy, and immunotherapy, which have improved patient prognosis to some extent. However, significant heterogeneity surrounds drug efficacy and resistance, emphasizing the need for further research [4,5]. Therefore, finding effective biomarkers and potential therapeutic targets is urgent to help diagnose HCC and improve patient prognosis.
Human positive cofactor 4 (human PC4, yeast SUB1), a nuclear protein highly conserved during evolution, was extracted by purification from the human upstream stimulatory activity fraction in 1994 [6]. Earlier studies had shown that PC4 participates in the initiation, elongation, and termination of RNA polymerase II–mediated transcription [7]. However, extensive research on the PC4 gene has determined that PC4 is also closely associated with DNA damage repair, chromatin formation, and cell cycle regulation [[8], [9], [10], [11]]. Given that PC4 can interact with the P53 gene and form a positive feedback loop [12], it has long been thought to be a tumor-suppressor gene. However, the tumor-promoting effect of PC4 has recently received much attention from scholars. A strong correlation has been revealed between PC4 and the proliferation, metastasis, and therapy resistance of malignant tumors such as esophageal squamous cell carcinoma, non–small cell lung cancer, pancreatic ductal adenocarcinoma, breast cancer, and prostate cancer [[13], [14], [15], [16], [17]]. However, the role of PC4 in HCC is still unknown.
This study explored the PC4 gene level in HCC and the association between gene expression and diagnosis, prognosis, and immune infiltration. Additionally, this study constructed and validated a risk signature for effective assessment of patient prognosis and immunotherapy response. Finally, establishment of the competing endogenous RNA (ceRNA) network facilitates understanding of the regulatory mechanism of PC4 in HCC.
Materials and methods
2
Data acquisition and preparation
2.1
The expression and clinical data of 374 HCC and 50 normal samples were acquired as the primary analysis dataset from The Cancer Genome Atlas (TCGA) (https://portal.gdc.cancer.gov/), and the GSE25097, GSE36376, and GSE39791 datasets derived from Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/) were applied to identify differentially expressed genes between HCC and normal tissue. Expression and clinical data of 260 HCC samples were obtained from the International Cancer Genome Consortium (ICGC) dataset (https://dcc.icgc.org/) to validate the risk signature. The Xena database (https://xenabrowser.net/datapages/) was used to obtain data on HCC patient progression-free survival (PFS) and disease-specific survival (DSS).
Differential expression analysis
2.2
The Tumor Immune Estimation Resource (TIMER) database (http://timer.cistrome.org/) was used to analyze PC4 gene expression in 33 malignant tumors and normal tissues. The “limma” package was adopted to analyze the differential expression of the PC4 gene between HCC and normal tissues from TCGA and four GEO datasets [18]. The association between gene expression and different clinicopathological features of HCC patients was also analyzed by the “limma” package.
Diagnostic and prognostic analysis
2.3
The “pROC” package was used to plot receiver operating characteristic (ROC) curves for assessing the ability of the PC4 gene to distinguish Stage I–IV tumor tissues from normal tissues [19]. Time-dependent ROC curves for predicting HCC patient overall survival (OS) were plotted by the “timeROC” package [20]. Independent prognostic factors in HCC patients were explored using univariate and multivariate analyses. Differences in survival were shown by Kaplan–Meier (K–M) curves.
Identification of differentially expressed genes
2.4
The differentially expressed genes (DEGs) of the PC4-high and PC4-low groups were identified with the “limma” package. The filter criteria were p-value being <0.05 and log_2_FC value being >1/<−1. Heatmaps, and volcano plots were used to visualize the results.
Enrichment analyses
2.5
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of DEGs were conducted to explore biological functions and potential mechanisms [21]. Gene set enrichment analysis (GSEA) was conducted to find PC4-associated signaling pathways [22].
Immune infiltration analysis
2.6
The “CIBERSORT” package was used to assess the level of immune infiltration of 22 immune cells in each HCC sample [23]. A box plot displayed the differences in immune cell infiltration between the PC4-high and PC4-low groups. The differences in the expression of 24 human leukocyte antigen (HLA) genes and eight immune checkpoint genes between the two groups were visualized in a box plot.
Construction and verification of risk signature
2.7
Genes coexpressed with PC4 were screened using the following criteria: a Pearson correlation coefficient >0.6/< −0.6 and a p-value <0.001. Differential expression analysis and univariate Cox analysis were conducted for screening for genes with high expression in HCC and associated with patient prognosis. The “glmnet” package was used to construct the risk signature through LASSO-Cox regression [24]. The ICGC dataset was used to validate the risk signature.
Prediction of immunotherapy response and drug sensitivity analysis
2.8
Based on the Tumor Immune Dysfunction and Exclusion (TIDE) database (http://tide.dfci.harvard.edu/), the patient response to immunotherapy was predicted using two common mechanisms of immune escape: T-cell dysfunction and exclusion. A larger TIDE score indicated a higher probability of immune escape and a worse response to immunotherapy in patients [25]. Drug sensitivity data were derived from Genomics of Drug Sensitivity in Cancer (https://www.cancerrxgene.org/). The “oncoPredict” package was used to analyze these data [26]. Box plots were used to visualize the results. The filter criterion was a p-value <0.001.
Construction of ceRNA network
2.9
With the ENCORI database (https://starbase.sysu.edu.cn/), the potential regulatory associations between microRNA (miRNA)–messenger RNA (mRNA) and long noncoding RNA (lncRNA)–miRNA were predicted [27]. The results were screened based on a correlation coefficient <−0.3 and a p-value <0.001. The miRNA–mRNA and lncRNA–miRNA–mRNA networks were visualized with the Cytoscape software.
Quantitative real-time polymerase chain reaction analysis
2.10
Five cell lines (L02, BEL7402, HCCLM3, HepG2, and Huh7) were purchased from Procell Life Science & Technology (Wuhan, China) and cultured according to the manufacturer's instructions. Six pairs of fresh specimens were collected from the Second Affiliated Hospital of Air Force Military Medical University. Quantitative real-time polymerase chain reaction was used to verify the expression levels of all genes. The primer sequence information is listed in Table S1.
Statistical analysis
2.11
In this study, all data analyses were conducted using R software (version 4.1.3). Differences between groups were compared using the Wilcoxon test. K–M analysis and log-rank tests were adopted for intergroup survival comparison. Independent prognostic factors were screened by univariate and multivariate analyses. A p-value of <0.05 was considered statistically significant.
Results
3
High PC4 expression in tumor tissues
3.1
Pan-cancer analysis of the Tumor Immune Estimation Resource database showed significant differences in PC4 expression between 13 malignant tumor tissues and the adjacent normal ones. PC4 was highly expressed in nine malignancies, including cholangiocarcinoma, esophageal cancer, colon cancer, and HCC, while it was expressed at low levels in kidney chromophobe, kidney renal clear cell carcinoma, kidney renal papillary cell carcinoma, and thyroid cancer (Fig. 1A). According to differential expression analysis from the GEO and TCGA datasets, PC4 gene expression was significantly higher in HCC than in adjacent normal liver tissue (Fig. 1B, C). In addition, quantitative real-time polymerase chain reaction showed that the expression levels of PC4 in HCCLM3, HepG2, and Huh7 were significantly higher than in LO2 (Fig. 1D). The same results were obtained for HCC tissue samples (Fig. 1E).Fig. 1. Expression status of positive cofactor 4 (PC4) in cancer. (A) Observation of PC4 expression level in pan-cancer from the Tumor Immune Estimation Resource database. (B) PC4 expression in hepatocellular carcinoma (HCC) and normal tissues from three Gene Expression Omnibus datasets (GSE25097, GSE36376, and GSE39791). (C) PC4 expression in HCC (n = 374) and adjacent normal liver tissues (n = 50) from The Cancer Genome Atlas database. (D) PC4 expression in five cell lines (LO2, BEL7402, HCCLM3, HepG2, and Huh7). (E) PC4 expression in six pairs of HCC and adjacent normal liver tissues. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001.Fig. 1
Association of PC4 expression with the clinicopathological features and prognosis of HCC patients
3.2
Given the high PC4 gene expression in HCC, we further explored the association between PC4 expression and the clinicopathological features of HCC patients. The PC4 expression was not associated with the age or gender of HCC patients but was positively correlated with the pathological grade and clinical stage (Fig. S1). An association was also found between the expression of PC4 and the HCC patient prognosis. The PC4-high group had significantly lower OS, PFS, and DSS than the PC4-low group (Fig. 2A). The cumulative K–M curves for the clinical stages of combined HCC patients showed the same results. Patients in the PC4-high group at a late stage had the shortest survival time, whereas those in the PC4-low group at an early stage had the longest survival time (Fig. 2B). Subsequently, we assessed the influence of PC4 on HCC patient prognosis in different subgroups of age, gender, pathological grade, and clinical stage. The results implied a worse prognosis in the PC4-high group, irrespective of the subgroup (Fig. 2C).Fig. 2. Correlation between positive cofactor 4 (PC4) expression level and survival and prognosis of hepatocellular carcinoma patients. (A) Kaplan–Meier survival curves for overall survival, progression-free survival, and disease-specific survival between the PC4-high and PC4-low groups. (B) Cumulative survival curves between the PC4-high and PC4-low groups under different clinical stages. (C) Kaplan–Meier survival curves for overall survival in different age, gender, grade, and stage subgroups.Fig. 2
Diagnostic and prognostic value of PC4
3.3
The area under the ROC curve (AUC) values were all larger than 0.9, suggesting that PC4 could distinguish Stage I-IV tumor tissues from normal tissues (Fig. 3A, B). In addition, PC4 exhibited good performance in predicting HCC patient prognosis. The 1-, 3-, and 5-year AUCs were 0.69, 0.61, and 0.65, respectively (Fig. 3C). According to univariate and multivariate analyses, PC4 expression was an independent prognostic factor for HCC patients (Fig. 3D). Subsequently, we further constructed a nomogram to predict the 1-, 3-, and 5-year survival of HCC patients, based on the clinicopathological features and PC4 expression level (Fig. 3E). The calibration curve showed that the nomogram had good predictive power (Fig. 3F).Fig. 3. Diagnostic and prognostic value of positive cofactor 4 in hepatocellular carcinoma (HCC). (A) Receiver operating characteristic (ROC) curve for diagnosis to distinguish HCC from adjacent normal liver tissue. (B) ROC curve for diagnosis to distinguish Stage Ⅰ-Ⅳ HCC from adjacent normal liver tissue. (C) Time-dependent ROC curve to predict 1-, 3-, and 5-year survival rates of HCC patients. (D) Univariate and multivariate Cox analyses to identify independent prognostic factors. (E) Nomogram for predicting 1-, 3-, and 5-year survival rates of HCC patients by clinicopathological features and positive cofactor 4 expression level. (F) Calibration curve for evaluating the nomogram.Fig. 3
Identification of DEGs and enrichment analysis
3.4
We identified 2856 DEGs (2708 upregulated and 148 downregulated) in HCC (Table S2). A heatmap was generated showing the top 50 most significantly upregulated and downregulated genes in HCC (Fig. 4A). A volcano plot was generated to visualize the distribution of all DEGs (Fig. 4B). GO and KEGG analyses and GSEA were conducted to explore the potential mechanism of PC4 in HCC. GO enrichment analysis revealed that the DEGs were mainly enriched in biological processes including “nuclear division”, “chromosome segregation”, and “organelle fission”. KEGG enrichment analysis revealed that the DEGs were significantly enriched in the “cell cycle”, “glycosphingolipid biosynthesis-lacto and neolacto series”, and “cytokine–cytokine receptor interaction” pathways (Fig. 4C). According to the GSEA, the PC4-high group was primarily enriched in “cell cycle”, “cytokine–cytokine receptor interaction”, and “extracellular matrix (ECM) receptor interaction” signaling pathways, whereas the PC4-low group was enriched in “glycine, serine and threonine metabolism”, “fatty acid metabolism”, and “primary bile acid biosynthesis” (Fig. 4D).Fig. 4. Identification of differential gene and enrichment analysis. (A) Heatmap showing 100 differentially expressed genes between the positive cofactor 4 (PC4)-high and PC4-low groups. (B) Volcano plot displaying the distribution of all differentially expressed genes based on log_2_FC > 1/< −1 and p < 0.05. (C) Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analysis of all differentially expressed genes. (D) Gene set enrichment analysis for pathway enrichment between PC4-high and PC4-low groups.Fig. 4
Immune infiltration analysis between groups
3.5
The association between PC4 and tumor-infiltrating immune cells was further analyzed using the “CIBERSORT” algorithm. Memory B cells, activated dendritic cells, and M0 macrophages were differentially expressed between the PC4-high and PC4-low groups (Fig. S2A). Subsequently, correlation analysis between PC4 expression, 24 HLA genes, and eight immune checkpoint genes showed that all genes were overexpressed in the PC4-high group (Figs. S2C and D). The difference in TIDE scores indicated a higher incidence of immune escape and a lower response rate to immunotherapy in the PC4-high group (Fig. S2B).
Construction and validation of an eight-gene risk signature
3.6
To predict HCC patient prognosis, we constructed a risk signature by coexpression analysis. A heatmap showed that 92 PC4-related genes were overexpressed in HCC (Fig. S3A). The univariate Cox analysis further revealed that 77 genes were related to HCC patient prognosis (Fig. S3B). Subsequently, we constructed and validated an eight-gene risk signature (risk score = [0.074 ∗ CEP55] + [0.091 ∗ TRIP13] + [0.100 ∗ BRIX1] + [0.081 ∗ STIP1] + [0.191 ∗ PIGU] + [0.038 ∗ CFL1] + [0.017 ∗ RBM17] + [0.178 ∗ OLA1]) (Table S3). These eight genes are highly expressed in HCC samples from TCGA (Fig. 5A). Meanwhile, the same results were shown in cell lines. The expression of all genes in HCCLM3, HepG2, and Huh7 was significantly higher than in LO2 (Fig. 5B). The ROC curve in the training set confirmed the predictive power of this risk signature for HCC patient prognosis. The 1-, 2-, and 3-year AUC values were 0.79, 0.70, and 0.68, respectively (Fig. 6B). The number of deaths in HCC patients increased correspondingly with an increase in the risk score (Fig. 6A). The same results were obtained during K–M survival curve analysis. The high-risk group presented a significantly worse prognosis than the low-risk one (Fig. 6C). According to univariate and multivariate analysis, the risk score was an independent prognostic factor for HCC patients (Fig. 6D, E). Principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE) analysis revealed that the risk signature could distinguish between high- and low-risk patients (Fig. 6F, G). The aforementioned results from the training set were confirmed in the validation set. The 1-, 2-, and 3-year AUC values of the validation set were 0.75, 0.74, and 0.73, respectively (Fig. 7A-G).Fig. 5. Expression levels of eight genes in the risk signature. (A) Expression levels of eight genes in hepatocellular carcinoma (n = 374) and adjacent normal liver tissues (n = 50) from The Cancer Genome Atlas database. (B) Expression levels of eight genes in five cell lines (LO2, BEL7402, HCCLM3, HepG2, and Huh7). ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001.Fig. 5. Fig. 6Survival analysis of the eight-gene risk signature in the training cohort (The Cancer Genome Atlas cohort). (A) Risk score distribution, survival status, and gene expression in high-risk and low-risk groups. (B) Time-dependent receiver operating characteristic curve to predict 1-, 2-, and 3-year survival rates of hepatocellular carcinoma patients. (C) Overall survival Kaplan–Meier curve between the high-risk and low-risk groups. (D, E) Univariate (D) and multivariate (E) Cox analysis to identify independent prognostic factors. (F, G) Principal component analysis (F) and t-distributed stochastic neighbor embedding analysis (G) for the two groups.Fig. 6. Fig. 7Survival analysis of the eight-gene risk signature in the test cohort (International Cancer Genome Consortium cohort). (A) Risk score distribution, survival status, and gene expression in the high-risk and low-risk groups. (B) Time-dependent receiver operating characteristic curve to predict 1-, 2-, and 3-year survival rates of hepatocellular carcinoma patients. (C) Overall survival Kaplan–Meier curve between the high-risk and low-risk groups. (D, E) Univariate (D) and multivariate (E) Cox analysis to identify independent prognostic factors. (F, G) Principal component analysis (F) and t-distributed stochastic neighbor embedding analysis (G) for the two groups.Fig. 7
Additionally, the influence of the risk signature on HCC patient prognosis in different subgroups of age, gender, pathological grade, and clinical stage was further analyzed. The results revealed a worse prognosis in the high-risk group in any subgroup in the training set and validation set (Fig. S4). The clinical data of HCC patients from TCGA (training set) and the ICGC datasets (validation set) are provided in Table S4. Subsequently, we constructed a nomogram with the risk signature and clinicopathological features to predict the survival rates of HCC patients at 1, 2, and 3 years (Fig. S5A). The calibration curve revealed that the nomogram had good predictive power (Fig. S5B).
Finally, TIDE analysis assessed the correlation between the risk signature and immunotherapy response. Patients with high scores were unlikely to respond to immunotherapy, while the opposite was true for patients with low scores (Fig. S5C). Drug sensitivity analysis showed that patients in the high-risk group were sensitive to sorafenib, axitinib, cytarabine, entinostat, fludarabine, irinotecan, obatoclax mesylate, oxaliplatin, ribociclib, and selumetinib. Conversely, patients in the low-risk group were sensitive to 5-fluorouracil, alpelisib, cediranib, dasatinib, fulvestrant, lapatinib, osimertinib, pevonedistat, pictilisib, and taselisib (Fig. S6).
Construction of ceRNA network based on PC4
3.7
To explore the potential upstream regulators of PC4, we constructed a PC4-related ceRNA network. First, 73 possible upstream miRNAs were obtained from the ENCORI online database, and the miRNA–mRNA network was constructed through Cytoscape software (Fig. 8A). Correlation analysis between PC4 and miRNAs further identified miRNA-101-3p as a potential regulatory miRNA of PC4 (Fig. S7A). Subsequently, we obtained 49 possible lncRNAs to explore the upstream regulatory lncRNAs of miRNA-101-3p. Correlation analysis of miRNA-101-3p with lncRNAs ultimately identified five potential lncRNAs. The expression of these lncRNAs was positively correlated with PC4 and negatively with miRNA-101-3p (Fig. S7B–F, S8). Finally, the lncRNA/miRNA-101-3p/PC4 network was constructed (Fig. 8B). Boxplots and K–M curves were generated to visualize the differences in expression and OS of miRNA-101-3p and the five lncRNAs. miRNA-101-3p had low expression in HCC, and HCC patients with low expression showed a worse prognosis. In addition, all five lncRNAs were highly expressed in HCC and predicted a poorer prognosis (Fig. 8C, D).Fig. 8. Construction of positive cofactor 4 (PC4)-associated competing endogenous RNA network. (A) microRNA (miRNA)–messenger RNA (mRNA) network composed of PC4 and 73 miRNAs. (B) long noncoding RNA (lncRNA)–miRNA–mRNA network composed of PC4, miRNA-101, and five lncRNAs. (C) Expression differences of miRNA-101 and five lncRNAs in hepatocellular carcinoma and adjacent normal liver tissues based on The Cancer Genome Atlas cohort. (D) Overall survival Kaplan–Meier curves of miRNA-101 and five lncRNAs based on The Cancer Genome Atlas cohort.Fig. 8
Discussion
4
As the most common histopathological type of PLC, HCC is highly invasive and insidious, accounting for the difficulty clinicians face in diagnosing and treating HCC. Alpha-fetoprotein (AFP) is the most commonly used diagnostic and screening biomarker for HCC. However, its sensitivity and specificity still do not meet clinical needs [28]. In recent years, targeted therapy and immunotherapy for HCC have achieved remarkable efficacy, but the heterogeneity in efficacy and drug resistance remains to be addressed [4,5]. Thus, searching for effective biomarkers and potential therapeutic targets is urgent to offer new strategies for diagnosing and treating HCC. PC4 is a highly conserved nuclear protein implicated in transcriptional regulation, DNA damage repair, and chromatin formation [29]. Moreover, overexpression of PC4 is significantly associated with proliferation, metastasis, and treatment resistance in various malignancies. PC4 accelerates the development of pancreatic ductal adenocarcinoma by activating the mTOR/p70s6k signaling pathway [16]. PC4 can also bind directly to the c-Myc promoter and regulate its transcription, which induces the Warburg effect to promote breast cancer progression [30]. However, the role of PC4 in HCC remains largely understudied. This study comprehensively analyzed the diagnostic and prognostic value of PC4 in HCC and constructed a risk signature to effectively assess the prognosis of HCC patients. The construction of the ceRNA network facilitated the understanding of the regulatory mechanisms of PC4 in HCC.
PC4 is highly expressed in various malignancies, with the most significant differential expression in HCC. In addition, a positive correlation was found between the PC4 expression and the pathological grade and clinical stage of HCC patients, suggesting that PC4 potentially participates in the development of HCC. The present study analyzes the value of PC4 in the diagnosis and prognosis of HCC. The diagnostic ROC curve substantiated the ability of PC4 to distinguish tumor tissue from normal tissue. Survival analysis of OS, PFS, and DSS showed that high PC4 expression correlated with a poor HCC patient prognosis. Additionally, the prognostic ROC curve and nomogram indicated that PC4 could effectively predict the 1-, 3-, and 5-year survival rates of HCC patients. The aforementioned results imply that PC4 has huge potential as a diagnostic and prognostic biomarker for HCC.
To understand the role of PC4 in HCC, we identified DEGs and conducted functional enrichment analyses. The PC4-high group was mainly associated with the “cell cycle”, “cytokine–cytokine receptor interaction”, and “ECM receptor interaction” signaling pathways. This result is consistent with the functions of PC4 documented in the literature. Subsequently, we further explored the association between the expression level of PC4, immune infiltration, and the immunotherapy response. The PC4 expression positively correlated with numerous HLA genes and immune checkpoint–related genes. Additionally, patients with high PC4 expression had a higher probability of immune escape, a poorer response to immunotherapy, and shorter survival. This result is caused by the tumor-suppressive microenvironment induced by the overexpression of immune checkpoint genes [31,32].
We found that the ability of a single gene to assess HCC patient prognosis was limited. Therefore, to improve the predictive power, we constructed and validated an eight-gene risk signature (risk score = [0.074 ∗ CEP55] + [0.091 ∗ TRIP13] + [0.100 ∗ BRIX1] + [0.081 ∗ STIP1] + [0.191 ∗ PIGU] + [0.038 ∗ CFL1] + [0.017 ∗ RBM17] + [0.178 ∗ OLA1]). Most genes in this risk signature are strongly associated with the prognosis of HCC or other malignancies. CEP55 is located in the chromosomal region 10q23 and is important in cell division [33]. CEP55 promotes HCC invasion and migration through the JAK2/STAT3/MMPs signaling pathway [34]. As an AAA + ATPase that can promote the assembly or degradation of protein complexes, TRIP13 is significantly upregulated in various malignancies, including HCC [35]. Moreover, TRIP13 can interact with ACTN4 to induce HCC invasion and metastasis through the AKT/mTOR pathway [36]. A study that constructed a prognostic risk signature for HCC based on RNA-binding proteins showed an association between BRIX1 and poor prognosis of HCC patients [37]. STIP1, called heat shock protein–organizing protein, is reportedly overexpressed in HCC and accelerates cancer-cell growth and migration by interacting with Axin to activate β-catenin/TCF signaling [38]. In addition, it has been reported that STIP1 in serum can be used as an indicator of microvascular invasion and can be used to predict prognosis and response to trans arterial chemoembolisation treatment in HCC patients [39]. PIGU, also called cell division cycle 91-like 1 (CDC91L1), promotes HCC progression by activating the NF-κB pathway and promoting immune escape [40]. CFL1 is an important actin depolymerization factor family member, widely found in eukaryotes. Hypoxia can induce CFL1 expression and thus activate the PLD1/AKT pathway to promote HCC progression [41]. RBM17 is also known as splicing factor 45 (SPF45). Knockdown of RBM17 significantly inhibits cancer-cell proliferation and arrested cells in the G2/M phase [42]. Moreover, OLA1 is implicated in cellular processes such as protein translation, signal transduction, and cell proliferation [43]. OLA1 is significantly expressed in HCC, and knockdown of this gene suppresses the progression of cancer cells [44]. These studies overlap in their assertion that the risk score correlates with HCC patient prognosis. Moreover, various analyses have shown that this risk score is reliable in predicting the prognosis of HCC patients. TIDE analysis showed that patients with high scores were more likely to be unresponsive to immunotherapy. Drug sensitivity analysis screened the drugs to which the high-risk and low-risk groups are sensitive. These results will help guide clinicians in making treatment decisions.
To better understand the role of PC4 in HCC, we constructed a lncRNA/miRNA-101-3p/PC4 network. Most regulatory relationships in this network have been reported in malignancies. The lncRNA SNHG6 improves E2F8 expression by associating with miR-101-3p, thus promoting proliferation and angiogenesis in cholangiocarcinoma [45]. In HCC, SNHG6 upregulates ZEB1 expression by binding miR-101-3p and, in combination with downregulation of Smad7, induces epithelial–mesenchymal transition, speeding up cancer-cell metastasis [46]. An SNX16- and PAPOLG-based study showed that the GSEC/miR-101-3p/SNX16/PAPOLG axis is significantly associated with HCC patient prognosis [47]. Reportedly, the lncRNA SNHG1 interacts with miR-101-3p to promote tumor progression. The SNHG1/miR-101-3p/SOX9/Wnt/β-catenin axis promotes the progression of non–small cell lung cancer [48]. miR-101-3p is closely associated with programmed cell death, such as autophagy and apoptosis, playing an important role in HCC [49]. miR-101-3p can bind PYGB and inhibit its expression, while the overexpression of PYGB, in turn, inhibits the regulation of miR-101-3p on HCC cell invasion, proliferation, and migration [50]. However, the association between miR-101–3p and PC4 in HCC remains unknown. The construction of this lncRNA/miRNA-101-3p/PC4 network may bridge that knowledge gap.
In conclusion, this study demonstrates the potential of PC4 as a diagnostic and prognostic biomarker for HCC. Moreover, the PC4-based risk signature could guide clinicians in evaluating the prognosis and response to immunotherapy of HCC patients.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author contributions
LB, GL, and GD conceived the study. XH, CG, and HZ conducted data collection and collation. LB and GL were responsible for data analysis. LB wrote the manuscript, and KT and XD reviewed and revised it. All authors reviewed the manuscript. LB, GL, and GD contributed equally to this work.
Acknowledgments
We are grateful to the The Cancer Genome Atlas, Gene Expression Omnibus, and International Cancer Genome Consortium databases for providing data for this study. We thank the “ENCORI” and “Tumor Immune Dysfunction and Exclusion” databases and “R” and “Cytoscape” softwares for their help in data analysis.
Declaration of competing interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential competing interest.
Data availability statement
In this study, the data for HCC were freely obtained from publicly accessible datasets: TCGA (https://portal.gdc.cancer.gov/), GEO (https://www.ncbi.nlm.nih.gov/geo/), and the ICGC database (https://dcc.icgc.org/).
Ethics statement
Ethics approval was waived for this study because our data were retrieved from public databases.
Informed consent
Informed consent was not applicable because our data were from public databases.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Sung H.Ferlay J.Siegel R.L.Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries CA Cancer J Clin 713202120924910.3322/caac.2166033538338 · doi ↗ · pubmed ↗
- 2Park J.W.Chen M.Colombo M.Global patterns of hepatocellular carcinoma management from diagnosis to death: the BRIDGE Study Liver Int 35920152155216610.1111/liv.1281825752327 PMC 4691343 · doi ↗ · pubmed ↗
- 3Roayaie S.Obeidat K.Sposito C.Resection of hepatocellular cancer ≤2 cm: results from two Western centers Hepatology 57420131426143510.1002/hep.2583222576353 PMC 3442120 · doi ↗ · pubmed ↗
- 4Huang A.Yang X.R.Chung W.Y.Targeted therapy for hepatocellular carcinoma Signal Transduct Targeted Ther 51202014610.1038/s 41392-020-00264-x PMC 741954732782275 · doi ↗ · pubmed ↗
- 5Cheng A.L.Hsu C.Chan S.L.Challenges of combination therapy with immune checkpoint inhibitors for hepatocellular carcinoma J Hepatol 722202030731910.1016/j.jhep.2019.09.02531954494 · doi ↗ · pubmed ↗
- 6Ge H.Roeder R.G.Purification, cloning, and characterization of a human coactivator, PC 4, that mediates transcriptional activation of class II genes Cell 783199451352310.1016/0092-8674(94)90428-68062391 · doi ↗ · pubmed ↗
- 7Calvo O.Manley J.L.The transcriptional coactivator PC 4/Sub 1 has multiple functions in RNA polymerase II transcription EMBO J 24520051009102010.1038/sj.emboj.760057515692559 PMC 554125 · doi ↗ · pubmed ↗
- 8Mortusewicz O.Roth W.Li N.Recruitment of RNA polymerase II cofactor PC 4 to DNA damage sites J Cell Biol 1835200876977610.1083/jcb.20080809719047459 PMC 2592824 · doi ↗ · pubmed ↗
