# Analysis of HERV-K (HML2) Expression in Colorectal Cancer Samples

**Authors:** Valentina S. Obrezanenko, Polina M. Shulga, Anastasia G. Volkova, Anastasia A. Primova, Yulia A. Remizova, Ivan O. Meshkov, Alexandra D. Kikot, Daria A. Tarasova, Ekaterina S. Bolashova, Alexey A. Ivashechkin, Antonida V. Makhotenko, Ekaterina A. Snigir, Yulia A. Masyukova, Elizaveta I. Radion, Olesya A. Kuznetsova, Maria S. Cheporova, Michail Y. Fedyanin, Alexey A. Tryakin, Valentin V. Makarov, Vladimir S. Yudin, Anton A. Keskinov, Anna S. Makarova

PMC · DOI: 10.3390/epigenomes10010011 · Epigenomes · 2026-02-12

## TL;DR

This study examines the expression of HERV-K (HML2) retrotransposons in colorectal cancer and finds links to epigenetic regulation and cancer subtypes.

## Contribution

The study identifies HML-2 expression patterns and their epigenetic regulation in different molecular subtypes of colorectal cancer.

## Key findings

- HML-2 expression is differentially regulated in colorectal cancer molecular subtypes.
- Inverse associations were found between HML-2 locus methylation and expression.
- HML-2 activation correlates with RIG-I-like receptor signaling in certain CRC subtypes.

## Abstract

Background: HML-2 subgroup mobile genetic elements of the HERV-K family were described to participate in carcinogenesis processes, but their expression and epigenetic regulation in molecular subtypes of colorectal cancer (CRC) remain partly characterized. The present study aimed to evaluate the expression of HML-2 elements using RNA-sequencing data in paired tumor and normal intestinal tissue samples from 63 patients with CRC to identify patterns of the retrotransposons’ activity in different molecular subtypes (CMSs). Methods: RNA-sequencing and DNA methylation data were analyzed for paired CRC and normal tissue samples. HERV-K expression was assessed using three bioinformatics tools: Telescope (version 1.0.3), TEtranscripts (version 2.2.3), GeneTEFlow (version 2020). Molecular tumor subtypes were defined using the CMScaller (version 0.99.2) program. The results of the HML-2 loci expression analysis were supplemented with the HML-2 proteins expression data obtained by quantitative RT-PCR. Results: HML-2 expression assessment by GeneTEFlow (version 2020), TECount (version 2.2.3) and Telescope (version 1.0.3) showed high convergence: the Pearson correlation coefficient for each tool exceeded 0.88. Several HML-2 loci were identified as differentially expressed in CRC samples of different CMS. The PCR results confirmed an increase in HML-2 expression in tumor tissues. For all CMSs, an inverse association was detected between differential methylation of CpG sites and differential expression of HML-2 loci. Associations of HML-2 expressions with differentially expressed genes in which they are located were found, and for a number of such genes an inverse relationship between the expression level and the methylation level of their promoters were demonstrated, and data on the involvement in the pathogenesis of CRC were described: CR1, CD48, TTLL3, ABCC2 and ZNF420. Expression signatures associated with the activity of the RIG-I-like receptor signaling cascade were identified in CMS1–3 CRC samples, which may indicate the possible implementation of viral mimicry against the background of HML-2 activation. Conclusions: Analysis of the expression of HML-2 and its association with CpG methylation contributes to a comprehensive interpretation of the CRC pathogenesis mechanisms.

## Linked entities

- **Genes:** CR1 (complement C3b/C4b receptor 1 (Knops blood group)) [NCBI Gene 1378], CD48 (CD48 molecule) [NCBI Gene 962], TTLL3 (tubulin tyrosine ligase like 3) [NCBI Gene 26140], ABCC2 (ATP binding cassette subfamily C member 2) [NCBI Gene 1244], ZNF420 (zinc finger protein 420) [NCBI Gene 147923]
- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Genes:** CD200R1 (CD200 receptor 1) [NCBI Gene 131450] {aka CD200R, HCRTR2, MOX2R, OX2R}, ZNF585A (zinc finger protein 585A) [NCBI Gene 199704] {aka Zfp27}, BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, ASRGL1 (asparaginase and isoaspartyl peptidase 1) [NCBI Gene 80150] {aka ALP, ALP1, CRASH}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, ENPP7P11 (ectonucleotide pyrophosphatase/phosphodiesterase 7 pseudogene 11) [NCBI Gene 100421806], RCC2P6 (regulator of chromosome condensation 2 pseudogene 6) [NCBI Gene 100421131], IGHG4 (immunoglobulin heavy constant gamma 4 (G4m marker)) [NCBI Gene 3503], CR1L (complement C3b/C4b receptor 1 like) [NCBI Gene 1379], ENPP7P1 (ectonucleotide pyrophosphatase/phosphodiesterase 7 pseudogene 1) [NCBI Gene 100421823], RPL32 (ribosomal protein L32) [NCBI Gene 6161] {aka L32, PP9932, eL32}, DHRS4-AS1 (DHRS4 antisense RNA 1) [NCBI Gene 55449] {aka AS1DHRS4, C14orf167, C14orf67, DHRS4AS1, PRO1488}, MLH1 (mutL homolog 1) [NCBI Gene 4292] {aka COCA2, FCC2, HNPCC, HNPCC2, LYNCH2, MLH-1}, Cd4 (CD4 antigen) [NCBI Gene 12504] {aka L3T4, Ly-4}, ZNF600 (zinc finger protein 600) [NCBI Gene 162966] {aka KR-ZNF1}, CXCL8 (C-X-C motif chemokine ligand 8) [NCBI Gene 3576] {aka GCP-1, GCP1, IL8, LECT, LUCT, LYNAP}, DHRS4L1 (dehydrogenase/reductase 4 like 1 (pseudogene)) [NCBI Gene 728635] {aka SDR25C4, SDR25C4P}, OR52K2 (olfactory receptor family 52 subfamily K member 2) [NCBI Gene 119774] {aka OR11-7}, FAM85B [NCBI Gene 619431], TRPC6 (transient receptor potential cation channel subfamily C member 6) [NCBI Gene 7225] {aka FSGS2, TRP6}, ALOX15B (arachidonate 15-lipoxygenase type B) [NCBI Gene 247] {aka 15-LOX-2}, MSTO1 (misato mitochondrial distribution and morphology regulator 1) [NCBI Gene 55154] {aka LST005, MMYAT, MST}, CCL2 (C-C motif chemokine ligand 2) [NCBI Gene 6347] {aka GDCF-2, HC11, HSMCR30, MCAF, MCP-1, MCP1}, ZNF732 (zinc finger protein 732) [NCBI Gene 654254], SSBP1 (single stranded DNA binding protein 1) [NCBI Gene 6742] {aka Mt-SSB, OPA13, SOSS-B1, SSBP, mtSSB}, CIDEC (cell death inducing DFFA like effector c) [NCBI Gene 63924] {aka CIDE-3, CIDE3, FPLD5, FSP27}, LARP1 (La ribonucleoprotein 1, translational regulator) [NCBI Gene 23367] {aka LARP, Lar1, Lhp1}, FGL1 (fibrinogen like 1) [NCBI Gene 2267] {aka HFREP1, HP-041, HPS, LFIRE-1, LFIRE1}, LIPH (lipase H) [NCBI Gene 200879] {aka AH, ARWH2, HYPT7, LAH2, LPDLR, PLA1B}, ABCC2 (ATP binding cassette subfamily C member 2) [NCBI Gene 1244] {aka ABC30, CMOAT, DJS, MRP2, cMRP}, CARMIL3 (capping protein regulator and myosin 1 linker 3) [NCBI Gene 90668] {aka C14orf121, LRRC16B, crml-1}, ENPP7P10 (ectonucleotide pyrophosphatase/phosphodiesterase 7 pseudogene 10) [NCBI Gene 100421805], MEI4 (meiotic double-stranded break formation protein 4) [NCBI Gene 101928601], LHFPL3-AS1 (LHFPL3 antisense RNA 1) [NCBI Gene 645591], MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, TREH (trehalase) [NCBI Gene 11181] {aka TRE, TREA, TREHD}, SNRPCP10 (small nuclear ribonucleoprotein polypeptide C pseudogene 10) [NCBI Gene 106481701], OR52K1 (olfactory receptor family 52 subfamily K member 1) [NCBI Gene 390036] {aka OR11-8}, AKT1 (AKT serine/threonine kinase 1) [NCBI Gene 207] {aka AKT, PKB, PKB-ALPHA, PRKBA, RAC, RAC-ALPHA}, ZNF420 (zinc finger protein 420) [NCBI Gene 147923] {aka APAK}, WEE2 (WEE2 oocyte meiosis inhibiting kinase) [NCBI Gene 494551] {aka OOMD5, OZEMA5, WEE1B}, CSF3 (colony stimulating factor 3) [NCBI Gene 1440] {aka C17orf33, CSF3OS, GCSF}, LHFPL3 (LHFPL tetraspan subfamily member 3) [NCBI Gene 375612] {aka LHFPL4}, CXCL1 (C-X-C motif chemokine ligand 1) [NCBI Gene 2919] {aka FSP, GRO1, GROa, MGSA, MGSA-a, NAP-3}, DNM1P41 (dynamin 1 pseudogene 41) [NCBI Gene 440299] {aka DNM1DN11-6, DNM1DN11.6, DNM1DN11@}, DGCR5 (DiGeorge syndrome critical region gene 5) [NCBI Gene 26220] {aka DGCR10, DGCR9, DGS-A, DGS-B, LINC00037, NCRNA00037}, ALOXE3P1 (arachidonate lipoxygenase 3 pseudogene 1) [NCBI Gene 100420216], IFNG (interferon gamma) [NCBI Gene 3458] {aka IFG, IFI, IMD69}, FAM86JP (family with sequence similarity 86, member A pseudogene) [NCBI Gene 100125556], ERVK-7 (endogenous retrovirus group K member 7) [NCBI Gene 449619] {aka ERVK7, HERV-K(III), HERV-K102, HERV-KIII}, ALOX12B (arachidonate 12-lipoxygenase, 12R type) [NCBI Gene 242] {aka 12R-LOX, ARCI2}, MTHFD1L (methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1 like) [NCBI Gene 25902] {aka FTHFSDC1, MTC1THFS, dJ292B18.2}, OR51R1P (olfactory receptor family 51 subfamily R member 1 pseudogene) [NCBI Gene 390034], IFNA1 (interferon alpha 1) [NCBI Gene 3439] {aka IFL, IFN, IFN-ALPHA, IFN-alphaD, IFNA13, IFNA@}, ZNF876P (zinc finger protein 876, pseudogene) [NCBI Gene 642280], CSPG4P11 (chondroitin sulfate proteoglycan 4 pseudogene 11) [NCBI Gene 100631254], CD48 (CD48 molecule) [NCBI Gene 962] {aka BCM1, BLAST, BLAST1, MEM-102, SLAMF2, hCD48}, CR1 (complement C3b/C4b receptor 1 (Knops blood group)) [NCBI Gene 1378] {aka C3BR, C4BR, CD35, KN}, PIK3CB (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit beta) [NCBI Gene 5291] {aka P110BETA, PI3K, PI3KBETA, PIK3C1}, JUNB (JunB proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3726] {aka AP-1}
- **Diseases:** CMS1-3 (MESH:C537153), germ cell tumor (MESH:D009373), laryngeal carcinoma (MESH:D007822), CMS2-4 (MESH:D053632), colon adenocarcinoma (MESH:D003110), gastric cancer (MESH:D013274), lung adenocarcinoma (MESH:D000077192), colorectal adenomas (MESH:D000236), CMSs (MESH:C535673), epithelial ovarian cancer (MESH:D000077216), carcinogenesis (MESH:D063646), Cancer (MESH:D009369), MSI (MESH:D053842), HBV infection (MESH:D006509), schizophrenia (MESH:D012559), rectal cancer (MESH:D012004), injury to (MESH:D014947), inflammation (MESH:D007249), prostate cancer (MESH:D011471), glioma (MESH:D005910), pancreatic adenocarcinoma (MESH:D010190), HCC (MESH:D006528), glioblastoma (MESH:D005909), CMS (MESH:C536089), breast cancer (MESH:D001943), malignant tumors of the ovary, stomach, prostate, pancreas, and liver (MESH:D001948), CMS (MESH:C567116), metastasis (MESH:D009362), deaths (MESH:D003643), carcinogenic (MESH:D011230), CRC (MESH:D015179)
- **Chemicals:** Kynurenine (MESH:D007737), glutamate (MESH:D018698), blood glucose (MESH:D001786), water (MESH:D014867), proline (MESH:D011392), reactive oxygen species (MESH:D017382), ATP (MESH:D000255), lipid (MESH:D008055), 2-acetyl lysophosphatidic acid (-), alpha-ketoglutarate (MESH:D007656)
- **Species:** Clostridium polynesiense (species) [taxon 1325933], Mouse mammary tumor virus (no rank) [taxon 11757], Mus musculus (house mouse, species) [taxon 10090], Human endogenous retroviruses (clade) [taxon 206037], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** HML-1 — Mus musculus (Mouse), Malignant neoplasms of the mouse mammary gland, Cancer cell line (CVCL_0I87), CMS1-3 — Homo sapiens (Human), Childhood acute megakaryoblastic leukemia, Cancer cell line (CVCL_H248), HeLa — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_0030)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12922068/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12922068/full.md

## References

141 references — full list in the complete paper: https://tomesphere.com/paper/PMC12922068/full.md

---
Source: https://tomesphere.com/paper/PMC12922068