# LINE-1 Transcript Heterogeneity in Non-Small Cell Lung Cancers Is Driven by Host Genomic Context and Conserved Functional Hotspots

**Authors:** Yingshan Wang, Kenneth S. Ramos

PMC · DOI: 10.3390/cancers18030459 · 2026-01-30

## TL;DR

This study explores how LINE-1 retrotransposons are activated in non-small cell lung cancer, revealing that their activity is influenced by genomic context and specific hotspots.

## Contribution

The study identifies conserved genomic hotspots and cancer-relevant genes linked to LINE-1 activity in lung cancer.

## Key findings

- LINE-1 transcription is dominated by young subfamilies like L1HS and L1PA2 through L1PA5.
- LINE-1 activity is highest in lung squamous cell carcinoma and near genes like RB1 and NEDD4.
- Recurrent hotspots at 22q12.1 and 20p11.21 are active across tumor subtypes.

## Abstract

Long interspersed element-1 sequences are normally silenced in healthy cells but frequently reactivated in cancer, where they contribute to genomic instability and transcriptional dysregulation. However, the determinants of their transcriptional heterogeneity in non-small cell lung cancer remain poorly defined. In this study, we systematically characterized locus-specific LINE-1 transcript expression across a large panel of non-small cell lung cancer cell lines to determine how genomic context shapes retrotransposon activity. We show that LINE-1 transcription is dominated by evolutionarily young elements and follows conserved chromosomal patterns across tumor subtypes, with the highest activity observed in lung squamous cell carcinoma. Importantly, LINE-1 expression is anchored by a small number of conserved genomic hotspot loci that remain active across tumor backgrounds and are located near cancer-relevant genes. These findings provide a genome-scale framework for understanding retrotransposon regulation in cancer and highlight conserved LINE-1 loci as potential markers of genome instability and regulatory vulnerability.

Background: Long INterspersed Element-1 (LINE-1) retrotransposons comprise 17–20% of the human genome. These retroelements are normally silenced early in embryonic development through epigenetic mechanisms and reawakened during oncogenesis, leading to transcriptional dysregulation, genomic instability, and immune evasion. Methods: In the present study, we categorized LINE-1 transcripts across 121 non-small cell lung cancer (NSCLC) cell lines from the Cancer Cell Line Encyclopedia (CCLE) by subfamily, length, orientation, chromosomal origin, and distribution. In addition, high-prevalence insertions were mapped to nearby genes to assess potential functional interactions. Results: LINE-1 transcript abundance and length in NSCLC were dominated by evolutionarily young subfamilies, particularly L1HS and L1PA2 through L1PA5. Chromosomal patterns were conserved across NSCLC subtypes, with modest enrichment of L1HS activity on Chromosome 4 and the X Chromosome. The lung squamous cell carcinoma (LSQCC) subtype exhibited the highest total levels of L1HS expression relative to other NSCLC subtypes. Race modestly influenced LINE-1 transcript abundance, with cell lines derived from self-identified African American individuals showing elevated overall LINE-1 and L1HS expression. Age showed a weak positive correlation with total LINE-1 abundance. Integrative analysis revealed recurrent hotspots at 22q12.1 and 20p11.21 that were transcriptionally active across subtypes and coincided with previously reported intact LINE-1 elements active in epithelial cancers. Recurrent insertions were located near cancer-associated genes, including RB1, NEDD4, FTO, LAMA2, NOD1, and KCNB2, implicating LINE-1 activity in cis-regulatory remodeling of oncogenic pathways. Conclusions: Together, these findings indicate that LINE-1 transcript heterogeneity in NSCLC is shaped by host genomic architecture and conserved functional hotspots, providing new insights into the mechanisms of genetic and epigenetic dysregulation associated with LINE-1 retroelements.

## Linked entities

- **Genes:** RB1 (RB transcriptional corepressor 1) [NCBI Gene 5925], NEDD4 (NEDD4 E3 ubiquitin protein ligase) [NCBI Gene 4734], FTO (FTO alpha-ketoglutarate dependent dioxygenase) [NCBI Gene 79068], LAMA2 (laminin subunit alpha 2) [NCBI Gene 3908], NOD1 (nucleotide binding oligomerization domain containing 1) [NCBI Gene 10392], KCNB2 (potassium voltage-gated channel subfamily B member 2) [NCBI Gene 9312]
- **Diseases:** non-small cell lung cancer (MONDO:0005233), lung squamous cell carcinoma (MONDO:0005097)

## Full-text entities

- **Genes:** NOD1 (nucleotide binding oligomerization domain containing 1) [NCBI Gene 10392] {aka CARD4, CLR7.1, NLRC1, hNod1}, FTO (FTO alpha-ketoglutarate dependent dioxygenase) [NCBI Gene 79068] {aka ALKBH9, BMIQ14, GDFD, IFEX9}, RB1 (RB transcriptional corepressor 1) [NCBI Gene 5925] {aka OSRC, PPP1R130, RB, p105-Rb, p110-RB1, pRb}, KCNB2 (potassium voltage-gated channel subfamily B member 2) [NCBI Gene 9312] {aka KV2.2}, NEDD4 (NEDD4 E3 ubiquitin protein ligase) [NCBI Gene 4734] {aka NEDD4-1, RPF1}, LAMA2 (laminin subunit alpha 2) [NCBI Gene 3908] {aka LAMM, MDC1A}
- **Diseases:** LSQCC (MESH:D002294), NSCLC (MESH:D002289), Cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12896411/full.md

---
Source: https://tomesphere.com/paper/PMC12896411