# Phylogenetic Analysis of NEAT1 and MALAT1 Long Non-Coding RNAs Highlights Structure–Function Relationships in Paraspeckle Biology

**Authors:** Ksenia Arkhipova, Micha Drukker

PMC · DOI: 10.1093/molbev/msaf265 · Molecular Biology and Evolution · 2025-12-09

## TL;DR

This study explores the evolutionary conservation of NEAT1 and MALAT1 lncRNAs, revealing structural and functional differences in paraspeckle biology.

## Contribution

The first large-scale phylogenetic analysis of NEAT1, highlighting conserved structural motifs and transposable element contributions.

## Key findings

- NEAT1's conserved features include G-quadruplexes, self-complementary regions, and transposable elements that stabilize paraspeckles.
- The NEAT1Short isoform is universally conserved, and TDP-43-mediated isoform switching is evolutionarily preserved.
- MALAT1 relies on conserved primary sequence and purifying selection regions for its function.

## Abstract

Paraspeckles are nuclear bodies essential for gene regulation and stress response, and they are built upon the long non-coding RNA NEAT1. Together with the syntenic MALAT1, these are the only lncRNAs that use the tRNA-processing machinery for maturation, yet they differ in function and evolutionary conservation. To investigate these differences, we identified NEAT1 and MALAT1 orthologs across 545 mammals. For NEAT1, we found that G-quadruplexes, short motifs interacting with DBHS proteins and TDP-43, long gene length, and self-complementary regions are highly conserved features that likely stabilize paraspeckle integrity. Transposable elements also contributed structural modules potentially recognized by DBHS proteins, underscoring their role in NEAT1 evolution. The NEAT1Short isoform was present in all orthologs, and the TDP-43-mediated isoform switch appears to be conserved. In contrast, MALAT1 function likely relies on its conserved primary sequence and regions under purifying selection. This is the first large-scale phylogenetic study of NEAT1—a lncRNA that lacks sequence similarity between orthologs while maintaining functional and syntenic conservation.

## Linked entities

- **Genes:** NEAT1 (nuclear paraspeckle assembly transcript 1) [NCBI Gene 283131], MALAT1 (metastasis associated lung adenocarcinoma transcript 1) [NCBI Gene 378938], TARDBP (TAR DNA binding protein) [NCBI Gene 23435]
- **Proteins:** dbh.S (dopamine beta-hydroxylase (dopamine beta-monooxygenase) S homeolog), TARDBP (TAR DNA binding protein)

## Full-text entities

- **Genes:** TARDBP (TAR DNA binding protein) [NCBI Gene 23435] {aka ALS10, TDP-43}, NEAT1 (nuclear paraspeckle assembly transcript 1) [NCBI Gene 283131] {aka LINC00084, NCRNA00084, TP53LC15, TncRNA, VINC}, MALAT1 (metastasis associated lung adenocarcinoma transcript 1) [NCBI Gene 378938] {aka HCN, LINC00047, NCRNA00047, NEAT2, PRO2853, miPEP-52}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12941219/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12941219/full.md

## References

99 references — full list in the complete paper: https://tomesphere.com/paper/PMC12941219/full.md

---
Source: https://tomesphere.com/paper/PMC12941219