# Endogenous retroviral elements LTR8B and MER65 rewire PSG9 regulation to control trophoblast syncytialization and pre-eclampsia risk

**Authors:** Manvendra Singh, Yuliang Qu, Amit Pande, Julianna Zadora, Florian Herse, Martin Gauster, Xuhui Kong, Rongyan Zheng, Rabia Anwar, Katarina Stevanovic, Ralf Dechend, Marie Cohen, Attila Molvarec, Jichang Wang, Miriam K. Konkel, Bin Zhang, Cedric Feschotte, Gabriela Dveksler, Sandra M. Blois, Laurence D. Hurst, Zsuzsanna Izsvák

PMC · DOI: 10.1186/s13059-026-03944-z · Genome Biology · 2026-03-09

## TL;DR

This study shows how retroviral elements in the genome control placental development and may be linked to pre-eclampsia risk through regulation of PSG9.

## Contribution

The paper identifies LTR8B and MER65 as retroviral elements that regulate PSG9, influencing trophoblast syncytialization and pre-eclampsia risk.

## Key findings

- LTR8B at PSG9 regulates trophoblast syncytialization and is upregulated in pre-eclampsia.
- MER65-int enables the evolution of secreted PSG variants by altering protein structure.
- PSG9 shows potential as a predictive biomarker for pre-eclampsia due to its early expression and correlation with GATA3/DLX5.

## Abstract

Understanding the causes of the exceptional rate of evolution of the mammalian placenta is likely to aid the understanding of placental development and the etiology of the human-specific pregnancy disorder pre-eclampsia (PE). As retroelements are often lineage-specific and known to be co-opted for placental function, here we consider the binding of the transcription factors GATA3 and DLX5 to retroelements. These factors are dysregulated in pre-eclampsia, as are their downstream consequences.

We identify retrovirus-derived LTR8B as a placentally-relevant cis-regulatory element (CRE), not least within the PSG array, a primate-specific genomic region that exhibits high intraspecies variability. LTR8B at PSG9 is particularly influential affecting other PSG family members. Moreover, unique among PSGs, PSG9 produces both secreted and membrane-anchored isoforms. The retroelement MER65-int provides alternative polyA signals that enable the evolution of secreted PSG variants by truncating the ancestral CEACAM protein’s transmembrane domain. Functional characterization finds that LTR8B/PSG9 regulates the differentiation of multinucleated trophoblasts (syncytialization) and, like chorionic gonadotropin and syncytin1, determines the identity of syncytiotrophoblasts. Notably, PSG9 is the most upregulated PSG in PE, with levels correlated with GATA3 and DLX5 levels.

Retroelements contribute to the structural and expression evolution of PSG genes, facilitating lineage-specific placental evolution. The LTR8B/PSG9 regulatory network plays a central role in syncytiotrophoblast differentiation. Given the association between DLX5/GATA3 dysregulation and elevated PSG9 levels, along with PSG9’s expression in the first trimester, PSG9 shows potential as a predictive biomarker for preeclampsia.

The online version contains supplementary material available at 10.1186/s13059-026-03944-z.

## Linked entities

- **Genes:** GATA3 (GATA binding protein 3) [NCBI Gene 2625], DLX5 (distal-less homeobox 5) [NCBI Gene 1749], PSG9 (pregnancy specific beta-1-glycoprotein 9) [NCBI Gene 5678]
- **Diseases:** pre-eclampsia (MONDO:0005081)

## Full-text entities

- **Genes:** TEAD4 (TEA domain transcription factor 4) [NCBI Gene 7004] {aka EFTR-2, RTEF1, TCF13L1, TEF-3, TEF3, TEFR-1}, CDH1 (cadherin 1) [NCBI Gene 999] {aka Arc-1, BCDS1, CD324, CDHE, ECAD, LCAM}, COL10A1 (collagen type X alpha 1 chain) [NCBI Gene 1300], LHCGR (luteinizing hormone/choriogonadotropin receptor) [NCBI Gene 3973] {aka HHG, LCGR, LGR2, LH/CG-R, LH/CGR, LHR}, CHGA (chromogranin A) [NCBI Gene 1113] {aka CGA, PHE5, PHES}, DLX3 (distal-less homeobox 3) [NCBI Gene 1747] {aka AI4, TDO}, ERVH48-1 (endogenous retrovirus group 48 member 1, envelope) [NCBI Gene 90625] {aka C21orf105, HERV-Fb1, NDUFV3-AS1, SUPYN}, LIPE (lipase E, hormone sensitive type) [NCBI Gene 3991] {aka AOMS4, FPLD6, HSL, LHS, REH}, ASCL2 (achaete-scute family bHLH transcription factor 2) [NCBI Gene 430] {aka ASH2, HASH2, MASH2, bHLHa45}, PSG9 (pregnancy specific beta-1-glycoprotein 9) [NCBI Gene 5678] {aka PS-beta-B, PS-beta-G-9, PS34, PSBG-9, PSG11, PSGII}, SFTPA1 (surfactant protein A1) [NCBI Gene 653509] {aka COLEC4, ILD1, PSP-A, PSPA, SFTP1, SFTPA1B}, ERVW-1 (endogenous retrovirus group W member 1, envelope) [NCBI Gene 30816] {aka ENV, ENVW, ERVWE1, HERV-7q, HERV-W-ENV, HERV7Q}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, JUNB (JunB proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3726] {aka AP-1}, GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, TSC1 (TSC complex subunit 1) [NCBI Gene 7248] {aka LAM, TSC}, TFAP2C (transcription factor AP-2 gamma) [NCBI Gene 7022] {aka AP2-GAMMA, ERF1, TFAP2G, hAP-2g}, CEACAM8 (CEA cell adhesion molecule 8) [NCBI Gene 1088] {aka CD66b, CD67, CGM6, NCA-95}, ERVFRD-1 (endogenous retrovirus group FRD member 1, envelope) [NCBI Gene 405754] {aka ERVFRDE1, GLLL6191, HERV-FRD, HERV-W/FRD, UNQ6191, envFRD}, CGB2 (chorionic gonadotropin subunit beta 2) [NCBI Gene 114336], PSG8 (pregnancy specific beta-1-glycoprotein 8) [NCBI Gene 440533], LYPD4 (LY6/PLAUR domain containing 4) [NCBI Gene 147719] {aka SMR}, PC (pyruvate carboxylase) [NCBI Gene 5091] {aka PCB}, GDF15 (growth differentiation factor 15) [NCBI Gene 9518] {aka GDF-15, HG, MIC-1, MIC1, NAG-1, PDF}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, DLX5 (distal-less homeobox 5) [NCBI Gene 1749] {aka SHFM1, SHFM1D}, NUBP1 (NUBP iron-sulfur cluster assembly factor 1, cytosolic) [NCBI Gene 4682] {aka CIAO5, NBP, NBP1, NBP35}, CGB5 (chorionic gonadotropin subunit beta 5) [NCBI Gene 93659] {aka CGB, HCG}, PSG11 (pregnancy specific beta-1-glycoprotein 11) [NCBI Gene 5680] {aka PSBG-11, PSBG-13, PSG13, PSG14}, INSM2 (INSM transcriptional repressor 2) [NCBI Gene 84684] {aka IA-6, IA6, mlt1}, SDC1 (syndecan 1) [NCBI Gene 6382] {aka CD138, SDC, SYND1, syndecan}, GCM1 (GCM transcription factor 1) [NCBI Gene 8521] {aka GCMA, hGCMa}, EGF (epidermal growth factor) [NCBI Gene 1950] {aka HOMG4, URG}, CGB1 (chorionic gonadotropin subunit beta 1) [NCBI Gene 114335], PSG3 (pregnancy specific beta-1-glycoprotein 3) [NCBI Gene 5671], PSG5 (pregnancy specific beta-1-glycoprotein 5) [NCBI Gene 5673] {aka FL-NCA-3, PSG}, PSG4 (pregnancy specific beta-1-glycoprotein 4) [NCBI Gene 5672] {aka PSBG-4}, LGALS16 (galectin 16) [NCBI Gene 148003], TFAP2A (transcription factor AP-2 alpha) [NCBI Gene 7020] {aka AP-2, AP-2alpha, AP2TF, BOFS, TFAP2}, GFER (growth factor, augmenter of liver regeneration) [NCBI Gene 2671] {aka ALR, ERV1, HERV1, HPO, HPO1, HPO2}, PSG7 (pregnancy specific beta-1-glycoprotein 7) [NCBI Gene 5676] {aka PS-beta-G-7, PSBG-7, PSGGA}, MED1 (mediator complex subunit 1) [NCBI Gene 5469] {aka CRSP1, CRSP200, DRIP205, DRIP230, PBP, PPARBP}, STAT5B (signal transducer and activator of transcription 5B) [NCBI Gene 6777] {aka GHISID2, STAT5}, CANX (calnexin) [NCBI Gene 821] {aka CNX, IP90, P90}, PSG2 (pregnancy specific beta-1-glycoprotein 2) [NCBI Gene 5670] {aka CEA, PSBG2, PSG1}, ERVV-2 (endogenous retrovirus group V member 2, envelope) [NCBI Gene 100271846] {aka ENVV2, HERV-V2}, RAPGEF4 (Rap guanine nucleotide exchange factor 4) [NCBI Gene 11069] {aka CAMP-GEFII, CGEF2, EPAC, EPAC 2, EPAC2, Nbla00496}, PSG6 (pregnancy specific beta-1-glycoprotein 6) [NCBI Gene 5675] {aka PSBG-10, PSBG-12, PSBG-6, PSG10, PSGGB}, DLX4 (distal-less homeobox 4) [NCBI Gene 1748] {aka BP1, DLX7, DLX8, DLX9, OFC15}, FOSB (FosB proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 2354] {aka AP-1, G0S3, GOS3, GOSB}, CDX2 (caudal type homeobox 2) [NCBI Gene 1045] {aka CDX-3, CDX2/AS, CDX3}, EP300 (EP300 lysine acetyltransferase) [NCBI Gene 2033] {aka KAT3B, MKHK2, RSTS2, p300}, GATA2 (GATA binding protein 2) [NCBI Gene 2624] {aka DCML, IMD21, MONOMAC, NFE1B}, FLT1 (fms related receptor tyrosine kinase 1) [NCBI Gene 2321] {aka FLT, FLT-1, VEGFR-1, VEGFR1}, PGF (placental growth factor) [NCBI Gene 5228] {aka D12S1900, PGFL, PIGF, PLGF, PlGF-2, SHGC-10760}, KRT7 (keratin 7) [NCBI Gene 3855] {aka CK7, K2C7, K7, SCL}, EREG (epiregulin) [NCBI Gene 2069] {aka EPR, ER, Ep}, PSG1 (pregnancy specific beta-1-glycoprotein 1) [NCBI Gene 5669] {aka B1G1, CD66f, FL-NCA-1/2, PBG1, PS-beta-C/D, PS-beta-G-1}, GATA3 (GATA binding protein 3) [NCBI Gene 2625] {aka HDR, HDRS}
- **Diseases:** PSGs (MESH:D011254), overdose (MESH:D062787), RE (MESH:C535499), ERVs (MESH:D003866), end-organ damages (MESH:C564816), placental disorders (MESH:D010922), hypertension (MESH:D006973), mycoplasma infection (MESH:D009175), ERV (MESH:D000071297), metastasis (MESH:D009362), -specific (MESH:D000080888), proteinuria (MESH:D011507), EO-PE (MESH:D011225), tumour (MESH:D009369), APA (MESH:C536589), EVTB (MESH:D014328), IUGR (MESH:D005317)
- **Chemicals:** ITS-X (MESH:C403901), polyA (MESH:D011061), H2SO4 (MESH:C033158), Tween 20 (MESH:D011136), TBS-T (MESH:C027647), PBS (MESH:D007854), paraformaldehyde (MESH:C003043), SB431542 (MESH:C459179), SYBR  Green (MESH:C098022), CO2 (MESH:D002245), 2-mercaptoethanol (MESH:D008623), A83-01 (MESH:C507011), VPA (MESH:D014635), oil (MESH:D009821), AA (MESH:D000596), Forskolin (MESH:D005576), Dox (MESH:D004317), hematoxylin (MESH:D006416), penicillin (MESH:D010406), glycerol (MESH:D005990), puromycin (MESH:D011691), hydrogen peroxide (MESH:D006861), DMEM (-), Y27632 (MESH:C108830), crystal violet (MESH:D005840), S (MESH:D013455), acrylamide (MESH:D020106), cAMP (MESH:D000242), oligonucleotide (MESH:D009841), HCl (MESH:D006851), SDS (MESH:D012967), L-ascorbic acid (MESH:D001205), CHIR99021 (MESH:C473711), TBS (MESH:D013725), TRIzol (MESH:C411644), aspirin (MESH:D001241), triton X-100 (MESH:D017830), streptomycin (MESH:D013307), 3,3,5,5'-tetramethyl benzidine (MESH:C021758), DPBS (MESH:C012939), NP-40 (MESH:C010615), nitrogen (MESH:D009584), F-12 (MESH:C007782), EDTA (MESH:D004492), P (MESH:D010758), Doxycycline (MESH:D004318), NaCl (MESH:D012965), 3-amino-9-ethylcarbazole (MESH:C020702), methanol (MESH:D000432)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090], Erysiphe sp. RV (species) [taxon 662690]
- **Cell lines:** pTOVT11 — Homo sapiens (Human), Transformed cell line (CVCL_C1JD), BeWo — Homo sapiens (Human), Gestational choriocarcinoma, Cancer cell line (CVCL_0044), Q00887-1 — Homo sapiens (Human), Hurler syndrome, Finite cell line (CVCL_V530), SGHPL-4 — Homo sapiens (Human), Transformed cell line (CVCL_0521), Swan 71 EVTB — Homo sapiens (Human), Telomerase immortalized cell line (CVCL_D855), EVTB — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_WU77), SHGPL-4 — Homo sapiens (Human), Ataxia telangiectasia syndrome, Finite cell line (CVCL_F083), pLKO.1 — Mus musculus (Mouse), Hybridoma (CVCL_C7RB), ESC — Homo sapiens (Human), Embryonic stem cell (CVCL_9771), S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12969887/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12969887/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/PMC12969887/full.md

---
Source: https://tomesphere.com/paper/PMC12969887