# PPE50 variants as novel phylogeographic signatures of host-pathogen co-evolution in tuberculosis

**Authors:** Christopher D’Souza, Jody E. Phelan, Paula-Josefina Gomez-Gonzalez, Joseph Thorpe, Taane G. Clark, Anthony G. Tsolaki

PMC · DOI: 10.1038/s42003-025-08383-3 · 2025-07-09

## TL;DR

This paper identifies PPE50 protein variants in tuberculosis bacteria that are linked to geographic regions and suggest co-evolution with human hosts.

## Contribution

The study introduces PPE50 as a novel subfamily of proteins with phylogeographic signatures of host-pathogen co-evolution in tuberculosis.

## Key findings

- PPE50 variants are lineage-specific and stably associated with geographic regions in M. tuberculosis complex strains.
- PPE50-381 is the ancestral variant found in both Ancient and Modern MTBC lineages.
- Transcriptomic analysis shows ppe50 variant genes are expressed in specific MTBC lineages but deleted in others.

## Abstract

PPE50 is a diverse novel subfamily of PE/PPE proteins in the Mycobacterium tuberculosis complex that show a phylogeographic distribution indicative host-pathogen co-evolution.

While evidence supports co-evolution between Mycobacterium tuberculosis and humans, underlying mechanisms remain unclear. We identified PPE50 as a novel subfamily of PE/PPE proteins comprising eight variants. Surveying 387 M. tuberculosis complex (MTBC) strains representing global phylogeography, we found PPE50 variants are lineage-specific and stably associated with geographic regions, defining them as phylogeographically-associated proteins (PAPs). PPE50-381 is the ancestral variant (present in early-branching M. canettii) and the only variant observed in both Ancient and Modern MTBC lineages. Transcriptomic analysis confirmed that ppe50 variant genes are expressed in strains from respective MTBC lineages, but not in all L1 strains and sub-lineages L2.1 and L4.1 where the gene was deleted. In silico analysis revealed significant structural diversity among variants, particularly in C-terminal regions. This strong association of M. tuberculosis protein diversity with phylogeography suggests PPE50 may contribute to MTBC adaptation to different host populations. Further characterization of PPE50 and other PAPs may facilitate improved targeted diagnostics, therapeutics and vaccines.

## Linked entities

- **Genes:** PPE50 (PPE family protein PPE50) [NCBI Gene 888153]
- **Proteins:** PPE50 (PPE family protein PPE50)
- **Diseases:** tuberculosis (MONDO:0018076)
- **Species:** Mycobacterium tuberculosis (taxon 1773)

## Full-text entities

- **Diseases:** tuberculosis (MESH:D014376)
- **Species:** Mycobacterium tuberculosis (species) [taxon 1773], Mycobacterium canetti (species) [taxon 78331], Homo sapiens (human, species) [taxon 9606], Mycobacterium tuberculosis complex (species group) [taxon 77643]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12241500/full.md

---
Source: https://tomesphere.com/paper/PMC12241500