# PDP-Miner: an AI/ML tool to detect prophage tail proteins with depolymerase domains across thousands of bacterial genomes

**Authors:** Jeff Gauthier, Irena Kukavica-Ibrulj, Roger C Levesque

PMC · DOI: 10.1093/bioinformatics/btaf460 · Bioinformatics · 2025-08-21

## TL;DR

PDP-Miner is an AI/ML tool that identifies prophage tail proteins with depolymerase domains in bacterial genomes; such proteins are a potential alternative to antibiotics.

## Contribution

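PDP-Miner wraps an existing machine learning predictor (Depolymerase-Predictor) to annotate phage tail proteins, screen them for depolymerase activity, and validate the hits by protein domain annotation across thousands of bacterial genomes.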

## Key findings

- PDP-Miner identified 10 high-confidence depolymerase gene candidates across the 1294 Pseudomonas genomes in the International Pseudomonas Consortium Database.
- The tool accurately detects depolymerases in known phage genomes, performing similarly to existing tools such as PhageDPO or DepoScope.

## Abstract

Antibiotic resistance is predicted to become the leading cause of human mortality by 2050. Despite this, no new major antibiotic class has been approved for medical use since 1987. Phage tail proteins offer a promising alternative, given their depolymerase activity toward outer membrane polysaccharides. Several pathogenic bacteria harbor prophages, so the molecular target of these prophages is already known.

We therefore developed PDP-Miner, a wrapper for an existing machine learning-based phage depolymerase prediction tool (Depolymerase-Predictor). PDP-Miner annotates phage tail proteins ab initio, detects depolymerase activity within this candidate protein subset, and then performs post-hoc validation by annotating protein domains, allowing the user to look for domains indicative of depolymerase activity. The tool identified 10 high-confidence phage depolymerase gene candidates across all 1294 Pseudomonas genomes available in the International Pseudomonas Consortium Database, while also accurately reporting depolymerases in known phage genomes, similarly to other software such as PhageDPO or DepoScope.
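
The three-stage workflow described in this abstract (tail protein annotation, depolymerase scoring on that subset, then post-hoc domain annotation) can be illustrated with a minimal Python sketch. This is not PDP-Miner's code or interface: `mine_depolymerases`, the `Candidate` record, the `threshold` parameter, and all three `toy_*` stand-ins are hypothetical names invented for illustration, and the toy scoring rules have no biological meaning. In a real run, each callable would be backed by an actual annotation or prediction tool.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Candidate:
    """Hypothetical record for one predicted depolymerase candidate."""
    protein_id: str
    sequence: str
    score: float = 0.0
    domains: List[str] = field(default_factory=list)


def mine_depolymerases(
    proteins: Dict[str, str],
    is_tail_protein: Callable[[str, str], bool],
    score_depolymerase: Callable[[str], float],
    annotate_domains: Callable[[str], List[str]],
    threshold: float = 0.5,  # assumed cutoff, not taken from the paper
) -> List[Candidate]:
    # Stage 1: restrict the search space to annotated phage tail proteins.
    tail = {pid: seq for pid, seq in proteins.items() if is_tail_protein(pid, seq)}

    # Stage 2: score only that subset and keep hits at or above the cutoff.
    hits = []
    for pid, seq in tail.items():
        score = score_depolymerase(seq)
        if score >= threshold:
            hits.append(Candidate(pid, seq, score))

    # Stage 3: attach domain annotations so a user can inspect each hit.
    for cand in hits:
        cand.domains = annotate_domains(cand.sequence)

    # Highest-scoring candidates first.
    return sorted(hits, key=lambda c: c.score, reverse=True)


# Toy stand-ins (illustrative only, no biological validity).
def toy_is_tail_protein(protein_id: str, sequence: str) -> bool:
    return "tail" in protein_id.lower()


def toy_score(sequence: str) -> float:
    return sequence.count("G") / len(sequence)


def toy_domains(sequence: str) -> List[str]:
    return ["toy_beta_helix"] if "GGG" in sequence else []


TOY_PROTEINS = {
    "tail_fiber_1": "MKTAGGGSW",
    "tail_spike_2": "MKKKKKKKK",
    "capsid_3": "MGGGGGGGG",
}

if __name__ == "__main__":
    for cand in mine_depolymerases(
        TOY_PROTEINS, toy_is_tail_protein, toy_score, toy_domains, threshold=0.3
    ):
        print(cand.protein_id, round(cand.score, 3), cand.domains)
```

Passing the stages in as callables keeps the sketch independent of any particular predictor. Filtering before scoring mirrors the abstract's point that depolymerase detection runs only within the tail protein subset, which is why `capsid_3` is never scored here despite its high toy score.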

Source code, test datasets, and documentation are freely available for download at http://www.github.com/jeffgauthier/pdpminer. This software is free and open source under the GNU General Public License v3.0.

## Linked entities

- **Species:** Pseudomonas (taxon 286)

## Full-text entities

- **Chemicals:** polysaccharides (MESH:D011134)
- **Species:** Homo sapiens (human, species) [taxon 9606], Pseudomonas (RNA similarity group I, genus) [taxon 286]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12579547/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12579547/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12579547/full.md

---
Source: https://tomesphere.com/paper/PMC12579547