# PlasticEnz: An integrated database and screening tool combining homology and machine learning to identify plastic-degrading enzymes in meta-omics datasets

**Authors:** Anna Krzynowek, Jasper Snoeks, Karoline Faust, Eduardo Jardón-Valadez, Eduardo Jardón-Valadez, Eduardo Jardón-Valadez

PMC · DOI: 10.1371/journal.pcbi.1013892 · 2026-01-26

## TL;DR

PlasticEnz is a tool that helps find enzymes capable of breaking down plastic in environmental samples using a mix of sequence similarity and machine learning.

## Contribution

PlasticEnz introduces a novel integration of homology-based search and machine learning for identifying plastic-degrading enzymes in metagenomic data.

## Key findings

- PlasticEnz successfully identified known PETases and PHBases in both lab and real-world samples.
- The tool distinguished between plastic-contaminated and pristine environments effectively.
- PlasticEnz achieved an F1 score > 0.7 on an independent test set for PET and PHB classification.

## Abstract

PlasticEnz is a new open-source tool for detecting plastic-degrading enzymes (plastizymes) in metagenomic data by combining sequence homology-based search with machine learning techniques. It integrates custom Hidden Markov Models, DIAMOND alignments, and polymer-specific classifiers trained on ProtBERT embeddings to identify candidate depolymerases from user-provided contigs, genomes, or protein sequences. PlasticEnz supports 11 plastic polymers with ML classifiers for PET and PHB, achieving F1 > 0.7 on an independent test set. Applied to plastic-exposed microcosms and field metagenomes, the tool recovered known PETases and PHBases, distinguished plastic-contaminated from pristine environments, and clustered predictions with validated reference enzymes. PlasticEnz is fast, scalable, and user-friendly, providing a robust framework for exploring microbial plastic degradation potential in complex communities.

Plastic pollution is a global problem, and one promising solution is to apply microbes to break them down. However, finding the enzymes responsible for this in complex environmental samples is not easy. We developed PlasticEnz, a free and easy-to-use tool that helps researchers identify plastic-degrading enzymes or “plastizymes” in metagenomic data. PlasticEnz combines traditional sequence similarity search methods with machine learning models trained on previously known plastizymes. It works with protein sequences, contigs, or genomes with ML components optimised for classification of two common plastizymes: PETases and PHBases. We tested PlasticEnz on both controlled lab experiments and real-world samples from plastic-polluted soils and clean environments. The tool successfully identified known plastic-degrading enzymes and even helped distinguish between polluted and pristine sites. By making plastizyme detection more accessible, PlasticEnz enables researchers to better explore the microbial potential for plastic degradation, which could support future bioremediation efforts.

## Full-text entities

- **Genes:** F2 (coagulation factor II, thrombin) [NCBI Gene 2147] {aka PT, RPRGL2, THPH1}
- **Diseases:** cancer (MESH:D009369)
- **Chemicals:** PHB (MESH:C000720856), amino acid (MESH:D000596), PES (MESH:C022840), P3HP (-), PCL (MESH:C016240), PET (MESH:D011093), PLA (MESH:C033616), carbon (MESH:D002244), plastic (MESH:D010969), ester (MESH:D004952), polymer (MESH:D011108), poly(3-Hydroxypropionate (MESH:C000600150), PHBV (MESH:C052620), PEA (MESH:C539080), PBSA (MESH:C574545), PBAT (MESH:C488797), PBS (MESH:C089797), polyester (MESH:D011091), H2O (MESH:D014867)
- **Species:** Bacillus subtilis (species) [taxon 1423], Burkholderia cepacia (species) [taxon 292], Fusarium solani (species) [taxon 169388], Staphylococcus aureus (species) [taxon 1280], Fusarium oxysporum (species) [taxon 5507], Marinactinospora thermotolerans (species) [taxon 531310], Chlamydia trachomatis (species) [taxon 813], Ralstonia pickettii (species) [taxon 329], Escherichia coli (E. coli, species) [taxon 562], Cupriavidus necator (species) [taxon 106590], Humicola insolens [taxon 34413], Paracoccus denitrificans (species) [taxon 266], Micromonosporaceae (family) [taxon 28056], Halopseudomonas aestusnigri (species) [taxon 857252], Paucimonas lemoignei (species) [taxon 29443], Alcaligenes faecalis (species) [taxon 511], Homo sapiens (human, species) [taxon 9606], Saccharopolyspora flava (species) [taxon 95161]

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12872022/full.md

---
Source: https://tomesphere.com/paper/PMC12872022