# Master of Metals2: a graph neural network based architecture for the prediction of zinc binding sites in protein structures

**Authors:** Vincenzo Laveglia, Cosimo Ciofalo, Enrico Morelli, Claudia Andreini, Antonio Rosato

PMC · DOI: 10.1093/bib/bbag078 · Briefings in Bioinformatics · 2026-03-02

## TL;DR

This paper introduces MoM2, a new tool that uses graph neural networks to accurately predict where zinc ions bind in protein structures.

## Contribution

The novelty lies in using a graph neural network trained solely on spatial coordinates to predict zinc-binding sites without relying on sequence-based methods.

## Key findings

- MoM2 outperformed existing methods with an F1-score of 55.7% and a false discovery rate of 44.1%.
- It successfully identified 18 out of 20 predicted zinc sites in SARS-CoV-2 proteins.
- The tool processes entire proteomes within hours and is accessible via a web interface.

## Abstract

Zinc ions play essential structural and catalytic roles in a wide range of proteins. Accurate prediction of their binding sites is crucial for structural and functional annotation. We present MoM2, a web-accessible tool for predicting zinc-binding sites in protein 3D structures. MoM2 employs a graph neural network trained exclusively on spatial features specifically, Cα and Cβ coordinates eliminating the need for templates or sequence-based heuristics. The tool efficiently processes entire proteomes within hours and demonstrates strong predictive performance. In a benchmark of 412 experimentally determined apo-structures, MoM2 outperformed existing methods, achieving the highest F1-score (55.7%) and the lowest false discovery rate (44.1%). The web interface supports input via structure files, PDB or UniProt IDs, and allows batch processing with customizable thresholds. As an independent validation, MoM2 correctly identified 18 out of 20 predicted zinc sites in SARS-CoV-2 proteins. The tool is freely available at https://mom2.cerm.unifi.it.

## Linked entities

- **Chemicals:** zinc (PubChem CID 23994)
- **Diseases:** SARS-CoV-2 (MONDO:0100096)

## Full-text entities

- **Genes:** SLC4A11 (solute carrier family 4 member 11) [NCBI Gene 83959] {aka BTR1, CDPD1, CHED, CHED2, NABC1, dJ794I6.2}, SH2D3A (SH2 domain containing 3A) [NCBI Gene 10045] {aka NSP1}, ORF3a (ORF3a protein) [NCBI Gene 43740569], ATP5F1A (ATP synthase F1 subunit alpha) [NCBI Gene 498] {aka ATP5A, ATP5A1, ATP5AL2, ATPM, COXPD22, HEL-S-123m}, MBS1 (Moebius syndrome 1) [NCBI Gene 4156] {aka MBS}
- **Diseases:** MBSs (MESH:D009371), IDs (MESH:C535742), MPNN (MESH:D015441)
- **Chemicals:** Gly (MESH:D005998), Zinc (MESH:D015032), metal (MESH:D008670), His (MESH:D006639), Cys (MESH:D003545), MoM (MESH:D015644), hydrogen (MESH:D006859), Calpha (-), amino acids (MESH:D000596)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Escherichia coli (E. coli, species) [taxon 562], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12951075/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12951075/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12951075/full.md

---
Source: https://tomesphere.com/paper/PMC12951075