# GNN-MA: Soft Molecular Alignment with Cross-Graph Attention for Ligand-Based Virtual Screening

**Authors:** Keling Liu, Dongmei Wei, Rui Shi, Zhiyuan Zhou

PMC · DOI: 10.3390/molecules31060991 · Molecules · 2026-03-16

## TL;DR

This paper introduces GNN-MA, a new model for drug discovery that improves molecule screening using graph-based alignment without needing 3D structures.

## Contribution

GNN-MA introduces cross-graph attention and bond-to-atom aggregation for soft molecular alignment in ligand-based virtual screening.

## Key findings

- GNN-MA achieves competitive ROC-AUC scores on DUD-E and LIT-PCBA datasets.
- It improves early-enrichment metrics (EF@1–5%) on DUD-E compared to ablated variants.
- The model provides interpretable atom-level alignment insights in case studies.

## Abstract

Ligand-based virtual screening (LBVS) seeks strong early enrichment when searching ultra-large libraries, but practical screening often relies on 1D/2D descriptions while 3D information is expensive and uncertain due to conformer generation and alignment. We propose GNN-MA, a retrieval-style pairwise scoring model for query–candidate molecular pairs that uses molecular graphs as a unified representation. Built on intra-graph message passing, GNN-MA adds cross-graph attention to learn atom-level soft alignment that focuses on key substructures relevant to activity matching, and introduces a bond-to-atom semantic aggregation module to better exploit chemical bond cues for similarity scoring. The framework uses 2D molecular graphs derived from SMILES for retrieval-style matching and does not rely on explicit 3D conformational modeling or alignment. Experiments on DUD-E and LIT-PCBA show that GNN-MA achieves competitive overall discrimination (ROC-AUC) and, relative to its ablated variants, provides consistent gains in early-enrichment metrics (EF@1–5%) on DUD-E, while on LIT-PCBA the improvements are more target-dependent. The learned atom-level soft alignment also provides a qualitative interpretability cue in case studies. Throughput benchmarks suggest that GNN-MA is most suitable as a re-ranking/refinement model after a fast prefiltering stage.

## Full-text entities

- **Chemicals:** GNN-MA (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13029554/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13029554/full.md

## References

56 references — full list in the complete paper: https://tomesphere.com/paper/PMC13029554/full.md

---
Source: https://tomesphere.com/paper/PMC13029554