# Pairwise Neural Networks for Ranking Molecular Structures Based on Properties

**Authors:** Renato Frazzato Viana, Juarez L. F. Da Silva, Luis G. Dias, Ronaldo C. Prati

PMC · DOI: 10.1021/acsomega.6c00717 · ACS Omega · 2026-02-12

## TL;DR

This paper introduces a deep learning model that ranks molecular structures based on their properties, improving the efficiency of molecular discovery.

## Contribution

The novel contribution is a Siamese neural network using pairwise learning to rank molecular structures more effectively than traditional regression methods.

## Key findings

- The pairwise ranking model outperforms pointwise regression in predicting absolute energetic properties.
- Traditional regression remains better for derived or non-energy properties like HOMO–LUMO gap and dipole moment.
- The ranking model's performance is robust across different molecular representation models and parameter sizes.

## Abstract

The rapid discovery and design of new molecules drive
innovation
in science and technology, advancing energy storage, catalysis, and
drug development. Traditionally, exploring chemical space involves
costly quantum-chemical calculations or slow experimental screening,
which limits the speed of identifying promising candidates. Machine
learning has emerged as a groundbreaking approach to accelerate molecular
discovery by predicting key properties directly from molecular structures.
Moreover, in many cases, if we can rank molecular structures, it is
not necessary to know the exact value of a molecular property. In
other words, a ranker model can be useful for molecular screening.
In this work, we develop a deep learning model to rank molecular structures
using a siamese network approach and pairwise learning to learn the
ranking. According to different properties of the QM7x and QO2Mol
data sets, the results show that the performance of the learn-to-rank
Siamese architecture outperforms standard pointwise regression for
predicting absolute energetic properties, such as total and orbital
energies, while traditional pointwise regression remains effective
for derived (e.g., HOMO–LUMO gap) or nonenergy properties (e.g.,
dipole moment). To further validate the robustness of the proposed
framework, we extended our evaluation to include the Uni-Mol molecular
representation model. Experiments with Uni-Mol V1
and V2 across various model sizes (84 M to 1.1 B
parameters) confirm that the pairwise learning-to-rank objective consistently
outperforms standard pointwise regression, even when using highly
expressive pretrained Transformer backbones.

## Full-text entities

- **Genes:** TTC41P (tetratricopeptide repeat domain 41, pseudogene) [NCBI Gene 253724] {aka GNN, GNNP}
- **Diseases:** GNNs (MESH:D015441)
- **Chemicals:** H (MESH:D006859), Cl (MESH:D002713), S (MESH:D013455), Deltahomo (-), F (MESH:D005461), N (MESH:D009584), C (MESH:D002244), Br (MESH:D001966), O (MESH:D010100), P (MESH:D010758)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12947046/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12947046/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12947046/full.md

---
Source: https://tomesphere.com/paper/PMC12947046