clusttraj: A Solvent-Informed Clustering Tool for Molecular Modeling
Rafael Bicudo Ribeiro, Henrique Musseli Cezar

TL;DR
clusttraj is a new clustering tool that improves molecular modeling by optimizing RMSD calculations in solute-solvent systems.
Contribution
clusttraj introduces a solvent-informed clustering method that optimizes RMSD by pairing molecular configurations effectively.
Findings
clusttraj reduces inflated RMSD values by finding optimal pairings between configurations.
The tool is effective for solute-solvent systems, including water clusters and solvated proteins.
Evaluation metrics help determine ideal clustering thresholds and linkage schemes automatically.
Abstract
Clustering techniques are consolidated as a powerful strategy for analyzing the extensive data generated from molecular modeling. In particular, some tools have been developed to cluster configurations from classical simulations with a standard focus on individual units, ranging from small molecules to complex proteins. Since the standard approach includes computing the root mean square deviation (RMSD) of atomic positions, accounting for the permutation between atoms is crucial for optimizing the clustering procedure in the presence of identical molecules. To address this issue, we present the clusttraj program, a solvent-informed clustering package that fixes inflated RMSD values by finding the optimal pairing between configurations. The program combines reordering schemes with the Kabsch algorithm to minimize the RMSD of molecular configurations before running a hierarchical…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Molecular spectroscopy and chirality
