# Unsupervised Classification of Local Clathrate Hydrate Structures

**Authors:** Xinrui Cai, Alberto Striolo, Matteo Salvalaglio

PMC · DOI: 10.1021/acs.jpcc.5c07202 · The Journal of Physical Chemistry. C, Nanomaterials and Interfaces · 2026-02-16

## TL;DR

This paper introduces a new algorithm to classify and analyze the molecular structures of clathrate hydrates, improving understanding of their properties and behavior.

## Contribution

A novel cavity-finder algorithm integrated with DBSCAN enables accurate unsupervised classification of local clathrate hydrate structures.

## Key findings

- The new algorithm successfully identifies coexisting water states in clathrate hydrates.
- It accurately detects hydrate cavities by analyzing molecular rings around voids.
- The method works robustly for structure I CO2 and structure II mixed CH4/Dioxane hydrates.

## Abstract

Quantifying and differentiating
the structural characteristics
of clathrate hydrates at the molecular level is crucial for understanding
the properties that underpin hydrate-based technologies. While useful,
current approaches lack sufficient resolution to discern, e.g., interfacial
and dynamical structures. In this study, we present an algorithm based
on Density-Based Spatial Clustering of Applications with Noise (DBSCAN)
that accurately identifies different water states coexisting within
clathrate hydrates. A key novel component is an effective cavity-finder
algorithm, which provides input to the clustering framework. The new
algorithm detects hydrate cavities by analyzing the number and type
of constituent molecular rings around voids. Integrating the new algorithm
with widely used order parameters (e.g., F3, F4, and F4t) provides
a powerful and accurate tool for analyzing hydrate structures at interfaces
and phase transitions. The performance of the new algorithm is assessed
for structure I (sI) CO2 hydrates and for structure II
(sII) mixed CH4/Dioxane hydrates, demonstrating its robustness
and adaptability across different clathrates. Crucially, the proposed
algorithm enables us to identify partially ordered structures characteristic
of the quasi-liquid layer, thereby capturing interfacial dynamics
and molecular-scale details essential for understanding hydrates under
realistic conditions.

## Linked entities

- **Chemicals:** CO2 (PubChem CID 280), CH4 (PubChem CID 297), Dioxane (PubChem CID 31275)

## Full-text entities

- **Diseases:** sI (MESH:D020914), ice (MESH:C535741)
- **Chemicals:** CO2 (MESH:D002245), THF (MESH:C018674), H (MESH:D006859), ice (MESH:D007053), 1,3-Dioxane (-), Dioxane (MESH:C025223), oil (MESH:D009821), Water (MESH:D014867), O (MESH:D010100), CH4 (MESH:D008697), Carbon (MESH:D002244)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12951555/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12951555/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC12951555/full.md

---
Source: https://tomesphere.com/paper/PMC12951555