A data driven approach to classify descriptors based on their efficiency in translating noisy trajectories into physically-relevant information
Simone Martino, Domiziano Doria, Chiara Lionello, Matteo Becchi, Giovanni M. Pavan

TL;DR
This paper introduces a data-driven method to evaluate and rank descriptors based on their ability to extract physically relevant information from noisy molecular dynamics trajectories, emphasizing the importance of noise handling.
Contribution
It presents a novel approach to compare descriptor efficiency in noisy environments, applying it to molecular systems and highlighting the impact of denoising on descriptor performance.
Findings
Advanced descriptors like SOAP and LENS outperform classical ones.
Simple descriptors can rival advanced ones after local denoising.
Denoising can transform weak descriptors into effective tools.
Abstract
Reconstructing the physical complexity of many-body dynamical systems can be challenging. Starting from the trajectories of their constitutive units (raw data), typical approaches require selecting appropriate descriptors to convert them into time-series, which are then analyzed to extract interpretable information. However, identifying the most effective descriptor is often non-trivial. Here, we report a data-driven approach to compare the efficiency of various descriptors in extracting information from noisy trajectories and translating it into physically relevant insights. As a prototypical system with non-trivial internal complexity, we analyze molecular dynamics trajectories of an atomistic system where ice and water coexist in equilibrium near the solid/liquid transition temperature. We compare general and specific descriptors often used in aqueous systems: number of neighbors,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Time Series Analysis and Forecasting · Image Retrieval and Classification Techniques
