Predicting Gas-Particle Partitioning Coefficients of Atmospheric Molecules with Machine Learning
Emma Lumiaro, Milica Todorovi\'c, Theo Kurten, Hanna Vehkam\"aki and, Patrick Rinke

TL;DR
This paper develops a machine learning model to accurately predict gas-particle partitioning coefficients of atmospheric organic molecules, facilitating better understanding of secondary organic aerosol formation.
Contribution
The study introduces a kernel ridge regression model trained on COSMOtherm data to predict partitioning coefficients from molecular structure, achieving accuracy comparable to experimental data.
Findings
ML model predicts $P_{sat}$ and $K_{WIOM/G}$ within 0.3 log units
Predictions are as accurate as COSMOtherm calculations
Model performs well on polyfunctional, oxidized molecules
Abstract
The formation, properties and lifetime of secondary organic aerosols in the atmosphere are largely determined by gas-particle partitioning coefficients of the participating organic vapours. Since these coefficients are often difficult to measure or compute, we developed a machine learning (ML) model to predict them given molecular structure as input. Our data-driven approach is based on the dataset by Wang et al. (Atmos. Chem. Phys., 17, 7529 (2017)), who computed the partitioning coefficients and saturation vapour pressures of 3414 atmospheric oxidation products from the master chemical mechanism using the COSMOtherm program. We train a kernel ridge regression (KRR) ML model on the saturation vapour pressure (), and on two equilibrium partitioning coefficients: between a water-insoluble organic matter phase and the gas phase (), and between an infinitely dilute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
