Graph Neural Networks for Surfactant Multi-Property Prediction
Christoforos Brozos, Jan G. Rittig, Sandip Bhattacharya, Elie Akanny,, Christina Kohlmann, Alexander Mitsos

TL;DR
This paper develops and evaluates graph neural network models for predicting key surfactant properties, CMC and surface excess concentration, using the largest available datasets and advanced learning strategies, demonstrating high accuracy and industrial relevance.
Contribution
It introduces the largest CMC and surface excess concentration datasets for surfactants and develops multi-task GNN models with ensemble learning that outperform previous methods.
Findings
Multi-task GNN with ensemble learning performs best.
Models accurately predict CMC for industrial surfactants.
Large datasets improve GNN predictive performance.
Abstract
Surfactants are of high importance in different industrial sectors such as cosmetics, detergents, oil recovery and drug delivery systems. Therefore, many quantitative structure-property relationship (QSPR) models have been developed for surfactants. Each predictive model typically focuses on one surfactant class, mostly nonionics. Graph Neural Networks (GNNs) have exhibited a great predictive performance for property prediction of ionic liquids, polymers and drugs in general. Specifically for surfactants, GNNs can successfully predict critical micelle concentration (CMC), a key surfactant property associated with micellization. A key factor in the predictive ability of QSPR and GNN models is the data available for training. Based on extensive literature search, we create the largest available CMC database with 429 molecules and the first large data collection for surface excess…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurfactants and Colloidal Systems · Machine Learning in Materials Science · Computational Drug Discovery Methods
