Intelligent Materials Modelling: Large Language Models Versus Partial Least Squares Regression for Predicting Polysulfone Membrane Mechanical Performance
Dingding Cao, Mieow Kee Chan, Wan Sieng Yeo, Said Bey, Alberto Figoli

TL;DR
This study compares large language models and PLS regression in predicting polysulfone membrane mechanical properties, finding LLMs excel in non-linear, data-scarce scenarios while PLS remains effective for linear properties, suggesting hybrid approaches for materials discovery.
Contribution
It benchmarks LLMs against PLS regression for membrane property prediction, highlighting their complementary strengths and proposing hybrid models for small-data materials science applications.
Findings
LLMs significantly reduce prediction errors for elongation at break.
Run-to-run variability is much lower for LLMs compared to PLS.
Linear properties like Young's modulus and tensile strength are well-predicted by PLS.
Abstract
Predicting the mechanical properties of polysulfone (PSF) membranes from structural descriptors remains challenging due to extreme data scarcity typical of experimental studies. To investigate this issue, this study benchmarked knowledge-driven inference using four large language models (LLMs) (DeepSeek-V3, DeepSeek-R1, ChatGPT-4o, and GPT-5) against partial least squares (PLS) regression for predicting Young's modulus (E), tensile strength (TS), and elongation at break (EL) based on pore diameter (PD), contact angle (CA), thickness (T), and porosity (P) measurements. These knowledge-driven approaches demonstrated property-specific advantages over the chemometric baseline. For EL, LLMs achieved statistically significant improvements, with DeepSeek-R1 and GPT-5 delivering 40.5% and 40.3% of Root Mean Square Error reductions, respectively, reducing mean absolute errors from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Fuel Cells and Related Materials · Advanced Sensor and Energy Harvesting Materials
