Benchmarking Large Language Models for Polymer Property Predictions

Sonakshi Gupta; Akhlak Mahmood; Shivank Shukla; and Rampi Ramprasad

arXiv:2506.02129·cs.CE·February 20, 2026

Benchmarking Large Language Models for Polymer Property Predictions

Sonakshi Gupta, Akhlak Mahmood, Shivank Shukla, and Rampi Ramprasad

PDF

Open Access

TL;DR

This study benchmarks large language models like LLaMA-3 and GPT-3.5 for predicting polymer thermal properties, comparing their performance to traditional methods and analyzing their strengths and limitations in polymer informatics.

Contribution

It introduces a systematic benchmarking of open-source and commercial LLMs for polymer property prediction, highlighting their current capabilities and limitations compared to traditional fingerprinting approaches.

Findings

01

LLMs approach but do not surpass traditional models in accuracy

02

LLaMA-3 outperforms GPT-3.5 due to open-source flexibility

03

Single-task learning is more effective than multi-task for LLMs in this context

Abstract

Machine learning has revolutionized polymer science by enabling rapid property prediction and generative design. Large language models (LLMs) offer further opportunities in polymer informatics by simplifying workflows that traditionally rely on large labeled datasets, handcrafted representations, and complex feature engineering. LLMs leverage natural language inputs through transfer learning, eliminating the need for explicit fingerprinting and streamlining training. In this study, we finetune general purpose LLMs -- open-source LLaMA-3-8B and commercial GPT-3.5 -- on a curated dataset of 11,740 entries to predict key thermal properties: glass transition, melting, and decomposition temperatures. Using parameter-efficient fine-tuning and hyperparameter optimization, we benchmark these models against traditional fingerprinting-based approaches -- Polymer Genome, polyGNN, and polyBERT --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science