Evaluating Large Language Models for Material Selection
Daniele Grandi, Yash Patawari Jain, Allin Groom, Brandon Cramer,, Christopher McComb

TL;DR
This paper evaluates the effectiveness of Large Language Models in material selection for product design, comparing their recommendations to expert choices and analyzing factors affecting their performance.
Contribution
It introduces a systematic evaluation of LLMs for material selection, highlighting their limitations and potential through prompt engineering and hyperparameter tuning.
Findings
LLMs often diverge from expert recommendations.
Parallel prompting improves LLM performance.
Significant variability in LLM suggestions depending on configurations.
Abstract
Material selection is a crucial step in conceptual design due to its significant impact on the functionality, aesthetics, manufacturability, and sustainability impact of the final product. This study investigates the use of Large Language Models (LLMs) for material selection in the product design process and compares the performance of LLMs against expert choices for various design scenarios. By collecting a dataset of expert material preferences, the study provides a basis for evaluating how well LLMs can align with expert recommendations through prompt engineering and hyperparameter tuning. The divergence between LLM and expert recommendations is measured across different model configurations, prompt strategies, and temperature settings. This approach allows for a detailed analysis of factors influencing the LLMs' effectiveness in recommending materials. The results from this study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Manufacturing Process and Optimization
