ZEBRA-Prop: A Zero-Shot Embedding-Based Rapid and Accessible Regression Model for Materials Properties

Ryoma Yamamoto; Akira Takahashi; Kei Terayama; Yu Kumagai; Fumiyasu Oba

arXiv:2603.26060·cond-mat.mtrl-sci·March 30, 2026

ZEBRA-Prop: A Zero-Shot Embedding-Based Rapid and Accessible Regression Model for Materials Properties

Ryoma Yamamoto, Akira Takahashi, Kei Terayama, Yu Kumagai, Fumiyasu Oba

PDF

TL;DR

ZEBRA-Prop is a zero-shot, embedding-based regression model for materials properties that reduces training time by 95% and eliminates fine-tuning, enabling rapid and accessible materials property prediction.

Contribution

It introduces ZEBRA-Prop, a novel zero-shot framework that leverages specialized LLMs and multiple textual embeddings for efficient materials property prediction without fine-tuning.

Findings

01

Performance close to LLM-Prop on two datasets

02

Training time reduced by approximately 95%

03

Effective integration of diverse textual representations

Abstract

Large language models (LLMs) exhibit substantial potential across diverse scientific disciplines, including materials science. A property prediction framework, ZEBRA-Prop (Zero-Shot Embedding-Based Rapid and Accessible Regression Model for Materials Properties), is presented here as an extension of LLM-Prop. In contrast to LLM-Prop, which requires task-specific fine-tuning of the LLM, ZEBRA-Prop eliminates fine-tuning, thereby reducing computational cost and enabling rapid model training. The framework employs MatTPUSciBERT, an LLM specialized for materials science, to enhance predictive capability. Multiple textual embeddings are incorporated through a learnable weighting mechanism, which alleviates the context-length constraints inherent in LLM-Prop and facilitates effective integration of diverse textual representations. Evaluation is conducted using two datasets: the TextEdge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.