MATRIX: A Multimodal Benchmark and Post-Training Framework for Materials Science
Delia McGrath, Curtis Chong, Rohil Kulkarni, Gerbrand Ceder, Adeesh Kolluru

TL;DR
MATRIX introduces a multimodal benchmark and post-training framework for materials science that leverages experimental images alongside text to improve scientific reasoning and interpretation of experimental data.
Contribution
The paper presents a new benchmark and demonstrates that incorporating visual data during post-training enhances reasoning accuracy in materials science tasks.
Findings
Visual supervision improves experimental interpretation by 10-25%.
Post-training with multimodal data yields 5-16% gains on text-only reasoning.
Cross-modal alignment is crucial for effective transfer.
Abstract
Scientific reasoning in materials science requires integrating multimodal experimental evidence with underlying physical theory. Existing benchmarks make it difficult to assess whether incorporating visual experimental data during post-training improves mechanism-grounded explanation reasoning beyond text-only supervision. We introduce MATRIX, a multimodal benchmark for materials science reasoning that evaluates foundational theory, research-level reasoning, and the interpretation of real experimental artifacts across multiple characterization modalities. Using MATRIX as a controlled diagnostic, we isolate the effect of visual grounding by comparing post-training on structured materials science text alone with post-training that incorporates paired experimental images. Despite using relatively small amounts of multimodal data, visual supervision improves experimental interpretation by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Machine Learning in Materials Science · Topic Modeling
