DataScribe: An AI-Native, Policy-Aligned Web Platform for Multi-Objective Materials Design and Discovery
Divyanshu Singh, Doguhan Sar{\i}t\"urk, Cameron Lea, Md Shafiqul Islam, Raymundo Arroyave, Vahid Attari

TL;DR
DataScribe is a comprehensive, AI-native platform that integrates data management, machine learning, and optimization to accelerate materials discovery through closed-loop workflows across experimental and computational pipelines.
Contribution
It introduces a unified, policy-aligned platform with ontology-backed data ingestion, uncertainty-aware modeling, and multi-objective Bayesian optimization for materials research.
Findings
Successful case studies in electrochemical materials and high-entropy alloys.
Demonstrated real-time optimization and multi-objective trade space exploration.
Enabled reproducible, end-to-end data fusion and decision-making.
Abstract
The acceleration of materials discovery requires digital platforms that go beyond data repositories to embed learning, optimization, and decision-making directly into research workflows. We introduce DataScribe, an AI-native, cloud-based materials discovery platform that unifies heterogeneous experimental and computational data through ontology-backed ingestion and machine-actionable knowledge graphs. The platform integrates FAIR-compliant metadata capture, schema and unit harmonization, uncertainty-aware surrogate modeling, and native multi-objective multi-fidelity Bayesian optimization, enabling closed-loop propose-measure-learn workflows across experimental and computational pipelines. DataScribe functions as an application-layer intelligence stack, coupling data governance, optimization, and explainability rather than treating them as downstream add-ons. We validate the platform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Catalysis and Oxidation Reactions · Scientific Computing and Data Management
