Machine Learning for Electrode Materials: Property Prediction via Composition
Hao Wu, Cameron Hargreaves, Arpit Mishra, Gian-Marco Rignanese

TL;DR
This paper benchmarks three machine learning models—MODNet, CrabNet, and a random forest—for predicting electrode material properties, demonstrating CrabNet's superior performance and discussing practical challenges in materials discovery workflows.
Contribution
It provides a comprehensive comparison of ML frameworks for electrode property prediction, validating results with statistical methods and analyzing feature embeddings.
Findings
CrabNet outperforms other models across all tests.
Unsupervised clustering reveals coherent material groupings.
ML models are effective for early-stage compositional screening.
Abstract
In this work, we benchmark three leading Machine Learning (ML) frameworks-MODNet, CrabNet, and a random forest model based on Magpie feature-for predicting properties of battery electrode materials using the Materials Project Battery Explorer dataset. We evaluate these models based on predictive accuracy, visualize numerical features using two-dimensional embeddings, and quantify performance using standard metrics. Our results demonstrate that CrabNet consistently outperforms the other models across all tests. To validate these findings, we employ robust statistical methods: bootstrap resampling and two cross-validation (CV) strategies (leave one cluster out and stratified 5-fold CV), comparing each model against a control baseline. In addition, we apply unsupervised clustering on MODNet-derived features using t-SNE and DBSCAN, revealing coherent material groupings without prior labels.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
