Composition and Structure Based GGA Bandgap Prediction Using Machine Learning Approach
Mukesh K. Choudhary, Amal Raj V, Gowri Sankar S, P. Ravindran

TL;DR
This paper develops machine learning models to accurately predict GGA bandgaps from composition and structure, validated by DFT calculations, with ensemble methods outperforming standalone models.
Contribution
It introduces ensemble machine learning approaches for bandgap prediction, achieving high accuracy and validating predictions with DFT calculations for new compounds.
Findings
RF model achieved R2 of 0.943 and RMSE of 0.504 eV.
Ensemble bagging models achieved R2 of 0.948 and RMSE of 0.479 eV.
Predicted new compounds are narrow bandgap semiconductors validated by DFT.
Abstract
This study focuses on developing precise machine learning (ML) regression models for predicting energy bandgap values based on chemical compositions and crystal structures. The primary aim is to match the accuracy of predictions derived from GGA-PBE calculations and validate them through density functional theory (DFT)-based band structure calculations. We assessed eight standalone ML regression models, including AdaBoost, Bagging, CatBoost, LGBM, RF, DT, GB, and XGB. These models were analyzed for their ability to predict GGA-PBE bandgap values across diverse material structures and compositions, using a dataset containing bandgap values for 106,113 compounds. Additionally, we constructed four ensemble models using the stacking method and seven using the bagging method. These ensemble models incorporated RidgeCV and LassoCV to explore if ensemble techniques could enhance prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Heusler alloys: electronic and magnetic properties · Surface and Thin Film Phenomena
