Evaluation of Tree Based Regression over Multiple Linear Regression for Non-normally Distributed Data in Battery Performance
Shovan Chowdhury, Yuxiao Lin, Boryann Liaw, Leslie Kerby

TL;DR
This paper compares tree-based regression and multiple linear regression on non-normal, multicollinear battery data, demonstrating the superior performance of tree-based models like Random Forests in handling such complex datasets.
Contribution
The study provides a comparative analysis showing that tree-based regression models outperform linear models on non-normal, multicollinear battery datasets, highlighting their robustness and effectiveness.
Findings
Tree-based models outperform linear regression on non-normal, multicollinear data.
Random Forests with bagging reduce overfitting and improve accuracy.
Linear regression requires data transformation to achieve comparable performance.
Abstract
Battery performance datasets are typically non-normal and multicollinear. Extrapolating such datasets for model predictions needs attention to such characteristics. This study explores the impact of data normality in building machine learning models. In this work, tree-based regression models and multiple linear regressions models are each built from a highly skewed non-normal dataset with multicollinearity and compared. Several techniques are necessary, such as data transformation, to achieve a good multiple linear regression model with this dataset; the most useful techniques are discussed. With these techniques, the best multiple linear regression model achieved an R^2 = 81.23% and exhibited no multicollinearity effect for the dataset used in this study. Tree-based models perform better on this dataset, as they are non-parametric, capable of handling complex relationships among…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Battery Technologies Research · Machine Learning and ELM
MethodsLinear Regression
