The Impact of Feature Scaling In Machine Learning: Effects on Regression and Classification Tasks

Jo\~ao Manoel Herrera Pinheiro; Suzana Vilas Boas de Oliveira; Thiago Henrique Segreto Silva; Pedro Antonio Rabelo Saraiva; Enzo Ferreira de Souza; Ricardo V. Godoy; Leonardo Andr\'e Ambrosio; Marcelo Becker

arXiv:2506.08274·cs.LG·November 24, 2025

The Impact of Feature Scaling In Machine Learning: Effects on Regression and Classification Tasks

Jo\~ao Manoel Herrera Pinheiro, Suzana Vilas Boas de Oliveira, Thiago Henrique Segreto Silva, Pedro Antonio Rabelo Saraiva, Enzo Ferreira de Souza, Ricardo V. Godoy, Leonardo Andr\'e Ambrosio, Marcelo Becker

PDF

TL;DR

This study systematically evaluates 12 feature scaling techniques across 14 machine learning algorithms and 16 datasets, revealing how scaling impacts performance and computational costs, with insights tailored to specific models.

Contribution

It provides a comprehensive empirical analysis of feature scaling effects on diverse models and datasets, offering practical guidance for selecting optimal scaling methods.

Findings

01

Ensemble methods are largely insensitive to feature scaling.

02

Logistic Regression, SVMs, TabNet, and MLPs show performance heavily influenced by scaling.

03

Scaling choice significantly affects model accuracy and computational efficiency.

Abstract

This research addresses the critical lack of comprehensive studies on feature scaling by systematically evaluating 12 scaling techniques - including several less common transformations - across 14 different Machine Learning algorithms and 16 datasets for classification and regression tasks. We meticulously analyzed impacts on predictive performance (using metrics such as accuracy, MAE, MSE, and $R^{2}$ ) and computational costs (training time, inference time, and memory usage). Key findings reveal that while ensemble methods (such as Random Forest and gradient boosting models like XGBoost, CatBoost and LightGBM) demonstrate robust performance largely independent of scaling, other widely used models such as Logistic Regression, SVMs, TabNet, and MLPs show significant performance variations highly dependent on the chosen scaler. This extensive empirical analysis, with all source code,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGated Linear Unit · Dense Connections · Masked autoencoder · Batch Normalization · TabNet · Logistic Regression