Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development
Francesca Marzi, Giordano d'Aloisio, Antinisca Di Marco, and Giovanni, Stilo

TL;DR
This paper investigates the prediction of machine learning training times to enhance automated model selection, analyzing the strengths and limitations of the FPTC approach for logistic regression and random forest models.
Contribution
It provides an empirical evaluation of the FPTC approach, highlighting its context-dependent nature and lack of generalizability for training time prediction.
Findings
FPTC formalizes training time as a function of dataset and model parameters.
Prediction accuracy is highly dependent on dataset context.
FPTC approach is not universally applicable across different datasets.
Abstract
The problem of predicting the training time of machine learning (ML) models has become extremely relevant in the scientific community. Being able to predict a priori the training time of an ML model would enable the automatic selection of the best model both in terms of energy efficiency and in terms of performance in the context of, for instance, MLOps architectures. In this paper, we present the work we are conducting towards this direction. In particular, we present an extensive empirical study of the Full Parameter Time Complexity (FPTC) approach by Zheng et al., which is, to the best of our knowledge, the only approach formalizing the training time of ML models as a function of both dataset's and model's parameters. We study the formulations proposed for the Logistic Regression and Random Forest classifiers, and we highlight the main strengths and weaknesses of the approach.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications
MethodsLogistic Regression
