RankML: a Meta Learning-Based Approach for Pre-Ranking Machine Learning Pipelines
Doron Laadan, Roman Vainshtein, Yarden Curiel, Gilad Katz, Lior Rokach

TL;DR
RankML is a meta-learning approach that efficiently predicts and ranks machine learning pipelines' performance on new datasets, reducing computational costs compared to existing methods.
Contribution
This paper introduces RankML, a novel meta-learning method for pre-ranking ML pipelines, significantly saving time and resources while maintaining accuracy.
Findings
Outperforms or matches state-of-the-art methods
Reduces computational time and resources
Effective on diverse datasets in classification and regression
Abstract
The explosion of digital data has created multiple opportunities for organizations and individuals to leverage machine learning (ML) to transform the way they operate. However, the shortage of experts in the field of machine learning -- data scientists -- is often a setback to the use of ML. In an attempt to alleviate this shortage, multiple approaches for the automation of machine learning have been proposed in recent years. While these approaches are effective, they often require a great deal of time and computing resources. In this study, we propose RankML, a meta-learning based approach for predicting the performance of whole machine learning pipelines. Given a previously-unseen dataset, a performance metric, and a set of candidate pipelines, RankML immediately produces a ranked list of all pipelines based on their predicted performance. Extensive evaluation on 244 datasets, both in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Machine Learning and Algorithms
