RF+clust for Leave-One-Problem-Out Performance Prediction
Ana Nikolikj, Carola Doerr, Tome Eftimov

TL;DR
This paper introduces RF+clust, a method that enhances leave-one-problem-out performance prediction by calibrating random forest predictions with similarity-based performance data, addressing generalization issues in AutoML.
Contribution
The paper proposes a novel RF+clust approach that improves performance prediction accuracy in LOPO settings by combining random forest outputs with similarity-based calibration.
Findings
RF+clust improves prediction accuracy on several problems.
Predictive power depends on similarity threshold and feature selection.
Highlights importance of feature choice in zero-shot learning scenarios.
Abstract
Per-instance automated algorithm configuration and selection are gaining significant moments in evolutionary computation in recent years. Two crucial, sometimes implicit, ingredients for these automated machine learning (AutoML) methods are 1) feature-based representations of the problem instances and 2) performance prediction methods that take the features as input to estimate how well a specific algorithm instance will perform on a given problem instance. Non-surprisingly, common machine learning models fail to make predictions for instances whose feature-based representation is underrepresented or not covered in the training data, resulting in poor generalization ability of the models for problems not seen during training.In this work, we study leave-one-problem-out (LOPO) performance prediction. We analyze whether standard random forest (RF) model predictions can be improved by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research · Machine Learning and Data Classification
Methodsfail · Feature Selection
