ROIDS: Robust Outlier-Aware Informed Down-Sampling
Alina Geiger, Martin Briesch, Dominik Sobania, Franz Rothlauf

TL;DR
ROIDS introduces an outlier-aware down-sampling method that improves symbolic regression performance by excluding outliers, preventing overfitting, and outperforming existing approaches on synthetic and real-world benchmarks.
Contribution
ROIDS is a novel outlier-aware down-sampling technique that enhances symbolic regression by mitigating outlier effects and maintaining IDS advantages.
Findings
ROIDS outperforms IDS on synthetic problems with outliers.
ROIDS surpasses IDS on over 80% of real-world benchmark problems.
ROIDS achieves the best average rank among baseline methods.
Abstract
Informed down-sampling (IDS) is known to improve performance in symbolic regression when combined with various selection strategies, especially tournament selection. However, recent work found that IDS's gains are not consistent across all problems. Our analysis reveals that IDS performance is worse for problems containing outliers. IDS systematically favors including outliers in subsets which pushes GP towards finding solutions that overfit to outliers. To address this, we introduce ROIDS (Robust Outlier-Aware Informed Down-Sampling), which excludes potential outliers from the sampling process of IDS. With ROIDS it is possible to keep the advantages of IDS without overfitting to outliers and to compete on a wide range of benchmark problems. This is also reflected in our experiments in which ROIDS shows the desired behavior on all studied benchmark problems. ROIDS consistently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Imbalanced Data Classification Techniques
