Conceptually Diverse Base Model Selection for Meta-Learners in Concept Drifting Data Streams
Helen McKay, Nathan Griffiths, Phillip Taylor

TL;DR
This paper introduces a novel method for selecting diverse base models in meta-learning for concept drifting data streams by estimating conceptual similarity via Principal Angles, improving efficiency and performance.
Contribution
It proposes two new approaches using conceptual similarity for base model selection, reducing computational overhead while maintaining predictive accuracy in online transfer learning.
Findings
Conceptual similarity thresholding reduces computational cost.
Both proposed methods achieve comparable predictive performance.
Conceptual clustering performs well without parameter tuning.
Abstract
Meta-learners and ensembles aim to combine a set of relevant yet diverse base models to improve predictive performance. However, determining an appropriate set of base models is challenging, especially in online environments where the underlying distribution of data can change over time. In this paper, we present a novel approach for estimating the conceptual similarity of base models, which is calculated using the Principal Angles (PAs) between their underlying subspaces. We propose two methods that use conceptual similarity as a metric to obtain a relevant yet diverse subset of base models: (i) parameterised threshold culling and (ii) parameterless conceptual clustering. We evaluate these methods against thresholding using common ensemble pruning metrics, namely predictive performance and Mutual Information (MI), in the context of online Transfer Learning (TL), using both synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Advanced Bandit Algorithms Research
MethodsPruning · Balanced Selection
