EC3: Combining Clustering and Classification for Ensemble Learning
Tanmoy Chakraborty

TL;DR
This paper introduces EC3, a novel ensemble learning algorithm that combines classification and clustering to improve prediction accuracy, especially in scenarios with limited labeled data and imbalanced classes.
Contribution
The paper presents EC3, a new convex optimization-based method that merges multiple classifiers and clusterers, along with iEC3 for imbalanced data, outperforming existing methods.
Findings
EC3 outperforms 14 baseline methods on 13 datasets with up to 10% higher AUC.
EC3 is 1.21 times faster than the best baseline.
EC3 is more resilient to noise and class imbalance.
Abstract
Classification and clustering algorithms have been proved to be successful individually in different contexts. Both of them have their own advantages and limitations. For instance, although classification algorithms are more powerful than clustering methods in predicting class labels of objects, they do not perform well when there is a lack of sufficient manually labeled reliable data. On the other hand, although clustering algorithms do not produce label information for objects, they provide supplementary constraints (e.g., if two objects are clustered together, it is more likely that the same label is assigned to both of them) that one can leverage for label prediction of a set of unknown objects. Therefore, systematic utilization of both these types of algorithms together can lead to better prediction performance. In this paper, We propose a novel algorithm, called EC3 that merges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
