Aggregation as Unsupervised Learning and its Evaluation
Maria Ulan, Welf L\"owe, Morgan Ericsson, Anna Wingkvist

TL;DR
This paper introduces a novel unsupervised aggregation method that leverages intrinsic properties of unlabeled data, demonstrating superior performance over existing approaches and comparable results to linear regression.
Contribution
The paper presents a new unsupervised aggregation approach based on data properties, along with an empirical evaluation framework comparing it to other methods.
Findings
Our approach outperforms other data-agnostic and unsupervised aggregation methods.
It is nearly as effective as linear regression in predicting latent ground truth.
The evaluation framework assesses both property preservation and ground truth prediction.
Abstract
Regression uses supervised machine learning to find a model that combines several independent variables to predict a dependent variable based on ground truth (labeled) data, i.e., tuples of independent and dependent variables (labels). Similarly, aggregation also combines several independent variables to a dependent variable. The dependent variable should preserve properties of the independent variables, e.g., the ranking or relative distance of the independent variable tuples, and/or represent a latent ground truth that is a function of these independent variables. However, ground truth data is not available for finding the aggregation model. Consequently, aggregation models are data agnostic or can only be derived with unsupervised machine learning approaches. We introduce a novel unsupervised aggregation approach based on intrinsic properties of unlabeled training data, such as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
