Aggregation as Unsupervised Learning and its Evaluation

Maria Ulan; Welf L\"owe; Morgan Ericsson; Anna Wingkvist

arXiv:2110.15136·cs.LG·October 29, 2021

Aggregation as Unsupervised Learning and its Evaluation

Maria Ulan, Welf L\"owe, Morgan Ericsson, Anna Wingkvist

PDF

Open Access

TL;DR

This paper introduces a novel unsupervised aggregation method that leverages intrinsic properties of unlabeled data, demonstrating superior performance over existing approaches and comparable results to linear regression.

Contribution

The paper presents a new unsupervised aggregation approach based on data properties, along with an empirical evaluation framework comparing it to other methods.

Findings

01

Our approach outperforms other data-agnostic and unsupervised aggregation methods.

02

It is nearly as effective as linear regression in predicting latent ground truth.

03

The evaluation framework assesses both property preservation and ground truth prediction.

Abstract

Regression uses supervised machine learning to find a model that combines several independent variables to predict a dependent variable based on ground truth (labeled) data, i.e., tuples of independent and dependent variables (labels). Similarly, aggregation also combines several independent variables to a dependent variable. The dependent variable should preserve properties of the independent variables, e.g., the ranking or relative distance of the independent variable tuples, and/or represent a latent ground truth that is a function of these independent variables. However, ground truth data is not available for finding the aggregation model. Consequently, aggregation models are data agnostic or can only be derived with unsupervised machine learning approaches. We introduce a novel unsupervised aggregation approach based on intrinsic properties of unlabeled training data, such as the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification