Top-Rank-Focused Adaptive Vote Collection for the Evaluation of   Domain-Specific Semantic Models

Pierangelo Lombardo; Alessio Boiardi; Luca Colombo; Angelo Schiavone,; Nicol\`o Tamagnone

arXiv:2010.04486·cs.CL·November 24, 2020

Top-Rank-Focused Adaptive Vote Collection for the Evaluation of Domain-Specific Semantic Models

Pierangelo Lombardo, Alessio Boiardi, Luca Colombo, Angelo Schiavone,, Nicol\`o Tamagnone

PDF

1 Repo

TL;DR

This paper introduces a new method for creating and evaluating domain-specific semantic models focused on accurately ranking top related words or texts, using adaptive comparisons and specialized metrics.

Contribution

It presents a novel protocol for constructing top-rank-focused evaluation datasets, new metrics for assessment, and a stochastic model to validate the dataset's effectiveness.

Findings

01

The dataset construction protocol improves top-rank evaluation accuracy.

02

New ranking metrics better capture top-rank relevance.

03

The stochastic model confirms the protocol's effectiveness.

Abstract

The growth of domain-specific applications of semantic models, boosted by the recent achievements of unsupervised embedding learning algorithms, demands domain-specific evaluation datasets. In many cases, content-based recommenders being a prime example, these models are required to rank words or texts according to their semantic relatedness to a given concept, with particular focus on top ranks. In this work, we give a threefold contribution to address these requirements: (i) we define a protocol for the construction, based on adaptive pairwise comparisons, of a relatedness-based evaluation dataset tailored on the available resources and optimized to be particularly accurate in top-rank evaluation; (ii) we define appropriate metrics, extensions of well-known ranking correlation coefficients, to evaluate a semantic model via the aforementioned dataset by taking into account the greater…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

intervieweb-datascience/adaptive-comp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.