HumanAL: Calibrating Human Matching Beyond a Single Task

Roee Shraga

arXiv:2205.03209·cs.DB·May 9, 2022·1 cites

HumanAL: Calibrating Human Matching Beyond a Single Task

Roee Shraga

PDF

Open Access

TL;DR

This paper introduces HumanAL, a method that models human annotator behavior to calibrate and improve label quality across various matching tasks using machine learning.

Contribution

It presents a novel approach to account for human error by building behavioral profiles and calibrating human input for better labeling accuracy.

Findings

01

Improves label quality in schema, entity, and text matching tasks

02

Effective across multiple domains and tasks

03

Utilizes black-box machine learning for calibration

Abstract

This work offers a novel view on the use of human input as labels, acknowledging that humans may err. We build a behavioral profile for human annotators which is used as a feature representation of the provided input. We show that by utilizing black-box machine learning, we can take into account human behavior and calibrate their input to improve the labeling quality. To support our claims and provide a proof-of-concept, we experiment with three different matching tasks, namely, schema matching, entity matching and text matching. Our empirical evaluation suggests that the method can improve the quality of gathered labels in multiple settings including cross-domain (across different matching tasks).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Natural Language Processing Techniques