Comparing human and automatic thesaurus mapping approaches in the   agricultural domain

Boris Lauser; Gudrun Johannsen; Caterina Caracciolo; Johannes Keizer,; Willem Robert van Hage; Philipp Mayr

arXiv:0808.2246·cs.DL·January 15, 2019

Comparing human and automatic thesaurus mapping approaches in the agricultural domain

Boris Lauser, Gudrun Johannsen, Caterina Caracciolo, Johannes Keizer,, Willem Robert van Hage, Philipp Mayr

PDF

TL;DR

This paper compares human and automatic thesaurus mapping methods in agriculture, highlighting their strengths, limitations, and potential for complementarity to improve knowledge organization systems.

Contribution

It provides an empirical comparison of human and machine mapping approaches for AGROVOC, identifying their respective advantages and challenges.

Findings

01

Automatic mapping has limitations with complex cases

02

Human mapping is more accurate but labor-intensive

03

Combining both approaches can enhance mapping quality

Abstract

Knowledge organization systems (KOS), like thesauri and other controlled vocabularies, are used to provide subject access to information systems across the web. Due to the heterogeneity of these systems, mapping between vocabularies becomes crucial for retrieving relevant information. However, mapping thesauri is a laborious task, and thus big efforts are being made to automate the mapping process. This paper examines two mapping approaches involving the agricultural thesaurus AGROVOC, one machine-created and one human created. We are addressing the basic question "What are the pros and cons of human and automatic mapping and how can they complement each other?" By pointing out the difficulties in specific cases or groups of cases and grouping the sample into simple and difficult types of mappings, we show the limitations of current automatic methods and come up with some basic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.