RaTE: a Reproducible automatic Taxonomy Evaluation by Filling the Gap

Tianjian Gao; Phillipe Langlais

arXiv:2307.09706·cs.CL·July 20, 2023

RaTE: a Reproducible automatic Taxonomy Evaluation by Filling the Gap

Tianjian Gao, Phillipe Langlais

PDF

Open Access 1 Repo

TL;DR

RaTE introduces a reproducible, label-free automatic evaluation method for taxonomies using large pre-trained language models, aligning well with human judgments and enabling consistent assessment of taxonomy quality.

Contribution

The paper presents RaTE, a novel automatic taxonomy evaluation method that reduces reliance on manual scoring and correlates strongly with human judgments.

Findings

01

RaTE correlates well with human judgments

02

Degrading a taxonomy decreases RaTE score

03

RaTE provides a reproducible evaluation framework

Abstract

Taxonomies are an essential knowledge representation, yet most studies on automatic taxonomy construction (ATC) resort to manual evaluation to score proposed algorithms. We argue that automatic taxonomy evaluation (ATE) is just as important as taxonomy construction. We propose RaTE, an automatic label-free taxonomy scoring procedure, which relies on a large pre-trained language model. We apply our evaluation procedure to three state-of-the-art ATC algorithms with which we built seven taxonomies from the Yelp domain, and show that 1) RaTE correlates well with human judgments and 2) artificially degrading a taxonomy leads to decreasing RaTE score.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cestlucas/rate
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Natural Language Processing Techniques · Advanced Text Analysis Techniques