Reference-Free Evaluation of Taxonomies

Pascal Wullschleger; Majid Zarharan; Donnacha Daly; Marc Pouly; Jennifer Foster

arXiv:2505.11470·cs.CL·January 7, 2026

Reference-Free Evaluation of Taxonomies

Pascal Wullschleger, Majid Zarharan, Donnacha Daly, Marc Pouly, Jennifer Foster

PDF

Open Access

TL;DR

This paper presents two novel reference-free metrics for evaluating taxonomies without labels, assessing robustness and logical adequacy, and demonstrating their effectiveness in predicting hierarchical classification performance.

Contribution

Introduces two innovative reference-free metrics for taxonomy quality evaluation, addressing limitations of existing metrics and enabling prediction of downstream classification performance.

Findings

01

Metrics correlate well with F1 against ground truth taxonomies.

02

Metrics effectively predict downstream hierarchical classification performance.

03

Proposed methods evaluate robustness and logical adequacy of taxonomies.

Abstract

We introduce two reference-free metrics for quality evaluation of taxonomies in the absence of labels. The first metric evaluates robustness by calculating the correlation between semantic and taxonomic similarity, addressing error types not considered by existing metrics. The second uses Natural Language Inference to assess logical adequacy. Both metrics are tested on five taxonomies and are shown to correlate well with F1 against ground truth taxonomies. We further demonstrate that our metrics can predict downstream performance in hierarchical classification when used with label hierarchies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Data Quality and Management