Enriching Taxonomies Using Large Language Models

Zeinab Ghamlouch; Mehwish Alam

arXiv:2602.22213·cs.IR·February 27, 2026

Enriching Taxonomies Using Large Language Models

Zeinab Ghamlouch, Mehwish Alam

PDF

Open Access

TL;DR

This paper introduces Taxoria, a pipeline that uses Large Language Models to enrich existing taxonomies by proposing and validating new nodes, thereby improving coverage and relevance for better knowledge organization.

Contribution

It presents a novel taxonomy enrichment method leveraging LLMs with validation, enhancing existing taxonomies beyond prior extraction-based approaches.

Findings

01

Enriched taxonomies show increased coverage and relevance.

02

Validation reduces hallucinations and improves semantic accuracy.

03

Visualization aids in analyzing the enriched taxonomy structure.

Abstract

Taxonomies play a vital role in structuring and categorizing information across domains. However, many existing taxonomies suffer from limited coverage and outdated or ambiguous nodes, reducing their effectiveness in knowledge retrieval. To address this, we present Taxoria, a novel taxonomy enrichment pipeline that leverages Large Language Models (LLMs) to enhance a given taxonomy. Unlike approaches that extract internal LLM taxonomies, Taxoria uses an existing taxonomy as a seed and prompts an LLM to propose candidate nodes for enrichment. These candidates are then validated to mitigate hallucinations and ensure semantic relevance before integration. The final output includes an enriched taxonomy with provenance tracking and visualization of the final merged taxonomy for analysis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques