TaxoBell: Gaussian Box Embeddings for Self-Supervised Taxonomy Expansion

Sahil Mishra; Srinitish Srinivasan; Srikanta Bedathur; Tanmoy Chakraborty

arXiv:2601.09633·cs.CL·January 15, 2026

TaxoBell: Gaussian Box Embeddings for Self-Supervised Taxonomy Expansion

Sahil Mishra, Srinitish Srinivasan, Srikanta Bedathur, Tanmoy Chakraborty

PDF

Open Access

TL;DR

TaxoBell introduces Gaussian box embeddings for automated taxonomy expansion, effectively modeling hierarchical and ambiguous relationships with improved stability and interpretability, outperforming existing methods on multiple benchmarks.

Contribution

It proposes a novel Gaussian box embedding framework that enhances stability, uncertainty modeling, and ambiguity handling in taxonomy expansion tasks.

Findings

01

Outperforms state-of-the-art baselines by 19% in MRR

02

Achieves around 25% improvement in Recall@k

03

Demonstrates robustness and interpretability through extensive experiments

Abstract

Taxonomies form the backbone of structured knowledge representation across diverse domains, enabling applications such as e-commerce catalogs, semantic search, and biomedical discovery. Yet, manual taxonomy expansion is labor-intensive and cannot keep pace with the emergence of new concepts. Existing automated methods rely on point-based vector embeddings, which model symmetric similarity and thus struggle with the asymmetric "is-a" relationships that are fundamental to taxonomies. Box embeddings offer a promising alternative by enabling containment and disjointness, but they face key issues: (i) unstable gradients at the intersection boundaries, (ii) no notion of semantic uncertainty, and (iii) limited capacity to represent polysemy or ambiguity. We address these shortcomings with TaxoBell, a Gaussian box embedding framework that translates between box geometries and multivariate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Advanced Graph Neural Networks · Topic Modeling