Hyperbolic Residual Quantization: Discrete Representations for Data with Latent Hierarchies
Piotr Pi\k{e}kos, Subhradeep Kayal, Alexandros Karatzoglou

TL;DR
This paper introduces Hyperbolic Residual Quantization (HRQ), a novel method that embeds data in hyperbolic space to better capture hierarchical structures in discrete representations, outperforming traditional Euclidean-based methods.
Contribution
The paper proposes HRQ, which adapts residual quantization to hyperbolic geometry, providing a more natural fit for hierarchical data and improving downstream task performance.
Findings
HRQ outperforms Euclidean RQ by up to 20% in hierarchy modeling tasks.
Embedding in hyperbolic space better captures latent hierarchical structures.
HRQ improves downstream task performance on hierarchical data.
Abstract
Hierarchical data arise in countless domains, from biological taxonomies and organizational charts to legal codes and knowledge graphs. Residual Quantization (RQ) is widely used to generate discrete, multitoken representations for such data by iteratively quantizing residuals in a multilevel codebook. However, its reliance on Euclidean geometry can introduce fundamental mismatches that hinder modeling of hierarchical branching, necessary for faithful representation of hierarchical data. In this work, we propose Hyperbolic Residual Quantization (HRQ), which embeds data natively in a hyperbolic manifold and performs residual quantization using hyperbolic operations and distance metrics. By adapting the embedding network, residual computation, and distance metric to hyperbolic geometry, HRQ imparts an inductive bias that aligns naturally with hierarchical branching. We claim that HRQ in…
Peer Reviews
Decision·Submitted to ICLR 2026
The paper introduces a novel integration of hyperbolic geometry into residual quantization, enabling discrete representations that naturally capture hierarchical relationships. Theoretical motivation is strong and well-founded, aligning hyperbolic geometry’s exponential growth with hierarchical data’s inherent structure. Experiments show consistent and significant improvements over Euclidean baselines in both supervised and unsupervised settings, demonstrating the model’s robustness and practi
The method assumes data follows a hierarchical structure, limiting its usefulness in domains without clear hierarchies. How does the model behave when applied to data lacking an explicit or strong hierarchical organization? The computational cost and stability issues of hyperbolic training are not explored, leaving efficiency and scalability uncertain. What are the practical training challenges or trade-offs when scaling HRQ to large datasets or deeper architectures?
It demonstrates empirical improvements, with HRQ tokens outperforming standard residual quantization on multiple benchmark tasks related to hierarchy modeling and discovery
Regarding the references, the authors have omitted numerous relevant citations, including HiHPQ and works on knowledge-graph representation and recommender systems, among others. For the main loss formula, the authors failed to provide proper numbering, which represents poor writing practice. The authors employ Euclidean distance for RQ in Euclidean space; however, Euclidean RQ typically utilizes dot product. Furthermore, the contrastive loss formula is not an essential component of RQ and shou
Across hierarchy modeling (WordNet) and hierarchy discovery tasks, HRQ’s tokens improve performance—up to ~20% in the supervised setting compared to RQ.
- Insufficient comparisons to strong hyperbolic baselines. Prior work on hyperbolic quantization (e.g., *HyperVQ*) is discussed but not empirically compared. As a result, the claimed advantages over stronger or more numerous hyperbolic baselines remain unsubstantiated; the study lacks both breadth in baselines and depth in experimental analysis. - Unclear motivation for hyperbolic space. The paper does not convincingly quantify *why* hyperbolic geometry is necessary here. The authors’ claim — *“
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topological and Geometric Data Analysis · Data Visualization and Analytics
