Thorup-Zwick Emulators are Universally Optimal Hopsets
Shang-En Huang, Seth Pettie

TL;DR
This paper shows that Thorup-Zwick emulators are also universal hopsets with near-optimal size, improving bounds on hopset construction and related graph sparsification structures.
Contribution
It demonstrates that Thorup-Zwick's sublinear additive emulators serve as universal hopsets and improves their size bounds with minor modifications.
Findings
Thorup-Zwick emulators are also $(O(k/eta)^k,eta)$-hopsets.
Modified construction reduces hopset size to $O(n^{1+1/(2^{k+1}-1)})$.
Results improve bounds on sublinear additive emulators and graph spanners.
Abstract
A - is, informally, a weighted edge set that, when added to a graph, allows one to get from point to point using a path with at most edges ("hops") and length . In this paper we observe that Thorup and Zwick's emulators are also actually -hopsets for every , and that with a small change to the Thorup-Zwick construction, the size of the hopset can be made . As corollaries, we also shave "" factors off the size of Thorup and Zwick's sublinear additive emulators and the sparsest known -spanners, due to Abboud, Bodwin, and Pettie.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Thorup-Zwick Emulators are Universally Optimal Hopsets††thanks: Supported by NSF Grants CCF-1514383 and CCF-1637546.
Shang-En Huang
University of Michigan
Seth Pettie
University of Michigan
Abstract
A -hopset is, informally, a weighted edge set that, when added to a graph, allows one to get from point to point using a path with at most edges (“hops”) and length . In this paper we observe that Thorup and Zwick’s sublinear additive emulators are also actually -hopsets for every , and that with a small change to the Thorup-Zwick construction, the size of the hopset can be made . As corollaries, we also shave “” factors off the size of Thorup and Zwick’s [20] sublinear additive emulators and the sparsest known -spanners, due to Abboud, Bodwin, and Pettie [1].
1 Introduction
Let be a weighted undirected graph. Define to be the length of the shortest path from to in that uses at most edges, or “hops.” Whereas is a metric, is not in general. A set of weighted edges is called a -hopset if for every ,
[TABLE]
Background.
Cohen [7] formally defined the notion of a hopset, but the idea was latent in earlier work [21, 14, 6, 18]. Cohen’s -hopset had size and . Elkin and Neiman [9] showed that a constant hopbound suffices (when are constants). In particular, their hopset has size and . Abboud, Bodwin, and Pettie [1] recently proved that the tradeoffs of [9] are essentially optimal: for any integer , any hopset of size must have , where is a constant depending only on .111Note that setting in the Elkin-Neiman construction gives , where . Thus, saving any in the exponent of the hopset increases significantly. In general, the statement of [9] obscures the nature of the tradeoff: there are not distinct tradeoffs for each , but only for . There are other constructions of hopsets [5, 11, 12, 16] that are designed for parallel or dynamic environments; their tradeoffs (between hopset size and hopbound) are worse than [7, 9] and the ones presented here. See Table 1.
Hopsets, Emulators, and Spanners.
Recall that is an undirected graph, possibly weighted. A spanner is a subgraph of such that for some nondecreasing stretch function . An emulator of an unweighted graph is a weighted edge set such that . Syntactically, the definition of hopsets is closely related to emulators. The difference is that hopsets have a hopbound constraint but are allowed to use original edges in whereas emulators must use only . The purpose of emulators is to compress the graph metric : ideally . Historically, the literature on hopset constructions [7, 9] has been noticeably more complex than those of spanners and emulators, many of which [3, 2, 8, 20, 4, 15, 1] are quite elegant. Our goal in this work is to demonstrate that there is nothing intrinsically complex about hopsets, and that a very simple construction improves on all prior constructions and matches the Abboud-Bodwin-Pettie lower bound.
New Results.
Thorup and Zwick [20] designed their emulator for unweighted graphs, and proved that it has size and a sublinear additive stretch function . In this paper we show that the Thorup-Zwick emulator, when applied to a weighted graph, produces a -hopset that achieves every point on the Abboud-Bodwin-Pettie [1] lower bound tradeoff curve. Moreover, with two subtle modifications to the construction, we can reduce the size to , shaving off a factor . Our technique also applies to other constructions, and as corollaries we improve the size of Thorup and Zwick’s emulator [20] and Abboud, Bodwin, and Pettie’s -spanners.222A -spanner of an unweighted graph is one with stretch function .
Theorem 1**.**
Fix any weighted graph and integer . There is a -hopset for with size and .
Theorem 2**.**
(cf. [20]) Fix any unweighted graph and integer . There is a sublinear additive emulator for with size and stretch function .
Theorem 3**.**
(cf. [1]) Fix any unweighted graph , integer , and real . There is a -spanner for with size , where .
Remark 1**.**
In recent and independent technical report, Elkin and Neiman [10] also observed that Thorup and Zwick’s emulator yields an essentially optimal hopset. They proposed a modification to Thorup and Zwick’s construction that reduces the size to (eliminating a factor ), but increases the hopbound from to . For example, their technique does not imply any of the improvements found in Theorems 1, 2, or 3.
2 The Hopset Construction
In this section, we present the construction of the hopset based on Thorup and Zwick’s emulator [20], then analyze its size, stretch, and hopbound.
The construction is parameterized by an integer and a set of sampling probabilities. Let be the vertex sets in each layer. For each , each vertex in is independently promoted to with probability . Thus . For each vertex and , define to be any vertex in such that . For any vertex , define to be:
[TABLE]
Note that does not exist; by convention . The hopset is defined to be , where
[TABLE]
The length of an edge in is always the distance between its endpoints. This concludes the description of the construction.
2.1 Size Analysis
The expected size of is at most , for each , and is if . Following Pettie [17], we choose such that the layers of the hopset have geometrically decaying sizes. Setting , the expected size of , for , is
[TABLE]
The expected size of is
[TABLE]
so the expected size of is at most
[TABLE]
2.2 Stretch and Hopbound Analysis
Let us first give an informal sketch of the analysis. Let be vertices. Choose an integer , and imagine dividing up the shortest – path into intervals of length , where defines one “unit” of length. Once and are fixed we prove that given any two vertices at distance at most , there is either an -hop path from to with additive stretch , or there is an -hop path from to a -vertex with length . Of course, when the set is empty, so we cannot be in the second case. Since, by definition of , , there must be an -hop path with additive stretch . In order for this stretch to be we must set .
So, to recap, the integer parameter depends on the desired stretch , and determines the hopcount sequence , which is defined inductively as follows.
[TABLE]
The parameter of the hopset is exactly . It is straightforward to show that . Once and are fixed, Theorem 4 is proved by induction.
Theorem 4**.**
For any fixed real (the “unit”), for all and any pair such that , at least one of the following statements holds.
- (i)
, 2. (ii)
There exists such that .
Proof.
The proof is by induction on . In the base case and . Let with . If then so (i) holds. Otherwise, , meaning . If then , and if then , so . In either case, (ii) holds.
Now assume . Consider vertices with and let be a shortest – path in . Then, as shown in Figure 1, we partition into at most segments , , , as follows. Starting at , we pick to be the farthest vertex on such that , and let be the next edge on the path.333Note that if the first edge has length more than , then . Repeat the process until we reach , oscillating between selecting segments that have length at most and single edges.
- •
Multi-hop segment: the shortest path from to satisfies .
- •
Single-hop segment: the segment is actually an edge .
By the induction hypothesis, each multi-hop segment satisfies (i) or (ii) within hops. Moreover, in each greedy iteration the sum of the lengths from picked multi-hop segment and immediately followed single-hop segment is strictly greater than except the last one. Therefore, by the pigeonhole principle, there are at most multi-hop segments on and at most single-hop segments on .
If condition (i) holds for all multi-hop segments, then in at most hops,
[TABLE]
and condition (i) holds for .
Otherwise, condition (i) does not hold for at least one multi-hop segment. Consider the first multi-hop segment and the last multi-hop segment that do not satisfy condition (i). By condition (ii), there exist and satisfying
[TABLE]
Now we have two cases depending on whether or not. If , then by the triangle inequality, we can get from to with hops and additive stretch
[TABLE]
We know there are a total of at most multi-hop segments satisfying condition (i). Hence, within at most hops, we can get from to with additive stretch
[TABLE]
and condition (i) holds for in this case.
On the other hand, suppose that . Since both but , we know that must exist with . Hence, we can get from to via an -hop path with length
[TABLE]
Similar to the previous case, there are at most multi-hop segments appeared before , and all of them are satisfying condition (i). Hence, the surplus
[TABLE]
Therefore, in at most hops,
[TABLE]
∎
Proof of Theorem 1.
Fix and . Define . Notice that . Set and . By Theorem 4, since , condition (i) must hold: within hops we have
[TABLE]
∎
Observe that if we set the size becomes linear.
Corollary 1**.**
Every -vertex graph has an -size -hopset with and .
3 Conclusion
In this paper our goal was to demonstrate that hopset constructions need not be complex, and that optimal hopsets can be constructed with a simple and elegant algorithm, namely a small modification to Thorup and Zwick’s emulator construction [20]. From a purely quantitative perspective our hopsets also improve on the sparseness and/or hopbound of other constructions [7, 9, 10]. As a happy byproduct of our construction, we also shave small factors off the best sublinear additive emulators [20] and -spanners [1].
We now have a good understanding of the tradeoffs available between and the hopset size when the stretch is fixed at , being a small real. However, when or is large, there are still gaps between the best upper and lower bounds. For example, when a trivial hopset444Let be a clique on a set of vertices chosen uniformly at random. has size with . A construction of Hesse [13] (see also [1, §6]) implies that must be at least for some , but it is open whether -size hopsets exist with . At the other extreme, Thorup and Zwick’s distance oracles imply that -size hopsets exist with and stretch . Is this tradeoff optimal? Are there other tradeoffs available when is a fixed constant (say 3 or 4), independent of ?
Acknowledgement.
Thanks to Richard Peng for help with the references for zero-stretch hopsets.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Abboud, G. Bodwin, and S. Pettie. A hierarchy of lower bounds for sublinear additive spanners. In Proceedings 28th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA) , pages 568–576, 2017.
- 2[2] D. Aingworth, C. Chekuri, P. Indyk, and R. Motwani. Fast estimation of diameter and shortest paths (without matrix multiplication). SIAM Journal on Computing , 28(4):1167–1181, 1999.
- 3[3] I. Althöfer, G. Das, D. Dobkin, D. Joseph, and J. Soares. On sparse spanners of weighted graphs. Discrete and Computational Geometry , 9:81–100, 1993.
- 4[4] S. Baswana, T. Kavitha, K. Mehlhorn, and S. Pettie. Additive spanners and ( α , β ) 𝛼 𝛽 (\alpha,\beta) -spanners. ACM Trans. Algorithms , 7(1), 2010.
- 5[5] A. Bernstein. Fully dynamic ( 2 + ϵ ) 2 italic-ϵ (2+\epsilon) -approximate all-pairs shortest paths with fast query and close to linear update time. In Proceedings 50th Annual IEEE Symposium on Foundations of Computer Science (FOCS) , pages 693–702, 2009.
- 6[6] E. Cohen. Using selective path-doubling for parallel shortest-path computations. Journal of Algorithms , 22(1):30–56, 1997.
- 7[7] E. Cohen. Polylog-time and near-linear work approximation scheme for undirected shortest paths. J. ACM , 47(1):132–166, 2000.
- 8[8] D. Dor, S. Halperin, and U. Zwick. All-pairs almost shortest paths. SIAM Journal on Computing , 29(5):1740–1759, 2000.
