Tight Space-Approximation Tradeoff for the Multi-Pass Streaming Set Cover Problem
Sepehr Assadi

TL;DR
This paper establishes tight bounds on the space-approximation tradeoff for the streaming set cover problem, showing that achieving an a-approximation requires nearly linear space, even with multiple passes and random stream order.
Contribution
It provides the first tight lower bounds matching existing algorithms for space-approximation tradeoffs in streaming set cover, resolving a key open problem.
Findings
Any a-approximation needs e ilde{ ext{Omega}}(mn^{1/a}) space.
The bounds hold even with polylogarithmic passes and random stream order.
Results extend to tight bounds for streaming maximum coverage approximation.
Abstract
We study the classic set cover problem in the streaming model: the sets that comprise the instance are revealed one by one in a stream and the goal is to solve the problem by making one or few passes over the stream while maintaining a sublinear space in the input size; here denotes the number of the sets and is the universe size. Notice that in this model, we are mainly concerned with the space requirement of the algorithms and hence do not restrict their computation time. Our main result is a resolution of the space-approximation tradeoff for the streaming set cover problem: we show that any -approximation algorithm for the set cover problem requires space, even if it is allowed polylog passes over the stream, and even if the sets are arriving in a random order in the stream. This space-approximation tradeoff matches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Cryptography and Data Security · Stochastic Gradient Optimization Techniques
Tight Space-Approximation Tradeoff for the Multi-Pass Streaming Set Cover Problem
Sepehr Assadi
University of Pennsylvania
[email protected] Supported in part by National Science Foundation grants CCF-1552909, CCF-1617851, and IIS-1447470.
Abstract
We study the classic set cover problem in the streaming model: the sets that comprise the instance are revealed one by one in a stream and the goal is to solve the problem by making one or few passes over the stream while maintaining a sublinear space in the input size; here denotes the number of the sets and is the universe size. Notice that in this model, we are mainly concerned with the space requirement of the algorithms and hence do not restrict their computation time.
Our main result is a resolution of the space-approximation tradeoff for the streaming set cover problem: we show that any -approximation algorithm for the set cover problem requires space, even if it is allowed passes over the stream, and even if the sets are arriving in a random order in the stream. This space-approximation tradeoff matches the best known bounds achieved by the recent algorithm of Har-Peled et al. (PODS 2016) that requires only passes over the stream in an adversarial order, hence settling the space complexity of approximating the set cover problem in data streams in a quite robust manner. Additionally, our approach yields tight lower bounds for the space complexity of -approximating the streaming maximum coverage problem studied in several recent works.
1 Introduction
The set cover problem is one of the most fundamental optimization problems in computer science, with a wide range of applications in various domains including data mining and information retrieval [2, 47], web host analysis [20], operation research [30], and many others. In this problem, we are given a collection of sets from a universe and the goal is to output a smallest number of sets whose union is , or in other words, cover the universe. The set cover problem is one of Karp’s original 21 NP-hard problems [38]. A simple greedy algorithm that iteratively picks the set that covers the most number of uncovered elements achieves a -approximation [36, 48] and this is best possible unless [24, 28, 41, 43].
The aforementioned results focus on the tradeoff between approximation guarantee and time complexity of the set cover problem. Nevertheless, in many settings, space complexity of the algorithms is crucial to optimize. A canonical example is in applications in big data analysis: in such settings, one would like to design algorithms capable of processing massive datasets using only few passes over the input and limited space. The well-established streaming model of computation [1, 44] precisely captures this setting.
In the streaming set cover problem, originally introduced by Saha and Getoor [47], the input sets are provided one by one in a stream and the algorithms are allowed to make a small number of passes over the stream while maintaining a sublinear space for processing the stream. The streaming set cover problem and the closely related maximum coverage problem have received quite a lot of attention in recent years [47, 21, 26, 23, 5, 34, 32, 18, 3, 42, 9]; we refer the reader to [3, 42] for a comprehensive summary of these results.
Particularly relevant to our work, Demaine et al. [23], have shown an -approximation algorithm that uses passes over the stream and needs space. Recently, Har-Peled et al. [32] provide a significant improvement over this algorithm: they developed an -approximation, -pass streaming algorithm that requires space. They further conjectured that the tradeoff between the number of passes and the space in their algorithm is almost tight: this is supported by a lower bound of space for -pass streaming algorithms that compute an exact set cover solution [32].
Notice however that the algorithm of [32] (and [23]) exhibits a somewhat unusual behavior: allowing a larger number of passes over the stream results in a weaker approximation guarantee obtained by the algorithm. This highlights the following natural question: can we achieve a (fixed) constant approximation in -passes and space? (a recent algorithm of Bateni et al. [9] achieves a fixed -approximation within these bounds.) In general, what is the space-approximation tradeoff for the streaming set cover problem if we consider algorithms that are allowed a relatively small number of passes, say up to , over the stream? This is precisely the question addressed in this work.
1.1 Our Contributions
Our main result is a tight resolution of the space-approximation tradeoff for the streaming set cover problem:
Result 1** (Main result, formalized as Theorem 1).**
Any streaming -approximation -pass algorithm for the set cover problem requires space even on random arrival streams. This lower bound applies even for the weaker goal of estimating the optimal value of the set cover instance (as opposed to finding the actual sets that cover the universe).
Prior to our work, the best known lower bounds for randomized multi-pass streaming algorithms ruled out the possibility of -approximation in passes and space [45], and exact solution in passes and space [32] (the later holds only if ). These results left open the possibility of obtaining, say, a -approximation in two passes or even an exact answer in passes and space. On the other hand, Result 1 smoothly extends the bounds in [45] to the whole range of approximation factors , proving the first super-linear in lower bound for approximating set cover in multi-pass streams. It also significantly improves the bounds in [32] to (and all range of ) for pass streaming algorithms that recover an exact answer111Note that this result also implies that the “right” tradeoff between space and number of passes for obtaining an exact solution to the streaming set cover is in fact linear as opposed to exponential, i.e., as opposed to , as was previously shown in [32]..
As mentioned earlier, Har-Peled et al. [32] designed an -approximation algorithm for the set cover problem that requires space (for some unspecified constant larger than in the -notation in the exponent). We can show that with proper modifications, this algorithm in fact only requires space (see Theorem 2), hence proving a tight upper bound for Result 1 (up to logarithmic factors). These results together resolve the space-approximation tradeoff for streaming set cover problem in multi-pass streams. It is worth mentioning that the space-approximation tradeoff for single-pass streaming algorithms of set cover has been previously resolved in [3].
Finally, we point out that the lower bound in Result 1 is quite robust in the sense that it holds even when the sets are arriving in a random order. This is particularly relevant to the streaming set cover problem as most known techniques for this problem are based on element and set sampling and a-priori one may expect that random arrival streams can facilitate the use of such techniques, resulting in better bounds than the ones achievable in adversarial streams. We point that in general, many streaming problems are known to be distinctly easier in random arrival streams compared to adversarial streams (see, e.g., [31, 39, 37]).
We further show an application of our techniques in establishing Result 1 to the streaming maximum coverage problem that has been studied in several recent works [47, 4, 5, 42, 9, 27, 19]. In this problem, we are given a collection of sets from a universe and an integer , and the goal is to find sets that cover the most number of elements in . We prove that,
Result 2** (Formalized as Theorem 4).**
Any streaming -approximation -pass algorithm for the maximum coverage problem requires space even on random arrival streams. This lower bound applies even for the case .
Single-pass -approximation algorithms for this problem that use, respectively, space and have been proposed recently in [42, 9], and [9]. Our Result 2 is hence tight for any (up to logarithmic factors) and within an factor of the best upper bound for the larger values of .
McGregor and Vu [42] have very recently proved an lower bound for -pass streaming algorithms that approximate the maximum coverage problem to within a factor better than (a single-pass -approximation algorithm in space is also developed in [42, 9]). The importance of Result 2 is thus in establishing the tight dependence on the parameter for this problem. This is important as -approximation algorithms for this problem for very small values of , i.e., , are typically used as a sub-routine in approximating the streaming set cover problem in multiple passes [23, 32, 9] (see Section 3.4 for more details).
En route, we also obtain the following result which may be of independent interest: the communication complexity of computing an exact solution to the set cover problem or the maximum coverage problem in the two-player communication model is bits (see Theorems 3 and 5). This improves upon the previous lower bounds of Nisan [45] (for set cover) and McGregor and Vu [42] (for maximum coverage). The two-player communication model for set cover has also been studied in [3, 18, 23, 32].
We conclude this section by highlighting the following important aspect of our lower bounds.
Remark 1.1**.**
In the hard instances we consider in proving Results 1 and 2, the minimum set cover size and the parameter in maximum coverage are small constants and hence these instances admit a trivial poly-time algorithm in the classical (offline) setting. Our results hence establish the “hardness” of these instances under the space restrictions of the streaming model, independent of the NP-hardness of approximating these problems.
1.2 Technical Overview
We focus here on providing a technical overview of the proof of Result 1 - Result 2 is also proven along similar lines. The starting point of our work is [3], which proved a tight space lower bound for single-pass streaming algorithms of set cover by analyzing the one-way communication complexity of this problem (see Section 2 for details on communication complexity).
The overall approach of [3] can be summarized as follows. Consider a communication problem whereby Alice is given a collection of sets , Bob is given a set , and they need to compute an -approximation of the set cover instance in the one-way communication model. The input to the players are correlated in that there exists a set in Alice’s collection which together with Bob’s set cover the whole universe except for a single element. However, if the content of the set is unknown to Bob, i.e., Alice’s message does not reveal almost all , Bob needs to cover (which is a subset of except for one element) with sets other than to ensure that the single element outside is covered. The collection is designed to satisfy the so-called -covering property [41] that states that no small collection of ’s set can cover another set entirely222It is worth mentioning that essentially all known lower bounds for the streaming set cover problem, on their core, are based on some variant of this -covering property; see [18] for more details., hence forcing Bob to use many sets to cover the universe. The authors then use the information complexity paradigm to reduce the set cover problem on this distribution to multiple instances of a simpler problem (called the Trap problem) and prove a lower bound for this new problem.
In this paper, we extend this approach to lower bound the two-way communication complexity of the set cover problem and ultimately obtain the desired lower bound in Result 1 for multi-pass streaming algorithms. To do this, we need to address the following issues:
First, the type of distribution used in [3] is clearly not suitable for proving lower bounds in the two-way model. In particular, we need a distribution with both Alice and Bob having sets and additionally, no clear “signal” to either party as which of the sets are more important, i.e., correspond to the sets and in the above distribution. To achieve this, we employ the -covering property in a novel way: we first design a collection of sets such that no collection of sets ’s can cover the universe unless they contain a single set which is in fact equal to already (for remaining sets , we have ). Next, we decompose each into two sets and and provide Alice with , and Bob with . This way, the sets and form a set cover of size two, and the -covering property ensures that no other collection of pairs can cover the universe; we further prove that “mix and matching” the sets (i.e., picking but not or vice versa) in the solution is not helpful either, hence implying that any -approximation algorithm for set cover needs to find the sets and .
The next step is to prove the lower bound for the above distribution. Unlike the lower bound in the one-way model that was based on hiding the content of the set , here we need to argue that in fact the index itself is hidden from the players (as otherwise, one more round of communication can reveal the content of the sets and as well). Similar to [3], we also use the information complexity paradigm to prove the communication lower bound for this distribution. We embed different instances of the well-known set disjointness problem in each pair such that all embedded instances are intersecting except for the instance for and which is disjoint. As we seek a direct-sum style argument for two-way protocols, we need a more careful argument than the one in [3] that was tailored for one-way protocols. In particular, we now use the notion of internal information complexity (as opposed to external information complexity used in [3]) that allows us to use the powerful techniques developed in [8, 10, 13] to obtain the direct-sum result.
Finally, we need to lower bound the information complexity of the set disjointness problem on the specific distribution induced by the set cover instances. The set cover distribution is designed in a way to ensure that the distribution of underlying set disjointness instances matches the known hard input distributions for this problem. However, there is a subtlety here; known information complexity lower bounds for set disjointness (that we are aware of) are all over distributions that are supported only on disjoint sets, i.e., Yes-instances of the problem (see, e.g., [6, 11, 49])333We remark that this is not just a coincidence and in fact is crucial for performing the typical reduction to the AND problem used in proving the lower bound for set disjointness, see, e.g., [49] for more details.. However, for our purpose, we need to lower bound the information cost of set disjointness protocols on distributions that are intersecting. We achieve this using an application of the “information odometer” of [14] (and subsequent work in [29]) to relate the information cost of the protocols on Yes and No instances of the problem together and obtain the result.
We are not done though, as we seek a lower bound for random arrival streams and for this, we extend the previous communication complexity lower bound to the case when the input sets are partitioned randomly across the players, in a similar way as done in previous work [3] (itself based on [15]). There are however some technical differences needed to execute this approach in our two-way communication model in compare to the one-way model in [3] (see Lemma 3.7 for details).
2 Preliminaries
Notation.
For any integer , we let . We say that a set with is a -subset of . For a -dimensional tuple and index , we define and .
We use capital letters to denote random variables. For a random variable , denotes the support of and . We use to mean that the random variables and are independent conditioned on . The notation indicates that is chosen uniformly at random from the set .
We denote the Shannon Entropy of a random variable by and the mutual information of two random variables and by . If the distribution of the random variables is not clear from the context, we use (resp. ). Appendix A summarizes the relevant information theory tools that we use in this paper.
Concentration bounds.
We use the following standard version of Chernoff bound (see, e.g., [25]).
Proposition 2.1**.**
Let be independent random variables taking values in and let . Then, for any ,
[TABLE]
We also prove the following useful auxiliary lemma that upper bounds the number of elements that a collection of large random sets can cover.
Lemma 2.2**.**
Let be a collection of -subsets of that are chosen independently and uniformly at random. Suppose is another set chosen independent of ; if , then,
[TABLE]
We first briefly explain the bounds in Lemma 2.2. Note that each element , is not covered by a set w.p. (as is a random set of size ). Moreover, since the sets are chosen independent of each other, the probability that is not covered by is . Hence, in expectation elements in are not covered by . We then wish to argue, by means of some concentration bound, that with a very high probability the number of elements not covered by is at least half of this number (notice that the bounds in the lemma statement are quite similar but not exactly equal to this quantity).
However, there is an important subtlety here. The random variables defined in the above process are negatively correlated and hence one cannot readily use a Chernoff-Hoeffding bound (or even similar variants defined for negatively correlated random variables) to bound this probability. This is because we need to bound the probability of the sum of these random variable being too small as opposed to being too large which already follows from known results (see, e.g., [46, 33])444Note that in general, Chernoff bound type inequalities do not hold for bounding the sum of negatively random variables from below.. In the following, we show how to get around this using a careful coupling argument.
Proof of Lemma 2.2.
For any element , define the random variable which is iff . Define ; notice that denotes the number of elements in that are not covered by . Our goal is then to lower bound the value of . Note that the random variables are negatively correlated and hence, as stated earlier, we cannot use Chernoff bound (or its generalizations to negatively correlated random variables) to lower bound the value of .
To get around this, we slightly change the distribution each set is chosen from, prove the result in that case, and then relate that distribution to the original distribution of the sets in . Formally, let be the distribution of from which the sets in are chosen. Consider the following distribution : we create each set (for ) by removing each element in from independently and uniformly at random w.p. .
We lower bound the value of the random variable under this new distribution. We first have,
[TABLE]
For simplicity, define . An important property of is that now all random variables are independent of each other. Hence, we can apply Chernoff bound as follows,
[TABLE]
We now argue that is in fact very close to .
Fix a set . For each , define a random variable which is iff . Let , i.e., the number of elements missing from . Note that . Under the distribution , for each set , each element belongs to independently; hence a simple application of Chernoff bound ensures that:
[TABLE]
Define as the event that all sets has size at least ; by Eq (2) and a union bound, (as ). Notice that to sample a set system from , we can first sample a set system from and then make the size of each set exactly equal to by removing the extra elements uniformly at random; this process does not increase the coverage of the original set system sampled from (or equivalently decrease the value of ). Hence,
[TABLE]
By substituting the value of , we obtain the desired bound.
2.1 Communication Complexity and Information Complexity
Communication complexity and information complexity play an important role in our lower bound proofs. We now provide necessary definitions for completeness.
Communication complexity.
Our lowers bounds for streaming algorithms are established via communication complexity lower bounds. We use standard definitions of the two-party communication model introduced by Yao [50]; see [40] for an extensive overview of communication complexity.
Let be a relation with domain . Alice receives an input and Bob receives , where are chosen from a joint distribution over . They communicate with each other by exchanging messages such that each message depends only on the private input of the player sending the message and the already communicated messages. The last message communicated is the answer such that . We allow players to have access to both public and private randomness.
We use to denote a protocol used by the players. We always assume that the protocol can be randomized (using both public and private randomness), even against a prior distribution of inputs. For any , we say is a -error protocol for over a distribution , if the probability that for an input , outputs some where is at most (the probability is taken over the randomness of both the distribution and the protocol).
Definition 1**.**
*The communication cost of a protocol for a problem on an input distribution , denoted by , is the worst-case bit-length of the transcript communicated between Alice and Bob in the protocol , when the inputs are chosen from .
The communication complexity of a problem with respect to a distribution is the minimum communication cost of a -error protocol over .*
Information complexity.
There are several possible definitions of information complexity of a communication problem that have been considered depending on the application (see, e.g., [7, 8, 13, 17, 6]). We use the notion of internal information complexity [8] that measures the average amount of (Shannon) information each player learns about the input of the other player by observing the transcript of the protocol. Formally,
Definition 2**.**
*Consider an input distribution and a protocol (for some problem ). Let be the input of Alice and Bob and assume denotes the transcript of the protocol concatenated with the public randomness used by . The (internal) information cost of a protocol with respect to is then .
The information complexity of with respect to a distribution is the minimum taken over all -error protocols for over .*
Note that any public coin protocol is a distribution over private coins protocols, obtained by first using public randomness to sample a random string and then running the corresponding private coin protocol . We also use to denote the transcript of the protocol . We have the following well-known claim.
Claim 2.3**.**
For any distribution and any protocol , let be the public randomness used in ; then, .
Proof.
[TABLE]
The last equality is because since and Fact A.1-(2).
The following well-known proposition relates communication complexity and internal information complexity (see, e.g., [13] for a proof).
Proposition 2.4**.**
For any distribution and any protocol : . Moreover, for any parameter : .
2.2 The Set Disjointness Problem
We shall use the well-known set-disjointness communication problem (denoted by Disj) in proving Result 1. Fix an integer ; in , Alice and Bob are given two sets and , and their goal is to return Yes if and No otherwise.
The following is a known hard distribution for .
**Distribution . **A hard input distribution for .
•
Start with .
•
For each element independently: w.p. drop from both and , w.p. drop from , and w.p. drop from .
•
Pick uniformly at random. If , pick a uniformly at random element and let and both contain (if , keep the sets as before).
We further use and to denote, respectively, the distribution of Yes and No instances of Disj on ; in other words, and .
The following proposition on the information complexity of Disj is well-known (see, e.g., [6, 11]).
Proposition 2.5**.**
For any and any -error protocol of on the distribution ,
[TABLE]
3 The Space-Approximation Tradeoff for Set Cover
We prove our main result on the space-approximation tradeoff for the streaming set cover problem in this section. Formally,
Theorem 1**.**
For any , , and , any randomized algorithm that can make passes over any collection of subsets of presented in a random order stream and outputs an -approximation to the optimal value of the set cover problem w.p. larger than (over the randomness of both the stream order and the algorithm) must use space.
Theorem 1 formalizes Result 1 in the introduction. We further prove that the tradeoff achieved in Theorem 1 is in fact tight up to logarithmic factors; this is achieved by performing some proper modifications to the algorithm of [32]. Formally,
Theorem 2**.**
There exists a streaming algorithm that for any integer , and any parameter , with high probability, computes an -approximation to the streaming set cover problem using passes over the stream in adversarial order and space.
We emphasize that the main contribution of the paper is in proving Theorem 1; we mainly present Theorem 2 to prove a matching upper bound on the bounds in Theorem 1, hence establishing a tight space-approximation tradeoff for the streaming set cover problem.
The rest of this section is mainly devoted to the proof of Theorem 1. We start by introducing some notation. In Section 3.1, we introduce a hard input distribution for the set cover problem in adversarial streams. We prove a lower bound for this distribution in Section 3.2. We extend this lower bound to random arrival streams in Section 3.3 and finish the proof of Theorem 1. Section 3.4 contains the proof of Theorem 2.
Notation.
To prove Theorem 1, we prove a lower bound on the communication complexity of the set cover problem: Fix a (sufficiently large) value for , , and ; in this section, SetCover refers to the problem of -approximating the optimal value of the set cover problem with sets555To simplify the exposition, we use instead of as the number of sets. defined over the universe in the two-player communication model, whereby the sets are partitioned between Alice and Bob.
3.1 A Hard Input Distribution for SetCover
Let be an integer to be determined later; we use the distribution for (introduced in Section 2.2) to design a hard input distribution for SetCover. Before that, we need a simple definition.
Definition 3** (Mapping-extension).**
For the two sets and , we define a mapping-extension of to as a function , whereby for each , is mapped to unique elements in . Similarly, for any set , we abuse the notation and define .
We are now ready to define our hard input distribution for SetCover.
**Distribution . **A hard input distribution for SetCover.
Notation. Let and be the set of all mapping-extensions of to .
•
For each :
–
Let for and pick uniformly at random.
–
Let and .
•
Pick uniformly at random. If , do nothing, otherwise:
–
Sample uniformly at random.
–
Resample for and redefine and as before using the new pair .
•
Let the input to Alice and Bob be and , respectively.
In the following, we use to denote any set in , i.e., when it is not relevant whether it belongs to or . For a collection of sets , we use to denote the set of elements that covers, i.e., . We say that is a singleton-collection, if for any , at least one of or is not present in . In contrast, we say that is a pair-collection, if for all , iff as well.
Remark 3.1**.**
A few remarks are in order:
- (i)
W.h.p., for any , and .
(Proof. follows from the definition of the distribution and Chernoff bound). 2. (ii)
For any , conditioned on , the set is chosen uniformly at random from all -subsets of ; similarly for 3. (iii)
*For any , . Moreover, whenever , the set is a -subset of chosen uniformly at random. *
(Proof. the first part follows from the fact that maps each to unique elements; the second part is by the random choice of and the fact that in this case). 4. (iv)
Whenever , for any , the sets and are chosen independent of each other ().
Let \textnormal{\mbox{opt}}(\SS,\mathcal{T}) denote the size of an optimal set cover in the instance . It follows from Remark 3.1-(iii) that whenever in the distribution , \textnormal{\mbox{opt}}(\SS,\mathcal{T})=2; simply take and and since , they cover the whole universe. In the following, we prove that when , \textnormal{\mbox{opt}}(\SS,\mathcal{T}) is relatively large. This implies that any -approximation protocol for SetCover has to essentially determine the value of . In the next section, we prove that this task requires a large communication by the players.
Lemma 3.2**.**
For :
[TABLE]
Proof.
Let be any collection of sets from . We bound the probability that covers the universe entirely, i.e., is a feasible set cover, and then use a union bound on all possible choices for to finalize the proof. In the following, we condition on the event that states that and for all (which happens with probability by Remark 3.1-(i)).
Partition the collection into a pair-collection , and a singleton-collection (this partitioning is always possible and unique by definition). We first lower bound the number of elements that are not covered by the singleton-collection:
Claim 3.3**.**
.
Proof.
Let ; clearly . Without loss of generality, we assume that . By conditioning on the event and Remark 3.1-(ii), we know that each is an -subset of , for some , chosen uniformly at random from all -subsets of . Again without loss of generality, we simply increase the size of each so that they all have size exactly . Moreover, since no two sets and are both simultaneously present in , by Remark 3.1-(iv), all sets in are chosen independent of each other.
Consequently, by Lemma 2.2, for , , and collection , we have,
[TABLE]
A simplification of the above equation, plus using the fact that , and hence , proves the final result.
Let be the event that ; in the following, we condition on this event. Now consider the sets in the pair-collection . For any pair , we define . Note that there are at most different possible sets . By Remark 3.1-(iii), the sets ’s are random sets of size , and by Remark 3.1-(iv), they are chosen independent of each other. By Lemma 2.2, for , , and collection of sets ’s, we have,
[TABLE]
We can now conclude,
[TABLE]
proving the lemma.
3.2 The Lower Bound for the Distribution
Throughout this section, fix as a -error protocol for SetCover on the distribution . We first show that protocol is essentially solving copies of the problem on the distribution (for the parameter in the distribution ) and then use a direct-sum style argument (similar in spirit to the ones in [8, 10, 13]) to argue that the information cost of shall be times larger than the information complexity of solving . However, to make the direct-sum argument work, we can only consider on the distribution , i.e., when all underlying instances are sampled from . Consequently, we can only lower bound the information cost of based on the information complexity of on the distribution .
Lemma 3.4**.**
There exists a -protocol for on the distribution such that:
. 2. 2.
.
Proof.
We design the protocol as follows:
**Protocol . **The protocol for solving using a protocol for SetCover.
Input: An instance . Output: Yes if and No otherwise.
Using public randomness, the players sample an index and mapping-extensions independently and uniformly at random from .
Using public randomness, the players sample the sets and each from independently.
Using private randomness, Alice samples the sets such that (for all ); similarly Bob samples the sets .
The players construct the collections and by setting and (exactly as in distribution ).
The players solve the SetCover instance using and output No iff estimates \textnormal{\mbox{opt}}(\SS,\mathcal{T})\leq 2\alpha and Yes otherwise.
It is easy to see that the distribution of instances created in the protocol matches the distribution for SetCover exactly. Moreover, by Lemma 3.2, \textnormal{\mbox{opt}}(\SS,\mathcal{T})>2\alpha w.p. , whenever and \textnormal{\mbox{opt}}(\SS,\mathcal{T})=2 whenever . Consequently, since is an -approximation protocol,
[TABLE]
and hence is indeed a -error protocol for Disj on the distribution . Moreover, it is clear that the communication cost of is at most the communication cost of . We now prove the bound on the information cost of this protocol.
Our goal is to bound the information cost of whenever the instance is sampled from . Let be a random variable denoting the tuple , be a random variable for and be the set of public randomness used by the players. By Claim 2.3,
[TABLE]
We now bound the first term in the RHS above (the second term can be bounded exactly the same).
[TABLE]
where the last equality is true since conditioned on , all sets (for ) are chosen from and hence are independent of the event666We point out that this is the exact reason we need to consider information cost of on (instead of ) as otherwise ’s are not independent of and hence this equality would not hold.. Define and ; we can further derive,
[TABLE]
where the second last equality is because (resp. ) and (resp. ) determine each other conditioned on , and last equality is because the distribution of set cover instances and the messages communicated by the players under and under exactly matches.
Moreover,
[TABLE]
By performing the same exact calculation for , we obtain that,
[TABLE]
where in the last inequality we used the fact that information cost of is at least . This finalizes the proof of the lemma.
Recall that in Lemma 3.4, we bound the information cost of on the distribution (as opposed to ); in the following we prove that this weaker bound is still sufficient for our purpose.
Lemma 3.5**.**
For any , any -error protocol for on with has
[TABLE]
By Proposition 2.5, any -error protocol for (with ) on has (notice again that the information cost is measured on the distribution ). From this, it is also easy to obtain that . However, to prove Lemma 3.5, we need to lower bound the information cost of under the distribution .
To achieve this, we can relate the information costs and to each other. The goal is to argue that if there is a large discrepancy in the information cost of on and , then the information cost of the protocol itself can be used to distinguish between these two cases. We can achieve this goal using an elegant construction of an “information odometer” by [14]; informally speaking, the odometer allows the players to “keep track” of the amount of information revealed in a protocol (i.e., the information cost of the protocol), while incurring a relatively small additional information cost overhead.
Intuitively, we can use the odometer to argue that as follows: suppose towards a contradiction that for some and consider a new protocol for Disj on which runs and the information odometer for in parallel. Whenever the odometer estimates the information cost of to be larger than (for some sufficiently large constant ), the players terminate the protocol and declare that the answer for Disj is No (as information cost of on is typically not much more than , while its information cost on is ). If the cost is not estimated more than by the end of the protocol, the players output the same answer as in . As the information cost of the information odometer itself is bounded by , this results in protocol to have , a contradiction. This argument was first made explicit in [29].
Lemma 3.6** (Lemma 15 in [29]).**
Fix any function , constants , input distribution , and define . For every -error protocol for on , there exists an -error protocol for on such that:
[TABLE]
We are now ready to prove Lemma 3.5.
Proof of Lemma 3.5.
Let be any -error protocol for Disj on for . We first prove that using the fact that as follows:
[TABLE]
Now suppose towards a contradiction that is . We can then apply Lemma 3.6 for the function , and to obtain a protocol with which is ; a contradiction.
We now conclude,
Theorem 3**.**
For any constant , , and ,
[TABLE]
Proof.
Let and suppose towards a contradiction that there exists a -error protocol for SetCover on the distribution with ; by Proposition 2.4, also. By Lemma 3.4, this implies that there exists a -error protocol for Disj on the distribution such that , and (since and ). However, this is in contradiction with Lemma 3.5, implying that , hence proving the theorem.
As a corollary of Theorem 3, we have that the space complexity of any -approximation streaming algorithm for set cover that uses passes on adversarial streams is . In the next section, we extend this result to random arrival streams and complete the proof of Theorem 1.
3.3 Proof of Theorem 1
The distribution used in the previous section is quite “adversarial” and as such is not suitable for proving the lower bound for random arrival streams. In order to prove the lower bound in Theorem 1 for random arrival streams, we need to relax the adversarial partitioning of the sets in the distribution to a randomized partition.
**Distribution . **A random partitioning of the distribution
•
Sample the collections .
•
Assign each set in to Alice w.p. and the remainings to Bob.
We show that even this seemingly easier distribution still captures all the “hardness” of distribution . Formally,
Lemma 3.7**.**
For any constant , , and ,
[TABLE]
Proof.
Let and be the collections of sets sampled from in the distribution . For a sampled instance in , we say that the index is good iff is given to one player and to another. Let be the collection of all good indices. The index is chosen independent of the random partitioning in , and hence the probability that is exactly . Let denote the event that and . We have,
[TABLE]
where the last inequality is by Chernoff bound. Now fix a -error protocol for SetCover on the distribution . Then,
[TABLE]
This in particular implies that there exists a set with , such that conditioned on the set of good indices being and conditioned on , the probability that errs is at most . Note that conditioned on the aforementioned events, the index is chosen from uniformly at random. This implies that the distribution of the input given to Alice and Bob limited to the sets in matches the distribution (with the number of the sets being instead of ). We can then use this to embed an instance of SetCover over the distribution into the sets and obtain a protocol for .
More formally, the protocol works as follows: Given an instance sampled from (with ), Alice and Bob use public coins to complete their input (i.e., increase the number of the sets to ) by sampling from the distribution conditioned on (this is possible without any communication as the sets outside are sampled independent of the sets in ). The players then run the protocol on this new instance and return the same answer as this protocol. As the distribution of the SetCover instances sampled in the protocol matches the distribution conditioned on and , by Eq (3), the probability that errs is at most . Since , we obtain a -error protocol for SetCover on the distribution with sets and universe of size , for a constant . Consequently, by Theorem 3, , proving the lemma.
We are now ready to prove Theorem 1.
Proof of Theorem 1.
Fix a -pass -space streaming algorithm for the set cover problem over random arrival streams that outputs an -approximation w.p. at least for . One can easily turn into a -error protocol for SetCover on the distribution : Alice and Bob take a random permutation of their inputs and then treat their combined input as a set stream and run on that. The random partitioning of the input plus the random permutation taken by the players ensure that the constructed stream is a random permutation of the input sets. Consequently, this protocol is a -error protocol for SetCover on that uses bits of communication. Since , by Lemma 3.7, , proving the theorem.
3.4 An -Approximation Algorithm for the Streaming Set Cover Problem
In this section, we prove the optimality of the lower bound in Theorem 1 by establishing a matching upper bound (i.e. Theorem 2). As stated earlier, our algorithm is a simple modification of the algorithm of [32]. In particular, we obtain our improved algorithm by using a one-shot pruning step as opposed to the iterative pruning of [32], and employing a more careful element sampling (compare the bounds in Lemma 3.12 in this paper with Lemma 2.5 in [32]).
In the following, we assume that we are given a value which is a -approximation of opt, i.e., the optimal solution size of the given instance. This is without loss of generality as we can run the algorithm in parallel for guesses for and return the smallest computed set cover among all parallel runs.
The general idea behind the algorithm is as follows: we know that sets are enough to cover the whole universe ; hence, if we find a -approximate -cover of the input sets for the parameter and , we can reduce the number of uncovered elements by a factor of . Repeating this process times then results in a collection of at most sets that covers the whole universe, i.e., an -approximate set cover. It is worth mentioning that this is the general principle behind most (but not all) streaming algorithms for set cover, see, e.g. [32, 9, 47, 23].
Notice that we can readily use the maximum coverage streaming algorithms of [42, 9] as a sub-routine to find the approximate -cover above; however, doing so would result in a sub-optimal algorithm for set cover as these algorithms have space dependence of (at least) (even ignoring the dependence on , i.e., ). In fact, as we prove in the next section (see Result 2), any -approximate -cover algorithm needs space in general. To bypass this, we crucially use the fact that the aforementioned maximum coverage instances have the additional property that the optimal answer is the whole universe and hence the element sampling technique of [32] (and similar ones in [42, 9]) can be improved for this special case. We now provide the formal description of the algorithm.
**Algorithm 1. **An -approximation algorithm for the streaming set cover problem.
Input. A stream of subsets of , and a -approximation of \textnormal{\mbox{opt}}(\SS).
Output. A collection of sets that cover the universe.
Let and .
Make a single pass over the stream and if , then:
(a)
and .
For to iterations:
(a)
Let be a subset of chosen by picking each element independently and w.p. .
(b)
Make a single pass over the stream and for all , store in the memory.
(c)
Find an optimal set cover of the instance and let .
(d)
Make another pass over the stream and let .
Return SOL as a set cover of the input instance.
We start by bounding the space requirement of Algorithm 3.4 .
Lemma 3.8**.**
Algorithm 3.4 requires space w.p. at least .
Proof.
It is easy to see that maintaining SOL and requires, respectively, and space. In the following, we analyze the space required for storing the sets . After the first pass of the algorithm, no set contains more than elements in . Fix a set ; we have,
[TABLE]
Hence, by Chernoff bound, w.p. , . The final bound now follows from this and a union bound on all sets in .
Remark 3.9**.**
One can make the space requirement of Algorithm 3.4 deterministic by terminating the algorithm whenever it attempts to use a memory more than the bounds in Lemma 3.8. As this event happens with negligible probability, the correctness of the algorithm can be argued exactly the same.
The following two lemmas establish the correctness of the algorithm.
Lemma 3.10**.**
Algorithm 3.4 picks at most sets in SOL.
Proof.
It is immediate to see that in the first pass, the algorithm picks at most sets as otherwise would be empty. Moreover, in each subsequent iterations, the algorithm picks at most sets since has a set cover of size at most (as the original instance had a set cover of size ).
Lemma 3.11**.**
The set SOL computed by Algorithm 3.4 is a feasible set cover of w.p. .
To prove Lemma 3.11, we use the following property of element sampling that first appeared in [23] (similar ideas also appear in [42, 32]); for completeness we provide a self-contained proof of this lemma here.
Lemma 3.12**.**
Let be a parameter and be a collection of subsets of with \textnormal{\mbox{opt}}(\SS)\leq k. Suppose is a subset of obtained by picking each element independently and w.p. ; then, w.p. , any collection of sets in that covers entirely also covers at least elements in .
Proof.
Fix a collection of subsets in that covers less than elements in . The probability that this collection covers entirely is equal to the probability that none of the elements in that are not appearing in are sampled in . Hence,
[TABLE]
Taking a union bound over all possible choices for finalizes the result.
Proof of Lemma 3.11.
In each of the iterations, Algorithm 3.4 implements the sampling in Lemma 3.12 with the parameters , and . Hence, after each iteration, the number of uncovered elements in reduces to w.p. . Consequently, by taking a union bound over the iterations, after the iterations, number of uncovered elements reduces to less than , hence proving the lemma.
We now conclude the proof of Theorem 2.
Proof of Theorem 2.
We can run Algorithm 3.4 in parallel for possible guesses for . By Lemma 3.8, the space requirement of this algorithm is as desired. Moreover, consider the guess: \textnormal{\mbox{opt}}\leq\widetilde{\textnormal{opt}}\leq(1+\varepsilon)\cdot\widetilde{\textnormal{opt}}. For this choice, we can apply Lemma 3.10 and Lemma 3.11 and obtain that the returned solution is an -approximation of the optimal set cover. Since the algorithm can make sure that the returned solution is always feasible, returning the smallest set cover among all guesses for then ensures that the returned answer is an approximation. Re-parameterizing by a constant factor, finalizes the proof.
4 The Space-Approximation Tradeoff for Maximum Coverage
In this section, we prove a space-approximation tradeoff for the maximum coverage problem.
Theorem 4**.**
For any , , and , any randomized algorithm that can make passes over any collection of subsets of presented in a random order stream and outputs a -approximation to the optimal value of the maximum coverage problem for with a sufficiently large constant probability (over the randomness of both the stream order and the algorithm) must use space.
Similar to previous section, we prove Theorem 4 by considering the communication complexity of the maximum coverage problem: Fix a (sufficiently large) , and ; MaxCover refers to the communication problem of -approximating the optimal value of the maximum coverage problem with sets defined over the universe and parameter , in the two-player communication model.
Our lower bound for MaxCover is obtained by reducing this problem to multiple instances of the gap-hamming-distance problem via a similar distribution as (using an additional simple gadget). In the following, we first introduce the gap-hamming-distance problem and prove a useful lemma on its information complexity on particular distributions required for our reduction, and then describe a hard distribution for MaxCover based on this and finalize the proof of Theorem 4.
4.1 The Gap-Hamming-Distance Problem
The gap-hamming-distance (GHD) problem is defined as follows. Fix an integer ; in , Alice is given a set , Bob is given a set and the goal is to output:
[TABLE]
where means that the answer can be arbitrary; here denotes the hamming distance between and , i.e., the size of the symmetric difference of and .
This problem was originally introduced by [35] and has been studied extensively in the literature (see [16] and references therein). We use the following result on the information complexity of this problem proven in [12]777Technically speaking, [12] bounds the external information complexity of GHD as opposed to its internal information complexity used in our paper. However, since the distribution in Lemma 4.1 is a product distribution, these two quantities are equal and hence we simply state the bound for the internal information complexity..
Lemma 4.1** ([12]).**
Let be the uniform distribution on pairs of subsets of (chosen independently); there exists an absolute constant such that
[TABLE]
For our purpose, we need to consider the following distribution for GHD instead of the uniform distribution. Let be two parameters to be determined later888The values of and are not important for our purpose and are hence only determined in the proof of Lemma 4.2.. Define:
- •
as the distribution of instances .
- •
as the distribution of instances .
- •
.
We use Lemma 4.1 to prove the following result on the information cost of -error protocols on the distribution , which could be independently useful also. The proof is deferred to Appendix B.
Lemma 4.2**.**
Let be a sufficiently small constant and be a -error protocol for on with ; then, .
4.2 Communication Complexity of MaxCover
We are now ready to prove a lower bound on the communication complexity of the MaxCover problem. To do so, we propose the following distribution.
**Distribution . **A hard input distribution for MaxCover.
Notation. Let , , and .
•
For each :
–
Let for on the universe .
–
Create , by assigning each element in w.p. to and o.w. to .
–
Let and .
•
Pick uniformly at random. If , do nothing, otherwise:
–
Sample uniformly at random.
–
Resample for and redefine and as before using the new pair (do not change and ).
•
Let the input to Alice and Bob be and , respectively.
Define \textnormal{\mbox{opt}}(\SS,\mathcal{T}) as the value of the optimal solution of the maximum coverage problem (for the parameter ) for the instance . We wish to argue that \textnormal{\mbox{opt}}(\SS,\mathcal{T}) differs by a factor depending on the choice of in the distribution and hence any approximation algorithm for maximum coverage on this distribution needs to determine the value of .
Lemma 4.3**.**
Assuming , there exists a fixed such that for any instance :
[TABLE]
Proof.
We first prove that, any -approximate -cover in this distribution always has to pick a pair of sets (for some ). This is achieved by considering the projection of the sets on the universe .
Claim 4.4**.**
W.p. :
- (a)
For any , . 2. (b)
For any , for any , and , .
Proof.
Part follows immediately from the fact that is partitioned between and , and that . We now prove Part . To do so, we prove that can only cover (essentially) fraction of w.h.p and since the rest of is a subset of with , we get the final result.
For any element , define an indicator random variable whereby iff . Since , the elements in and that are in are chosen independent of each other, and hence . Define ; we have and since variables are independent, by Chernoff bound, (as and ). The final result now follows from a union bound on all possible () pairs.
Now consider a pair for some and note that ; hence we can simply focus on part of . Moreover, we have that,
[TABLE]
where we used the fact that in the distribution , and always.
Consequently, whenever , we have,
[TABLE]
for . Similarly, whenever ,
[TABLE]
Combining these bounds with Claim 4.4 finalizes the proof.
Having proved Lemma 4.3, we can use any -approximation protocol for MaxCover to determine the parameter in the distribution (by a simple re-parametrizing of the by a constant factor). This allows us to prove the following lemma. The proof is essentially identical to that of Lemma 3.4 in Section 3.2 and is provided only for the sake of completeness.
Lemma 4.5**.**
Let be a -error protocol for MaxCover on . There exists a -protocol for on the distribution such that:
. 2. 2.
.
Proof.
We design the protocol as follows:
**Protocol . **The protocol for solving using a protocol for MaxCover.
Input: An instance . Output: .
Using public randomness, the players sample an index .
Using public randomness, the players sample the sets and each from independently.
Using private randomness, Alice samples the sets such that (for all ); similarly Bob samples the sets .
Using public randomness the players sample the sets for all from a (distinct) universe the same as distribution .
The players construct the collections and by setting and (exactly as in distribution ).
The players solve the MaxCover instance using and output No iff estimates \textnormal{\mbox{opt}}(\SS,\mathcal{T})\leq\tau (for the parameter in Lemma 4.3) and Yes otherwise.
It is easy to see that the distribution of instances created in the protocol matches the distribution for MaxCover exactly, and hence by Lemma 4.3, is a -error protocol for GHD in the distribution . The bound on the communication cost of is also immediate; in the following we bound the information cost of for sampled from .
Let be a random variable for and be the set of public randomness used by the players. By Claim 2.3,
[TABLE]
We now bound the first term in the RHS above (the second term can be bounded exactly the same). In the following, let and denote the vector of random variables for ’s and ’s, respectively.
[TABLE]
where the last equality is because conditioned on , all sets (for ) are chosen from and hence are independent of the event. We can further derive,
[TABLE]
where the second last equality is because (resp. ) and (resp. ) determine each other conditioned on and , and last equality is because the distribution of maximum coverage instances and the messages communicated by the players under and under exactly matches.
Moreover,
[TABLE]
By performing the same exact calculation for , we obtain that,
[TABLE]
where in the last inequality we used the fact that information cost of is at least .
We now have,
Theorem 5**.**
There exists a sufficiently small constant , such that for any , and ,
[TABLE]
Proof.
Suppose there exists a -error protocol for MaxCover on for a sufficiently small constant (to be determined later), with ; by Proposition 2.4, as well. Hence, by Lemma 4.5, we obtain a -error protocol for on with and . However, since as and , we can now apply Lemma 4.2 and argue that is (by taking smaller than the bounds in the Lemma 4.2); a contradiction with the information cost of obtained by Lemma 4.5.
We point out that to extend the results in Theorem 5 to case (i.e., the case not handled by Theorem 5), we can simply use an existing lower bound of [42] (Theorem 21) for this range of the parameter .
We can now prove Theorem 4 by using Theorem 5, the same exact way as we proved Theorem 1, i.e., by defining a random partitioning version of the distribution and proving the lower bound using that partitioning. We briefly sketch the proof here.
Proof Sketch of Theorem 4.
Define the distribution similar to the distribution with the difference that after creating the sets and , we randomly partition the sets between the players (i.e., assign each set to Alice w.p. and o.w. to Bob). The same exact argument in Lemma 3.7, combined with Theorem 5 (instead of Theorem 3 in Lemma 3.7) now proves that for some sufficiently small constant , .
Furthermore, any -pass -space streaming algorithm for maximum coverage on random arrival streams can be turned into an -bit communication protocol for MaxCover on (with the same error probability); see the proof of Theorem 1 for more details. This, together with the lower bound on the distribution implies that as desired.
Acknowledgements
I am grateful to my advisor Sanjeev Khanna for valuable discussions, and to Ehsan Emamjomeh-Zadeh and Sanjeev Khanna for carefully reading the paper and many helpful comments. I also thank the anonymous reviewers of PODS 2017 for many insightful comments and suggestions.
Appendix A Tools from Information Theory
Here, we briefly introduce some basic facts from information theory that are needed in this paper. We refer the interested reader to the textbook by Cover and Thomas [22] for an excellent introduction to this field.
We use the following basic properties of entropy and mutual information (proofs can be found in [22], Chapter 2).
Fact A.1**.**
Let , , and be three (possibly correlated) random variables.
. iff is uniformly distributed over its support. 2. 2.
. The equality holds iff and are independent. 3. 3.
Conditioning on a random variable reduces entropy*: . The equality holds iff . * 4. 4.
The chain rule for mutual information*: .*
We also use the following two simple facts, which assert conditions in which conditioning can provably increase (resp. decrease) the mutual information.
Fact A.2**.**
For random variables , if , then .
Proof.
Since and are independent conditioned on , by Fact A.1-(3), and . We have,
[TABLE]
Fact A.3**.**
For random variables , if , then, .
Proof.
Since , by Fact A.1-(3), . Moreover, since conditioning can only reduce the entropy (again by Fact A.1-(3)),
[TABLE]
Finally, we use the following simple inequality that states that conditioning on a random variable can only increase the mutual information by the entropy of the conditioned variable.
Fact A.4**.**
For any random variables and , .
Proof.
[TABLE]
where the first two equalities are by chain rule (Fact A.1-(4)), the second inequality is by definition of mutual information and its positivity (Fact A.1-(2)), and the last one is because conditioning can only reduce the entropy (Fact A.1-(3)).
Appendix B Proof of Lemma 4.2
Proof.
The proof consists of two separate parts. We first prove that there exists a pair , for which GHD is still “hard” under the distribution (i.e., when we fix the size of the sets and ), and in next part, use this fact to prove the bound for the distribution defined for the same pair of found in the first part (the proof of second part is basically the same as Lemma 3.5).
Claim B.1**.**
Let be a sufficiently small constant; there exists a pair of such that
[TABLE]
whereby .
Proof.
Let be as in Lemma 4.1. Suppose by contradiction that for all , , and let be the protocol achieving this bound for a specific choice of . We design the following protocol for GHD on the distribution : Given an input , Alice and Bob first communicate , to each other and then run on their input and output the same answer as in . Since each is computed on the same exact distribution as (corresponding to the same parameters and ), is a -error protocol for GHD on . We now bound the information cost of as follows (in the following, corresponds to the protocol , corresponds to the protocol , and (resp. ) is a random variable for size of (resp. ))
[TABLE]
where in the second last inequality we used assumption that for all and .
Consequently, we obtained a -error protocol for GHD on the distribution with information cost of , a contradiction with Lemma 4.1. This means that there should exists at least on pair such that is , proving the claim.
Now fix and as in Claim B.1 and define accordingly. Suppose by contradiction that is some . We can use the previous information odometer argument (i.e., Lemma 3.6) to create a protocol that solves GHD on the distribution and has information cost of , a contradiction with Claim B.1. We can simply create as follows: run the protocol and the information odometer in parallel; whenever the information cost of is larger than (for a sufficiently large constant ), terminate the protocol and output an arbitrary answer, otherwise output the same answer as . This ensures that . By definition of the distribution , and the fact that , in the cases that we terminate the protocol , the answer to GHD can be arbitrary w.p. and hence the new protocol is -error protocol for GHD on . This can be made formal exactly as in the proof of Lemma 3.6. The rest of the proof now follows from Lemma 3.6 exactly as in Lemma 3.5.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Alon, N., Matias, Y., and Szegedy, M. The space complexity of approximating the frequency moments. In STOC (1996), ACM, pp. 20–29.
- 2[2] Anagnostopoulos, A., Becchetti, L., Bordino, I., Leonardi, S., Mele, I., and Sankowski, P. Stochastic query covering for fast approximate document retrieval. ACM Trans. Inf. Syst. 33 , 3 (2015), 11:1–11:35.
- 3[3] Assadi, S., Khanna, S., and Li, Y. Tight bounds for single-pass streaming complexity of the set cover problem. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2016, Cambridge, MA, USA, June 18-21, 2016 (2016), pp. 698–711.
- 4[4] Ausiello, G., Boria, N., Giannakos, A., Lucarelli, G., and Paschos, V. T. Online maximum k-coverage. Discrete Applied Mathematics 160 , 13-14 (2012), 1901–1913.
- 5[5] Badanidiyuru, A., Mirzasoleiman, B., Karbasi, A., and Krause, A. Streaming submodular maximization: massive data summarization on the fly. In The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’14, New York, NY, USA - August 24 - 27, 2014 (2014), pp. 671–680.
- 6[6] Bar-Yossef, Z., Jayram, T. S., Kumar, R., and Sivakumar, D. An information statistics approach to data stream and communication complexity. In 43rd Symposium on Foundations of Computer Science (FOCS 2002), 16-19 November 2002, Vancouver, BC, Canada, Proceedings (2002), pp. 209–218.
- 7[7] Bar-Yossef, Z., Jayram, T. S., Kumar, R., and Sivakumar, D. Information theory methods in communication complexity. In Proceedings of the 17th Annual IEEE Conference on Computational Complexity, Montréal, Québec, Canada, May 21-24, 2002 (2002), pp. 93–102.
- 8[8] Barak, B., Braverman, M., Chen, X., and Rao, A. How to compress interactive communication. In Proceedings of the 42nd ACM Symposium on Theory of Computing, STOC 2010, Cambridge, Massachusetts, USA, 5-8 June 2010 (2010), pp. 67–76.
