Constrained Submodular Maximization via Greedy Local Search
Kanthi K. Sarpatwar, Baruch Schieber, Hadas Shachnai

TL;DR
This paper introduces a simple greedy local search algorithm that achieves near-optimal approximation ratios for maximizing monotone submodular functions under complex constraints like knapsacks and multiple matroids.
Contribution
The paper presents a new combinatorial algorithm combining greedy and local search for submodular maximization with knapsack and matroid constraints, extending known approximation guarantees.
Findings
Achieves a $rac{1 - e^{-2}}{2}$-approximation for knapsack and matroid constraints.
Extends to a $rac{1 - e^{-(k+1)}}{k+1}$ ratio for multiple matroid intersections.
Demonstrates the effectiveness of combining greedy algorithms with local search in constrained submodular maximization.
Abstract
We present a simple combinatorial -approximation algorithm for maximizing a monotone submodular function subject to a knapsack and a matroid constraint. This classic problem is known to be hard to approximate within factor better than . We show that the algorithm can be extended to yield a ratio of for the problem with a single knapsack and the intersection of matroid constraints, for any fixed . Our algorithms, which combine the greedy algorithm of [Khuller, Moss and Naor, 1999] and [Sviridenko, 2004] with local search, show the power of this natural framework in submodular maximization with combined constraints.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\Copyright
Kanthi K. Sarpatwar, Baruch Schieber, Hadas Shachnai
Constrained Submodular Maximization via Greedy Local Search
Kanthi K. Sarpatwar
IBM T. J. Watson Research Center, Yorktown Heights, New York, USA
{sarpatwa, sbar}@us.ibm.com
Baruch Schieber
IBM T. J. Watson Research Center, Yorktown Heights, New York, USA
{sarpatwa, sbar}@us.ibm.com
Hadas Shachnai
Computer Science Department, Technion, Haifa, Israel
Abstract.
We present a simple combinatorial -approximation algorithm for maximizing a monotone submodular function subject to a knapsack and a matroid constraint. This classic problem is known to be hard to approximate within factor better than . We show that the algorithm can be extended to yield a ratio of for the problem with a single knapsack and the intersection of matroid constraints, for any fixed .
Our algorithms, which combine the greedy algorithm of [Khuller, Moss and Naor, 1999] and [Sviridenko, 2004] with local search, show the power of this natural framework in submodular maximization with combined constraints.
Key words and phrases:
Submodular Maximization, Knapsack and Matroid Constraints, Greedy Algorithms, Local Search, Approximation Algorithms
1991 Mathematics Subject Classification:
F.2.2, G.1.6
1. Introduction
A set function is submodular if for every , . Such functions are ubiquitous in diverse fields, most notably in combinatorial optimization, operations research and economics. An equivalent definition of submodularity, which is perhaps more intuitive, refers to its diminishing returns: , for any , and . The concept of diminishing returns is widely used in economics, often leading to submodular utility functions. Submodular functions also provide a unifying framework which captures optimization problems that have been studied earlier, such as Min Cut, Max Cut, Multiway Cut, Maximum Coverage, the Generalized Assignment Problem, or the Separable Assignment Problem.
In these settings, the goal is to optimize a submodular function subject to certain constraints. In fact, in many cases the constraints at hand are quite simple, such as knapsack constraints, matroid constraints, or a combination of the two. Such submodular maximization problems naturally arise in advertising campaigns [1], combinatorial auctions [8, 33], social networks [22, 23], and document/corpus summarization [31, 25].
We consider first the classic problem of maximizing a monotone submodular function subject to a knapsack and a matroid constraint. Formally, let be a non-negative monotone submodular function of a ground set . Throughout the paper, we assume that is given via a value oracle; that is, given a set , the oracle returns . Let be a family of subsets of , and a matroid defined over and (see Section 2 for the formal definition).
The subsets in the collection are called independent sets. Suppose that each element has a nonnegative size , and let be a given size bound. Our goal is to maximize over all subsets , such that is an independent set of , and the total size of the elements in is bounded by , i.e.,
[TABLE]
We further consider a generalization of (1) in which we are given collections of independent sets, , each containing subsets of , and the corresponding matroids , . The goal is to maximize over all subsets of which are independent sets in . Formally, the optimization problem is
[TABLE]
There is a beautiful line of research in this area. For a long time, the only known work had been a sequence of papers by Fisher, Nemhauser and Wolsey [30, 16, 29], showing that a greedy algorithm achieves a ratio of to the optimum for maximizing a monotone submodular function under a cardinality constraint,111The special case of a knapsack constraint where for all . with a matching hardness of approximation result in the oracle model. The paper [16] shows that simple local search yields a ratio of when the function is maximized under a matroid constraint. A greedy algorithm was shown to achieve a ratio of for this problem with matroid constraints.
The hardness of approximation within ratio better than , already for Maximum Coverage (i.e., maximizing a linear function under cardinality constraint), follows from a result of Feige [9]. This has led to an ongoing research aiming to develop approximation algorithms which come close to this bound. The seminal paper of Khuller, Moss and Naor [24] achieved the ratio of for Budgeted Maximum Coverage, using a greedy algorithm. Their result inspired the later work of Sviridenko [32], presenting a simple greedy algorithm for maximizing a monotone submodular function subject to a knapsack constraint. For a single matroid constraint, there are several algorithms achieving the ratio via continuous greedy and multilinear relaxations [33, 4] as well as other advanced techniques [15, 2].
For all of these cases the best known approximation ratios are essentially the best possible, with many of the algorithms easy to employ in practical scenarios; yet, already for maximizing a monotone submodular function subject to a knapsack and a matroid constraint, attempting to approach the ratio within additive of , for any fixed , requires existing algorithms to perform steps [5] (see also [18]). The dependence on in the exponent renders these algorithms impractical even for . The next best ratio of , obtained via multilinear relaxations and contention resolution schemes [6], deviates from the known hardness of approximation bound of almost by factor of .222The bound of follows from the known inapproximability results for each constraint alone [9, 12]. A fast algorithm of [2] achieves the ratio . Similarly, for a single knapsack constraint and matroid constraints, where , the best known ratio is , due to [14] (see also [11] and Table 2). For fixed , the best ratio is derived in [2].333The algorithm of [2] achieves these bounds in almost linear time, for a class of more general instances. The best known hardness of approximation result is , due to [21], for the intersection of matroid constraints.
In this work, we amend the above state of affairs by obtaining practical algorithms whose approximation ratios are closer to the known hardness of approximation results. We note that some of the previous algorithms have better running times, while the results of [5, 18] lead to better bounds (albeit at high computational cost). Yet, as detailed below, a notable advantage of our algorithms is in being extremely simple.
While we focus in this paper on maximizing a monotone submodular function under a single knapsack and (single or multiple) matroid constraints, there are known results for extension to intersection of -system and knapsack constraints (e.g., [17, 19, 20, 2, 6]), and for maximizing a non-monotone submodular function [10, 28, 27, 20, 13, 14, 26, 6, 12]. For unconstrained submodular maximization, the best known result is a randomized linear time -approximation algorithm by Buchbinder et al. [3].
**Our Contribution: ** We present combinatorial algorithms for maximizing a monotone submodular function subject to a knapsack and matroid constraints, yielding approximation ratio of , for any fixed . The algorithms are based on the greedy approach of [24, 32] combined with local improvement steps which allow -swaps (see, e.g., [27]). These easy features make the algorithms highly intuitive in tackling our problems. For the single matroid case, we show that our algorithm can be implemented in time, where is the cardinality of the ground set.444The notation “hides” poly-logarithmic factors. We note that the greedy algorithm of Sviridenko [32], that yields the best possible ratio of for maximizing a submodular function subject to a knapsack constraint, has running time . For a single knapsack and matroid constraints, where is fixed, the running time of our algorithm is . Table 1 gives the known results for the special case where , along with the running times of the algorithms. Table 2 compares our approximation ratios with best previous results for several values of .
Our algorithms handle the combined knapsack and matroid constraints by applying greedy swaps. While the greedy property guarantees that we do not exceed the knapsack constraint without collecting enough value, the swaps maintain the independence of the solution set given the matroid constraint. Our algorithms for single and multiple matroid constraints (outlined in Alg. 1 and Alg. 2, respectively) proceed in the same fashion. The only difference is that in the multiple matroid case, we replace the ‘swaps’ with ‘-swaps’; a -swap may involve the deletion of up to elements from the solution set, while adding to the set a single element. To the best of our knowledge, this notion of greedy swaps is used here for the first time.
2. A Single Matroid Constraint
We first study the problem where we have a knapsack constraint and a single matroid constraint (as defined in Eq. (1)).
We start with some definitions and notation. For a subset of elements , let denote the total size of the elements in , i.e., .
For notational convenience, we consider as a dummy element not in the ground set . We assume that , and for any set , . Given a subset , where is independent, i.e., , a pair of elements , where and , is a swap if is independent, i.e., . Let denote the collection of swaps that involve elements from . The marginal profit density of a swap is given by .
2.1. The Algorithm
Let be an optimal solution of the given instance, ordered such that is the element with maximum marginal profit with respect to the prefix . We assume that , since in case consists of a single element, this element can be guessed to yield an optimal solution. The algorithm starts by guessing , the two elements with largest marginal profits in .
We set the initial solution to consist of the set . The algorithm then applies iterations of ‘greedy swaps’ that expand the solution while maintaining the knapsack and the matroid constraint.
We note that for ease of understanding the pseudocode of 1-MatKnap (in Algorithm 1) does not guarantee that the algorithm terminates in polynomial time. However, in Section 2.3 we discuss a modified version of our algorithm that runs in polynomial time with no harm to the approximation ratio.
2.2. Analysis
We now show that Algorithm 1-MatKnap yields an approximation ratio that is close to the optimal of known for submodular maximization with either a knapsack or a matroid constraint. Formally, our main result is the following.
Theorem 2.1**.**
1-MatKnap is a -approximation algorithm for submodular maximization subject to a knapsack and a matroid constraint.
The following simple lemmas will be useful in the proof of Theorem 2.1. Let be a collection of subsets of , and a matroid defined on . Recall that satisfies the following (see, e.g., [7]).
Non-emptiness: The empty set is in (thus, is not itself empty). 2.
Hereditary property: If a set is in then every subset of is also in . 3.
Exchange property: If and are two sets in , where , then there is an element such that is in .
We note that all maximal independent sets (or bases) of a matroid have the same cardinality. The first lemma follows directly from the above Exchange property.
Lemma 2.2**.**
Let be two bases of a matroid , then for any , there exists such that is a base of .
Lemma 2.3**.**
Given two independent sets , there exists a mapping , such that , for all , and , for all .
Proof 2.4**.**
Let be a base of , such that . By the Exchange property, elements from can be added to the set to obtain a base of . Denote this base by . For each element , define .
We now iterate over all elements in . Let be the current element. By Lemma 2.2, there exists , such that is a base of . Note that must be in , since . We set and , and iterate on the next element in . By our construction, no two elements in are mapped to the same element in .
The next lemma is due to Wolsey [34].
Lemma 2.5**.**
Given two positive integers, and , and a set of real-valued non-negative numbers , ,
[TABLE]
The following generalizes a result of Sviridenko [32]. We include the proof here for completeness.
Lemma 2.6**.**
Given an element , , and a subset , for any swap , such that ,
Proof 2.7**.**
The ordering of the elements of , and the fact that is submodular, monotone, and non-negative imply that for , where and the subsets and , the following inequalities are satisfied:
[TABLE]
and
[TABLE]
Summing the two inequalities, we have that
[TABLE]
Proof 2.8** (Proof of Theorem 2.1).**
*Suppose we have guessed and correctly. Let denote the subset output by the algorithm.
By Lemma 2.3, there exists a mapping , such that , for all , and , for all .*
Throughout the analysis, we refer to iterations of the outer loop (Line 4.) in Algorithm 1. Thus, each iteration, except maybe the last one, consists of a single greedy swap. We distinguish between two cases, based on the last iteration of the algorithm.
Case 1:* For all elements , the swap was rejected in the last iteration because it satisfied , and not because . That is, the swap did not violate the knapsack constraint.*
From Case 1 assumption, it follows that no swap , for an element , could increase the value of . Hence, we have that
[TABLE]
The first and second inequalities follow from submodularity, and the third from our assumption about the swaps. Recall that for every we have , thus the (multi) set does not contain any element from with multiplicity higher than 1. Let , where and the indices are assigned arbitrarily. Then, by (2.8), and using submodularity and monotonicity, we have
[TABLE]
Hence, in this case, we have that 1-MatKnap yields a -approximation to the optimum.
Case 2:* At least one swap , for , considered in the last iteration, violated the knapsack constraint; namely, .*
Denote by the subset of elements in the solution after iteration of the algorithm, for , where . Let be the mapping guaranteed by Lemma 2.3. We note that in all iterations except the last the algorithm performs a swap (where may be ). Denote by .
Let be the first iteration in which a swap , for , was considered in iteration and was rejected since it violated the knapsack constraint. Clearly, such an iteration exists by the assumption of Case 2. By the choice of , for all , for every such that is a swap, we must have .
Define the function . Since is submodular and monotone, so is . Hence, for all ,
[TABLE]
Using submodularity, we can now write:
[TABLE]
Claim 1**.**
.
Proof 2.9**.**
Clearly, for any such that , we have . Thus, we need only to consider the sum over the subset , such that for any , . As noted above, the (multi) set does not contain any element from with multiplicity higher than 1. Let , where , and the indices are assigned arbitrarily. Then, similar to Inequality (2.8),
[TABLE]
Using Claim 1, we have for any ,
[TABLE]
The last inequality is due to the following:
- (a)
By our choice of , for any swap , . 2. (b)
.
Now, from the above discussion, we have for any ,
[TABLE]
For , let , and . Note that . Since the algorithm performs swaps, we have that . Also, let , and We note that, by the definition of , we have that . For , let when . Using the above notation, we have that
[TABLE]
and for ,
[TABLE]
Now, we note that
[TABLE]
Using Lemma 2.5 and Inequality (7), we have
[TABLE]
Finally, using Lemma 2.6,
[TABLE]
2.3. Running Time Analysis
We note that the running time of Algorithm 1-MatKnap, as described above, may not be polynomial, since the number of greedy swaps cannot be bounded polynomially. Below, we show how the algorithm can be implemented in polynomial-time with no harm to the approximation ratio.
Theorem 2.10**.**
Algorithm 1-MatKnap can be modified to run in time.
Proof 2.11**.**
To guarantee polynomial running time, we modify the greedy swaps as follows. Fix some (to be set later). Whenever an improvement is encountered, a swap is performed only if the value of increases by at least a factor of 1+ as a result of the swap. (We note that this modification applies only to “real” swaps, and not to swaps with the dummy element .) Let , then by Lemma 2.6,
[TABLE]
Hence, the overall number of swaps is bounded by . We show next that we can find a constant that would not degrade the approximation ratio of the algorithm. Consider the two cases in the proof of Theorem 2.1. In Case 1, after the modification Inequality (2.8) becomes
[TABLE]
Thus, we get -approximation. Since the approximation factor in Case 2 is , the overall approximation remains unchanged, as long as , which implies .
Now, consider Case 2 of Theorem 2.1. After the modification, Inequality (3.5) becomes
[TABLE]
Note that for any , if then , otherwise, i.e. , either or . The set contains all the elements for which the first inequality is satisfied. Inequality (2.11) implies modified Inequality (2.8):
[TABLE]
Proceeding to propagate this modification, we finally get
[TABLE]
It follows that the approximation ratio remains , as long as . By (10), we have ; thus, this holds for .
We now turn to analyze the running time. We have guesses of the set . For each such guess, we have overall successful swaps. (Note that there are swaps in which is involved.) We have overall swaps, each requires time. We conclude that the overall running time is .
We remark that a factor of can be saved in the running time at the expense of reducing the approximation ratio by . Specifically, we can perform the swaps only if the multiplicative improvement is 1+.
3. Matroid Constraints
In this section we extend our algorithm for a single matroid constraint to handle a single knapsack constraint and matroid constraints, for any fixed ; namely, we give an approximation algorithm for problem (2).
3.1. Algorithm
A key operation in our algorithm for matroid constraints is -swap, in which we add to the solution set a single element and eliminate up to elements. Denote the collection of subsets of of size at most by ; note that . Given a subset , and , is a k-swap if is independent, i.e., .
Let denote the collection of -swaps. The marginal profit density of a -swap is given by .
**Overview: ** Similar to 1-MatKnap, Algorithm -MatKnap modifies the solution set iteratively, while increasing the objective function value and maintaining the knapsack and matroid constraints. The main change is in replacing swaps with -swaps. A pseudocode of -MatKnap is given in Algorithm 2.
3.2. Analysis
We first analyze the performance ratio of -MatKnap.
Theorem 3.1**.**
For any fixed , -MatKnap is a -approximation algorithm for problem (2).
We use in the proof the next lemmas.
Lemma 3.2**.**
Given two independent sets , there exists a mapping of the elements in to (namely, subsets of of size at most , including the empty set), such that each element appears in at most subsets.
Proof 3.3**.**
For a matroid , , since are independent sets, by Lemma 2.3 there exists a mapping , such that , for all . We use the mappings to define the mapping as follows. For each element , define . The -swaps are defined accordingly. By Lemma 2.3, for each element , and for , . It follows that each element appears in at most mapped subsets.
The proof of the next lemma is similar to the proof of Lemma 2.6 (details omitted).
Lemma 3.4**.**
Given an element , , and a subset , for any -swap , where ,
Proof 3.5** (Proof of Theorem 3.1).**
*Suppose we have guessed and
correctly. Let denote the subset output by the algorithm. By Lemma 3.2, there exists a mapping , such that , for all .*
*We distinguish between two cases, based on the last iteration of the algorithm. *
Case 1:* For all elements the -swap was rejected because it satisfied (and not because . That is, the swap did not violate the knapsack constraint.*
*By Case 1 assumption, it follows that no -swap , for an element , could increase the value of . Hence, similar to the single matroid case, we have *
[TABLE]
In the next claim, we show that . Thus, in this case, we have that -MatKnap yields a -approximation to the optimum.
Claim 2**.**
Let be the mapping defined in Lemma 3.2. Then .
Proof 3.6**.**
Let , where the indices are assigned arbitrarily. Fix an element , for which . Let , where , and , and let , for , and . We have
[TABLE]
The last inequality follows from submodularity, since for ,
[TABLE]
As each element in appears in at most subsets , for , summing Inequality (3.6) over all elements in we get
[TABLE]
Case 2:* At least one -swap , for , considered in the last iteration, violated the knapsack constraint; namely, satisfied .*
As in the single matroid case, denote by the subset of elements in the solution after iteration of the algorithm, for , where . Let be the mapping guaranteed by Lemma 3.2. Recall that in all iterations except the last, the algorithm performs a swap , where . Denote by .
Let be the first iteration in which a swap , for , was considered and rejected, since it violated the knapsack constraint. By the choice of , for all , for every such that is a swap, we must have .
*As in the single matroid case, define the function .
Using submodularity, we can now write:*
[TABLE]
Following the proofs of claims 1 and 2, we have . Thus, for any ,
[TABLE]
It follows that
[TABLE]
A derivation similar to the one for the single matroid case, combined with Lemma 2.5 and (17), yields
[TABLE]
Finally, using Lemma 3.4,
[TABLE]
Using analogous techniques, as in the case of a single matroid constraint, we can modify the algorithm to obtain the following theorem.
Theorem 3.7**.**
Algorithm -MatKnap can be modified to run in time.
Proof 3.8** (Proof (sketch)).**
Fix some (to be set later), and perform a -swap only if the value of increases at least by a factor of 1+ as a result of the swap. As before, the overall number of swaps is bounded by , and it can be shown that for , the approximation ratio remains the same.
We now turn to analyze the running time. We have guesses of the set . For each such guess, we have overall successful -swaps (Note that there are swaps in which is involved.) This implies, as before, an overall of -swaps. Each -swap requires time. We conclude that the overall running time is .
**Acknowledgments: ** We thank Chandra Chekuri and Moran Feldman for helpful comments on an earlier version of the paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ron Adany, Moran Feldman, Elad Haramaty, Rohit Khandekar, Baruch Schieber, Roy Schwartz, Hadas Shachnai, and Tami Tamir. All-or-nothing generalized assignment with application to scheduling advertising campaigns. ACM Trans. Algorithms , 12(3):38:1–38:25, 2016.
- 2[2] Ashwinkumar Badanidiyuru and Jan Vondrák. Fast algorithms for maximizing submodular functions. In Proceedings of the 25th Annual ACM-SIAM Symposium on Discrete Algorithms , pages 1497–1514, 2014.
- 3[3] Niv Buchbinder, Moran Feldman, Joseph Naor, and Roy Schwartz. A tight linear time (1/2)-approximation for unconstrained submodular maximization. SIAM J. Comput. , 44(5):1384–1402, 2015.
- 4[4] Gruia Calinescu, Chandra Chekuri, Martin Pál, and Jan Vondrák. Maximizing a monotone submodular function subject to a matroid constraint. SIAM Journal on Computing , 40(6):1740–1766, 2011.
- 5[5] Chandra Chekuri, Jan Vondrák, and Rico Zenklusen. Dependent randomized rounding via exchange properties of combinatorial structures. In Proceedings of the 51st Annual IEEE Symposium on Foundations of Computer Science , pages 575–584, 2010.
- 6[6] Chandra Chekuri, Jan Vondrák, and Rico Zenklusen. Submodular function maximization via the multilinear relaxation and contention resolution schemes. SIAM J. Comput. , 43(6):1831–1879, 2014.
- 7[7] Thomas H. Cormen, Charles Eric Leiserson, Ronald L Rivest, and Clifford Stein. Introduction to algorithms . MIT press Cambridge, 2001.
- 8[8] Shahar Dobzinski and Michael Schapira. An improved approximation algorithm for combinatorial auctions with submodular bidders. In Proceedings of the 17th Annual ACM-SIAM Symposium on Discrete Algorithms , pages 1064–1073, 2006.
