Partial Function Extension with Applications to Learning and Property Testing
Umang Bhaskar, Gunjan Kumar

TL;DR
This paper investigates the problem of extending partial functions to satisfy properties like subadditivity, submodularity, and convexity, providing complexity bounds, algorithms, and applications in learning and property testing.
Contribution
It offers new complexity results, algorithms, and testing methods for extending partial functions with key properties, advancing understanding in optimization and learning.
Findings
Extension for subadditive functions is coNP-complete with tight approximability bounds.
Algorithms for submodular extension are effective in specific cases, but the general complexity remains open.
Determining the existence of convex function extensions is efficient, but computing their values is NP-hard.
Abstract
In partial function extension, we are given a partial function consisting of points from a domain and a function value at each point. Our objective is to determine if this partial function can be extended to a function defined on the domain, that additionally satisfies a given property, such as convexity. This basic problem underlies research questions in many areas, such as learning, property testing, and game theory. We formally study the problem of extending partial functions to satisfy fundamental properties in combinatorial optimization, focusing on upper and lower bounds for extension and applications to learning and property testing. (1) For subadditive functions, we show the extension problem is coNP-complete, and we give tight bounds on the approximability. We also give an improved lower bound for learning subadditive functions, and give the first nontrivial testers for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\newfloatcommand
capbtabboxtable[][\FBwidth]
Partial Function Extension with Applications to Learning and Property Testing
Umang Bhaskar Work supported in part by a Ramanujan fellowship. Email: [email protected] Tata Institute of Fundamental Research, Mumbai
Gunjan Kumar Email: [email protected] Tata Institute of Fundamental Research, Mumbai
Abstract
In partial function extension, we are given a partial function consisting of points from a domain and a function value at each point. Our objective is to determine if this partial function can be extended to a function defined on the domain, that additionally satisfies a given property, such as convexity. This basic problem underlies research questions in many areas, such as learning, property testing, and game theory. We formally study the problem of extending partial functions to satisfy fundamental properties in combinatorial optimization, focusing on upper and lower bounds for extension and applications to learning and property testing.
- •
For subadditive functions, we show the extension problem is coNP-complete, and we give tight bounds on the approximability. We also give an improved lower bound of for learning subadditive functions. Previously, Balcan et al. (2012) gave a lower bound of for this problem. We also give the first nontrivial testers for subadditive and XOS functions.
- •
For submodular functions, we show that if a partial function can be extended to a submodular function on the lattice closure111The lattice closure of set of points is the minimal set that contains and is closed under union and intersection. of the partial function, it can be extended to a submodular function on the entire domain. We obtain algorithms for determining extendibility in a number of cases, including if is a constant, or the points are nearly the same size. The result uses a combinatorial certificate for non-extendibility which we call a square certificate. Seshadhri and Vondrak (2014) previously give a characterization in terms of path certificates. The complexity of extendibility is in general unresolved.
- •
Lastly, for convex functions in , we show an interesting juxtaposition: while we can determine the existence of an extension efficiently, computing the value of a widely-studied convex extension at a given point is strongly NP-hard.
1 Introduction
A partial function consists of a set of points from a domain, and a real value at each of the points. Given a property , the partial function extension problem is to determine if there exists a total function ( is defined on the entire domain) that extends the partial function ( equals the given value at each point in ) and satisfies . E.g., property could be linearity, and we are required to determine if there exists a linear function that extends the given partial function. In this paper, we study partial function extension when is finite, to fundamental properties in combinatorial optimization.
The problem of partial function extension underlies research and techniques in a number of different areas, and is hence intensely studied. We mention three such areas. Firstly, in property testing, a function is given by an oracle, and the problem is to determine with high probability by querying the oracle whether the function satisfies a required property, or is far from it. The focus in property testing is on algorithms with optimal query-complexity. Typically a testing algorithm cleverly queries some sets and rejects if the values at the queried sets cannot be extended to a function with the required property. Clearly characterizing when a partial function can be extended plays an important role here. The problem of partial function extension, and its connection to property testing, is also explicitly raised by Seshadhri and Vondrak [31].
Secondly, in learning theory, the goal is to understand if a family of functions can be learned by random samples. That is, does there exist an efficient algorithm that for any target function in the family takes as input the function values at a set of sampled points, and returns a function that is “close” to the target function? Here, partial function extension can be used to give lower bounds on the learnability of various function classes (and has been used thus in previous papers, e.g., Balcan, Constantin, Iwata and Wang [3]).
Thirdly, in economics, given data from experiments regarding agent behaviour (such as the purchases made by an agent, or the bids of a (truthful) bidder in auctions), revealed preference theory studies whether the data is rationalizable by utility functions with a particular property. That is, whether there exists a utility function with a particular property that is consistent with the observed data. A natural assumption for utility functions is “diminishing marginal returns”, which translates to submodularity for indivisible goods, and concavity for divisible goods [29]. Other assumptions on utility functions are also common, e.g., subadditivity, XOS, etc [3, 11]. The problem of deciding rationalizability by utility functions with these properties is exactly the problem we study.
In each of the above areas, a basic step towards a solution is often to determine if a given partial function can be extended to a total function. In this paper, rather than a means to an end, we study the problem of partial function extension itself. We focus on the complexity of deciding if a partial function can be extended to functions satisfying fundamental properties — subadditivity, XOS, submodularity, and convexity. These represent perhaps the most commonly studied classes of functions in all of combinatorial optimization. In obtaining our results on partial function extension, we show that the structural lemmas can be used to obtain several results for property testing and learning, thus validating a direct study of partial function extension.
Formally, a partial function is a set of duples , with in the domain or , and the observed function value at . Additionally, we are given a property . The -Extension problem is to determine if there exists a total function defined on the domain or that satisfies property and extends the given partial function , i.e., for all . We also consider the Approximate -Extension problem, where we want to determine the minimum multiplicative error for a given partial function to extend to a function that satisfies the given property. That is, in Approximate -Extension, we want to find the minimum such that a function satisfies property and additionally, for all . If Approximate -Extension is computationally hard, we are interested in approximating .
Note that in our case, our input is . An algorithm is efficient if it runs in time polynomial in the size of , which may be exponential in the dimension .
We note a basic difference between partial function extension on the one hand, and property testing and learning on the other. In partial function extension, there is no target function ; we are interested in determining if any total function that extends the given partial function has the required property. In property testing and learning, there exists a target function which we access via an oracle (in property testing) or via samples from a distribution (in learning).
A function is subadditive if for all sets and . A function is an XOS function if it can be expressed as the maximum of linear functions for some . XOS functions are a subclass of subadditive functions. A function is submodular if for all .
For notation, is the set of points in the given partial function . These are called defined points, and are undefined points. Points on the hypercube are naturally subsets of , and for , is its characteristic vector. We frequently use this correspondence. All missing proofs are in the appendices.
Our Contribution.
We show the following main results.
Result 1**.**
Subadditive Extension is coNP-complete. There is an approximation algorithm for Approximate Subadditive Extension, and if , this is tight.
The lower bounds in the theorem depend upon characterizations of partial functions that can be extended to subadditive functions. The upper bound uses the fact that for XOS functions, a well-studied subclass of subadditive functions, Approximate Extension (and hence Extension) can be solved in polynomial time. Further, any subadditive function can be approximated by an XOS function, by a factor of [11, 20].
Our characterization for subadditive functions, as well as known characterizations for XOS functions, can be used to give the following results for learning and property testing.
Result 2**.**
Subadditive functions cannot be learned by a factor of .
This improves upon a previous lower bound of [3]. We combine the characterization of subadditive functions with recent results on the size of combinatorial families of sets called -cover free families [23, 22] for the above result.
Result 3**.**
Given , there are testers for subadditive and XOS functions that make queries. Further, there is a tester for nonmonotone subadditive functions that makes queries.
We thus obtain the first nontrivial testers for subadditive and XOS functions.
For submodular functions, we show the following main result. Given , we say a function is submodular in if for all .
Result 4**.**
For partial function with defined points , let be the family of sets that are (i) both contained in and contained by some set in , and (ii) obtained by the union and intersection of sets in (are in the lattice closure of ). Then the partial function is extendible to a submodular function in iff it can be extended to a submodular function in .
Thus if is then Submodular Extension can be solved in polynomial time. This includes the case when is a constant, when all points in have size difference , and when is an antichain. Further, if is an antichain, then and any assignment of values to the points in is trivially submodular in . Hence if is an antichain, it can always be extended to a submodular function. Our results for submodular functions depend on a combinatorial characterization of nonextendibility, which we call a square certificate. Our proof of the above result and development of square certificates forms our main technical contribution. Seshadhri and Vondrak [31] also study property testing of submodular functions, and develop an alternative certificate called a path certificate. We believe that square certificates are more natural, and may lead to improved testers for submodular functions. In general the problem of Submodular Extension remains open. However, we can use our result for the extendibility of antichains to obtain the following result for learning submodular functions
Result 5**.**
Submodular functions cannot be learned.222In contrast, the class of nonnegative, monotone submodular functions can be learned with approximation ratio [6].
Lastly, we consider convex functions. The problem of Convex Extension has been studied before in convex analysis (e.g., [21, 33]). We however show an interesting juxtaposition of results: While it can be determined in polynomial time if a partial function is extendible to a convex function, determining the value of a natural and widely-studied extension at a given point is NP-hard.
Result 6**.**
Approximate Convex Extension (and hence Convex Extension) is in . However, determining the value of a canonical extension at a given point is strongly NP-hard.
2 Subadditive and XOS Functions
We now consider the problem of extending a given partial function to monotone subadditive and XOS functions. A function is subadditive if for all sets and , and monotone if for all . A function is an XOS function if it can be expressed as the maximum of linear functions for some , i.e., there exist vectors for such that for every . XOS functions are a subclass of subadditive functions and are equivalent to fractionally subadditive functions [24]. A function is fractionally subadditive if for all such that and for each . Subadditive functions capture the important case of complement-free functions, for which no two subsets of the ground set “complement” each other. This is a natural assumption in many applications, and hence these functions and various subclasses, including XOS functions, are widely used in game theory [3, 11, 29].
For XOS functions, the following positive result follows from writing a linear program with the vectors with as variables. While in general may be exponential, a partial function is extendible iff the linear program is feasible for .
Theorem 1**.**
Aproximate XOS Extension (and hence XOS Extension) can be solved efficiently.
We give the following characterization for subadditive functions, also implicit in Lemma 3.3 of [2].
Lemma 1** ([2]).**
Partial function is extendible to a subadditive function iff for all such that .
Proof.
For the first direction, if there exist such that and then either or , and hence either monotonicity or subadditivity is violated. For the other direction, first assume that . Then the function can be seen to be monotone subadditive extension. If then the function is a monotone subadditive extension where for all , and otherwise where . ∎
We immediately obtain the following result.
Corollary 1**.**
Subadditive Extension is in coNP, and be solved in time.
We use the characterization to show that is a tight bound on the approximability of Approximate Subadditive Extension, unless (and that the Extension problem is coNP-complete). For the lower bound, we give a reduction from Set-Cover. For the upper bound, we use earlier results which show that any subadditive function can be -approximated by an XOS function [11, 20]. Since Approximate XOS Extension can be efficiently solved (Theorem 1), this gives us our upper bound.
Theorem 2**.**
Subadditive Extension is coNP-complete. There is an approximation algorithm for Approximate Subadditive Extension, and if , this is optimal.
Proof.
Recall the Set-Cover problem. An instance of Set-Cover is a universe , family of sets such that and an integer . We need to determine if there exists a cover of universe of size at most .
First we prove that the Subadditive Extension is CoNP-hard by reduction from Set-Cover. Construct a partial function that is defined on each set in and . The value at each set is , and the value at is . If this partial function can be extended then every cover of must have size at least . On the other hand, if the partial function can not be extended then there must exist a cover of size at most . Both of the above facts easily follow from Lemma 1.
The lower bound of for Approximate Extension, as before, the partial function is defined on sets , and value at each set in is , and at is . Suppose we have a approximation algorithm for Approximate Extension, which for this instance returns value . Then note that , where is the optimal value of for the Approximate Extension.
Since the algorithm returns value , so there exists an extension such that for all and . Therefore, by Lemma 1, every cover of has size at least . Now we claim that there must exist a cover with size at most . If not, then all covers of have size at least , and it is easy to see that the partial function is extendible by Lemma 1. This implies which is a contradiction. This then gives an -approximation algorithm for Set Cover, and since Set Cover cannot be approximated by a factor better than [19], this is true of Approximate Extension also.
The upper bound for Approximate Extension is shown in Appendix A ∎
A lower bound on learning subadditive functions.
We now show that subadditive functions cannot be learned by a factor of in the PMAC model of learning. The PMAC (Probably Mostly Approximate Correct) model seeks to determine for a family of functions, if it is possible to efficiently obtain a function “close to” a target function , given samples from some distribution over and the value of at the sampled points. Formally, let be a family of set functions (e.g., subadditive functions).
Definition 1** ([7]).**
An Algorithm PMAC-learns a family of functions with approximation factor , if for any distribution (on ) and any target function , and for any sufficiently small :
- •
* takes the sequence as input where is and the sequence is drawn i.i.d. from the distribution ,*
- •
* runs in time,*
- •
* returns a function such that*
[TABLE]
That is, with at least probability (over examples drawn from ), the value of the returned function should be within an factor of the target function for at least fraction of the probability mass according to . The following lemma makes explicit the connection between PMAC-learning and extending partial functions (we use it later in showing lower bounds for learning submodular functions as well). The lemma has been implicitly used earlier to obtain lower bounds on learning subadditive and submodular functions [4, 7].
Lemma 2**.**
Suppose there exists a family of subsets of such that is superpolynomial in , and the partial function is extendible to a function in for any values of (where ), . Then the family of functions cannot be learned by any factor .
The above bound holds even if the algorithm knows the distribution , is allowed unbounded computation and chooses samples adaptively.
Balcan et. al. [4] proved an upper bound of and an lower bound for learning subadditive functions. Using Lemmas 1, 2 and a known result for cover-free families [23, 22], we show an improved lower bound of .
A family of sets is called an -cover free family [23, 22] if for all distinct sets we have . Let be the cardinality of the largest -cover free family.
Theorem 3** ([22]).**
.
Lemma 3**.**
If is an -cover free family then the partial function is extendible to a subadditive function for any value of .
Proof.
Suppose the partial function is not extendible. Therefore, by Lemma 1, there exists sets for some such that and . Therefore, we have which is a contradiction as is an -cover free family. ∎
Theorem 4**.**
In the PMAC model, subadditive functions cannot be learned by any factor.
Proof.
We have for some constant . For , is clearly superpolynomial in . Also, for , . Hence, is superpolynomial for . Therefore, for any such , by Theorem 3 there exists a -cover free family such that is superpolynomial. The theorem is directly implied by Lemmas 2 and 3. ∎
Testers for subadditive and XOS functions.
We now describe testers for subadditive and XOS functions that make queries; these are the first non-trivial testers for either of these functions. For definitions, we focus on subadditive functions, but the corresponding definitions for XOS functions are obvious. A function is -far from subadditive if for any subadditive function , we have . A tester for subadditive functions is a randomized algorithm that takes distance parameter and oracle access to a function as inputs, and accepts if is subadditive, and rejects with constant probability if is -far from subadditive.
We describe the testers here. Let , and define . The tester repeats the following steps times.
- •
Randomly pick a set and query the sets .
- •
(Subadditive tester) Reject if EITHER there exists such that OR there exist for some such that and .
- •
(XOS tester) Reject if EITHER there exists such that OR there exist and for some such that for all elements , and .
It is clear that tester makes queries and , which is also a bound on the number of queries by the tester.
Recall that XOS functions are fractionally subadditive functions, i.e., a function is XOS iff for all and such that for each , . Clearly if the function is subadditive or XOS then the tester accepts. Now we need to show if the function is -far from subadditive or XOS then tester rejects with high probability. Before that we show the following characterization for extension of a partial function to an XOS function.
Lemma 4**.**
Partial function is extendible to a XOS function iff for all and all for all such that for each , we have .
Proof.
Obviously if there is a XOS extension then the above property must hold as XOS functions are also fractionally subadditive. For the other direction, consider . We claim that is a XOS extension of the partial function. Let . By the assumption, we have . Also, if we set and for rest of then we get . Therefore, is an extension. Now suppose for some and , we have . We will show that which will complete the proof. Let for all . Here, are optimal values as in definition of and we have . Therefore, we need to show that . Note that . Therefore, by definition of , we have . ∎
A set is called bad if it causes the tester to reject. For subadditive functions, the set of bad sets consists of such that either there exists such that or there exists for some such that and . Similarly, for XOS functions, is in if there exists such that or there exists, for some , such that and such that for all elements , and .
We show that removing all sets not in , as well as the bad sets, gives us a partial function that can be extended to subadditive (or XOS). Since the function is -far and is large by our choice of , therefore there must be many bad sets.
Lemma 5**.**
The partial function is extendible to a subadditive (XOS) function.
Proof.
Suppose the partial function is not extendible, and let be the defined points in . Then for subadditive functions, by Lemma 1, there exist such that and . Then either or for some , . Note that since , so is . Further, . Thus, in the first case, is in while in the second case, , giving a contradiction.
For XOS functions, by Lemma 4 there exists and for some , such that for each and . Like subadditive functions, either or for some , . Again, in the first case, is in , while in the second case . ∎
Theorem 5**.**
If is -far from subadditive (XOS) functions then the above tester rejects with constant probability.
Proof.
Let and . Since the partial function is extendible so (since is -far). Note that . By Chernoff bound, where is a binomial random variable . By the choice of , . Hence we have . Therefore in a single iteration our tester will pick a bad set with probability at least . Hence after iterations, the tester will pick a bad set with constant probability.
∎
A subexponential tester for nonmonotone subadditive functions.
We now describe a property testing algorithm for general (nonmonotone) subadditive functions that makes queries; in this subsection, subadditive refers to nonmonotone subadditive functions.
Let , and define . The tester repeats the following steps times:
- •
Randomly pick a set and query the sets .
- •
If there exists for some such that and then reject.
The tester makes queries, where , and hence , which is also a bound on the number of queries by the tester.
Obviously if the function is subadditive then the tester accepts. Now we will show if the function is -far from subadditive then tester rejects with high probability.
We first give the characterization for partial function extension similar to claim 1 for general subadditive functions.
Lemma 6**.**
The partial function is extendible to a subadditive function (not necessarily monotone) iff for all such that .
A set is called bad if it causes the tester to reject. The set of bad sets consists of such that there exists for some such that and .
We show that removing all sets not in , as well as the bad sets, gives us a partial function that can be extended to subadditive function. Since the function is -far and is large by our choice of , therefore there must be many bad sets.
Lemma 7**.**
The partial function is extendible to a subadditive function.
Proof.
Suppose the partial function is not extendible. Let be the defined sets in . Then by Lemma 6, there will exist such that and . This implies which is a contradiction. ∎
Theorem 6**.**
If is -far from subadditive functions then the above tester rejects with constant probability.
Proof.
Let and . Since the partial function is extendible, (since is -far). Note that . Hence again using Chernoff bound and the value of , we have . Therefore in a single iteration our tester will pick a bad set with probability. Hence after iterations, the tester will pick a bad set with constant probability. ∎
3 Submodular functions
Submodular functions are perhaps the the most important functions in combinatorial optimization. They capture diminishing marginal returns for set functions, which is satisfied in many practical applications as well as theoretical constructs, and can be viewed as the discrete analog of concave functions. Coverage functions, matroid rank functions, and many others are special cases of submodular functions. A function is submodular if for all . As before, is the set of points in the given partial functions , and . These are called defined and undefined points respectively. The lattice closure is the minimal set that contains and is closed under union and intersection. For a family of set , we say a function is submodular on if for all sets , so that , are also in , .
Seshadri and Vondrak [31] study property testing of submodular functions and give the first subexponential time non-adaptive tester for submodularity. They also introduce the problem of Submodular Extension — the problem we study — to a submodular function, and note its usefulness in analyzing property testing algorithms. They give a partial function that is defined on an exponential number of points and is not extendible to a submodular function. However, if any point is removed from , then it can be extended to a submodular function, indicating the difficulty in designing a tester for submodularity.
Seshadri and Vondrak introduce and use a combinatorial structure called a path certificate, the existence of which certifies that a given partial function cannot be extended to a submodular function. Our results are instead based on a structure called a square certificate. A square certificate is multiset of tuples with , with additional constraints on the multiset. Since the union and intersection of sets can be seen as bitwise AND and OR of the characteristic vectors of the sets, our analysis proceeds by viewing a square certificate as special monotone Boolean circuit. Since submodular functions are defined using union and intersection, we believe that square certificates are more natural than path certificates. It is easy to show that a square certificate exists iff the given partial function is not extendible. Our main technical result is that if a partial function is not extendible, then there exists a square certificate where the sets in every square are in the lattice closure of . We use this to obtain a number of results on the extendibility of a partial function.
Theorem 7**.**
If the sets in form an antichain,333A family of sets is an antichain if no set in the family is contained in another set. then the partial function can always be extended to a submodular function. Hence, submodular functions cannot be PMAC-learned. 2. 2.
Let be the sets obtained by the union and intersection of sets in , that are also both contained in and contained by sets in . If the partial function can be extended to a submodular function on , then it can be extended to a submodular function on . Hence, Submodular Approximate Extension (and thus Extension) can be solved in time.
Let be the maximum difference in the size of two sets . Then an upper bound on the size of is . Thus if all sets are roughly of the same size, Approximate Extension can be solved in polynomial time. Further, if is a constant then is a constant and hence Approximate Extension can be solved in polynomial time. For the proof of the theorem, the second result is in particular very non-trivial and require insights into the structure of square certificates. We start by formally defining square certificates.
Square Certificates. A square tuple is a triple where . The sets and are called middle points, is the top point and is the bottom point of this square tuple. Sets are said to be part of this square tuple. Given a multiset of square tuples, for a set , define to be the number of square tuples with as a middle point, and to be the number of square tuples with as a top or bottom point. We say a set is an input set if , an intermediate set if , and an output set if . A square certificate is a multiset of square tuples with the following properties:
- (P1)
If is an input or an output set, i.e., then must be in . 2. (P2)
.
A set is involved if it is a part of some square triple in the square certificate. By definition, a partial function can be extended iff the following linear program (with variables for all ) is feasible:
[TABLE]
A square certificate can be seen as a dual solution obtained from Farkas’ lemma.
Lemma 8**.**
A partial function is extendible iff there does not exist a square certificate.
Proof.
A partial function can be extended iff the following linear program (with variables for all ) is feasible:
[TABLE]
Using Farkas’ lemma, the following linear program with variables for all must then be infeasible:
[TABLE]
We will show that there exists a square certificate iff this dual linear program is feasible. Firstly, note that if the dual is feasible then it has a rational solution, and any rational solution to the dual can be converted into an integral solution by multiplying by the product of the denominators. Now given a square certificate, we identify each variable in the dual with the square tuple , and set if this square tuple appears times in the square certificate. Then for a set , , and . The first constraint then says that if is not defined, then must appear an equal number of times as a middle point, and as a top or bottom point. That is, must be an intermediate set. The second constraint in the linear program corresponds exactly to property (P2). Thus, a square certificate gives us a feasible dual solution. The converse can be proved with the same construction: given an integral dual solution, we create a square certificate by including a square tuple times. The first and second constraints in the linear program give us properties (P1) and (P2) of the square certificate exactly. ∎
Lemma 9**.**
If there is a square certificate for a partial function, then for any extension to the hypercube, there is a square tuple in the square certificate with .
Our main technical result is the following lemma.
Lemma 10**.**
If there is a square certificate for a partial function, then there is a square certificate where, if is an involved set, then (i) there exist , so that , and (ii) .
We will develop the proof of the second part of the lemma in the remainder of the paper. The first part, however, is easily seen. Let be an involved set. Then either by property (P1) , in which case , or is undefined and hence is an intermediate set. In the latter case, , and hence is a middle set in some square . If is defined, then . Otherwise, is an intermediate set, and we can continue in this way until we find a defined set that contains . Similarly, considering the bottom point of the square, we can find a defined set that is contained by .
Proof of Theorem 7 (assuming Lemma 10)..
For the first part of the theorem, if the partial function cannot be extended, there must exist a square certificate. In particular, from (P2), there must exist with . Thus, must be part of at least one square tuple that is not trivial, i.e., (otherwise all four sets are just equal to , and this square contributes to and equally). Then , and either , or (or both). In the former case, by Lemma 10, there must exist with , or in the latter case, there must exist with . However, since is an antichain, no such sets , can exist.
To prove that submodular functions cannot be learned, consider the family of sets of size , and note that . Since any partial function with sets in and arbitrary values for the sets is extendible to a submodular function, submodular functions can not be learned by any factor (by Lemma 2).
For the second part of the theorem, let be a submodular extension of the partial function to . Then for any points so that are also in , . Suppose for a contradiction that the partial function cannot be extended to a submodular function on the entire domain. Then by Lemma 8 and Lemma 10, there is a square certificate where all involved sets are in . But then we have a square certificate and an extension for which for all square tuples in the square certificate. This gives us a contradiction, since by Lemma 9 the two cannot coexist. Thus, it is necessary and sufficient to find a submodular extension to points in . Therefore, Approximate Extension can be solved by writing a linear program with the variable for each set and objective of minimizing subject to submodularity constraints on the sets for , and the constraints for each set . This gives an algorithm with the desired running time. ∎
Given a square certificate, we need to construct another square certificate where all involved sets are in the lattice closure of , i.e., are obtained by the union and intersection of sets in . In particular, this means that all intermediate sets must be in , since by property (P1) input and output sets are already in . Our insight is to think of the union and intersection of two sets as the bitwise OR and AND of the characteristic vectors respectively. Given a square certificate, we will construct a boolean circuit with certain properties, which we call a boolean certificate. Similar to square certificates, boolean certificates are also necessary and sufficient conditions for a partial function to not be extendible to a submodular function.
As background for defining boolean certificates, we first define boolean circuits. Formally, a boolean circuit is a directed graph with three types of nodes — input nodes IP with no incoming edges, intermediate nodes IM with incoming and outgoing edges, and output nodes OP with no outgoing edges. Intermediate and output nodes are labelled with logic gates and perform the labelled boolean operation. The computed value at an intermediate node is fed to other intermediate or output nodes, and the computed value at the output nodes are the output of the circuit. The fan-in of a gate is its indegree, which is the number of arguments for the boolean operation. Throughout the paper, we will consider only AND and OR gates with fan-in 2. An assignment is called satisfying if the boolean operation operation at each gate is satisfied by the assignment. That is, if is an AND gate with inputs and , then , and if is an OR gate.
The boolean circuits we consider are constructed from square certificates and can contain cycles. In a cyclic boolean circuit, given values to the input nodes, there may be many satisfying assignments (see figure 1). The technical difficulty in proving second part of Lemma 10 is due to the presence of cycles in the constructed boolean circuit (the square certificate may correspondingly have analogous “cycles”). If the constructed boolean circuit does not have cycles, then the proof of the lemma follows trivially.
We now define the notion of computation in cyclic boolean circuits. We use G to denote the set of all gates.We assume an ordering of the gates, with the input gates appearing first in this order, followed by the output gates and then the intermediate gates. An assignment to the input gates is denoted . An assignment to all gates will be denoted by . We use , , , to refer to the input gates.
Recall that an assignment is called satisfying if the boolean operation operation at each gate is satisfied by the assignment. Given an assignment to input gates, we say a gate computes a value if the value at is in every satisfying assignment to the gates, with the input gates as specified. We now formally define the notion of a gate being fixed to a value . The equivalence of the two definitions is shown in Lemma 12. It is possible that a gate does not compute any value.
Definition 2**.**
Given an assignment to the input gates, a gate is fixed to value if there exists a subgraph of the boolean circuit with values for all gates , so that:
- •
* is a rooted tree with all edges directed towards the root.*
- •
* is the root, and all leaves are input nodes.*
- •
, and for each input node that is a leaf.
- •
If gate has only one child in the tree then either (i) is an AND gate and or (ii) is an OR gate and .
- •
*If gate has two children , in the tree then either (i) is an AND gate and or (ii) is an OR gate and . *
We call the above rooted tree and associated values a proof of fixed to for the given assignment to the input gates. We make a few observations regarding the definition. First, since in the boolean circuit the fan-in for each gate is 2, the rooted tree is a binary tree. Second, since the value of every gate in the rooted tree must be the same as its children, in fact every gate in the rooted tree must have the same value.
Note that a priori, a gate may be fixed to different values for the same input. We now show that in fact this cannot happen.
Lemma 11**.**
If a gate is fixed to both and for a particular assignment to the input gates, then .
Define to be the set of all gates that gets fixed by the assignment to the input gates. We say such gates are fixed, with the inputs clear from the context. If is fixed, then Lemma 11 allows us to define as the value that gate is fixed to.
Lemma 12 says that the value of fixed gates should be consistent with any satisfying assignment and Lemma 13 says that is monotone for fixed gate as a function of . These lemmas are intuitive and are proven by induction on the height of the rooted tree corresponding to any proof that fixes the gate .
Lemma 12**.**
Let be a satisfying assignment to a boolean circuit, and gate be fixed by the assignment to the input gates. Then is fixed to .
For two vectors , we use the standard notation that if this is true of each component.
Lemma 13**.**
Let be two assignments to the input gates, and consider a gate . If is fixed to [math] by the assignment , then this is true for as well. Conversely, if is fixed to by the assignment , then this is true for as well.
Boolean Certificates.
Given a partial function , a boolean certificate, similar to a square certificate, characterizes partial functions that cannot be extended to a submodular function. A boolean certificate consists of the following two parts.
A boolean circuit with AND and OR gates, that satisfies two conditions: (i) Input gates and intermediate gates have outdegree two, and output and intermediate gates have indegree two. (ii) If , are inputs to an AND gate, then these are inputs to an OR gate as well. As before, IP, OP, and IM are the set of input, output, and intermediate gates, and G is the set of all gates. 2. 2.
A function called the creator function, that satisfies three conditions: (i) for all , i.e., all input and output gates map to defined sets. (ii) For each , the assignment (which assigns to the gate if the th element is present in , and [math] otherwise) is a satisfying assignment. This gives satisfying assigments to the boolean circuit. (iii) For , let be the number of input gates that maps to , and be the number of output gates that maps to . Then .
Figure 2 shows an example of a square certificate and the corresponding boolean certificate.
Lemma 14**.**
Given a partial function, there exists a square certificate iff there exists a boolean certificate. If there is a boolean certificate with creator function then there is a square certificate with as the family of involved sets.
Proof.
Given a square certificate, we first show how to obtain a boolean certificate. The construction proceeds in two steps. In the first step, for each square tuple in the square certificate, we create four gates , , , , and set , , , and . Gates , are input gates, is an output OR gate, and is an output AND gate. We add edges from both and to and . We do this for each square tuple in the square certificate. Thus, if there are square tuples, we obtain a boolean circuit with components and gates. We call this the primary boolean circuit, and note its properties below.
The circuit only has input gates and output gates. Each input gate has fan-out 2, and each output gate has fan-in 2. Further, if , are inputs to an AND gate, these are inputs to an OR gate as well. 2. 2.
Consider an involved set in the square certificate. For every square tuple that has as a top or bottom point, there is a component in the circuit that has as an output gate, and . For every square tuple that has is a middle point, there is a square circuit that has as an input gate, and . The converse also holds. Thus, if and are the number of input and output gates respectively that the creator function maps to , then . Hence, . 3. 3.
For each , the assignment is a satisfying assignment. This can be verified by checking each component. 4. 4.
However, may be an input or output gate, but an intermediate set, and hence not in .
Hence, the primary boolean circuit (with the creator function) satisfies all the conditions to be a boolean certificate, except that an input or output gate may be mapped to a set not in . In the second step, we fix this. Now we take two gates , of which one (say ) is an input gate, one () is an output gate, and . We replace these two by a single intermediate gate that takes inputs as did , provides output as did , has logical operation (AND or OR) as did , and set . Note that has fan-in and fan-out 2. We do this for all pairs of gates that satisfy these conditions, one pair at a time. We call the circuit thus obtained the secondary boolean circuit. It is easy to verify by induction on the number of gates replaced that the properties of a boolean certificate satisfied earlier are satisfied now as well. Finally, consider a gate , i.e., an input gate in the secondary boolean circuit. Let . Since has not been replaced, there is no output gate that the creator function maps to set . Then since , it must be true that , and hence as required. Similarly, if is an output gate in the secondary boolean circuit, then . This concludes the construction of the boolean certificate.
Given a boolean certificate, we now want to construct a square certificate so that the involved sets are exactly . For this, we first convert the boolean circuit to a primary boolean circuit by reversing the earlier procedure. That is, if is an intermediate gate (and hence has two inputs and two outputs), we replace it by an input gate that provides output as did , and an output gate that takes inputs as did , with , and the output gate has the same logical operation as . Continuing this process, we obtain a primary boolean circuit consisting only of input and output gates. This maintains the property that if , are inputs to an AND gate, then they are inputs to an OR gate as well. Thus, this primary boolean circuits must consist of components with four gates each: inputs and , an output OR gate , and an output AND gate . We now construct our square certificate by including a square for each such component in the boolean circuit. It is easy to see that for any input or output set in the constructed square certificate, . Similarly, for any input or output gate in boolean certificate with , we have . Thus . Therefore, the multiset of squares thus obtained is indeed a square certificate. Further, if is an involved set, then for some gate in the boolean certificate, as required. ∎
Recall that in a cyclic boolean circuit, a gate may not get fixed to some value. The next lemma crucially shows that any assignment to the input gates of a boolean circuit (of a boolean certificate) always fixes the values at the output gates, and hence for any satisfying assignments , to the gates that assign the same values to the input sets, the values assigned to the output gates must be the same as well.
The proof requires the special structure of the boolean circuit in our boolean certificate, namely that if , are inputs to an AND gate, they are also inputs to an OR gate. First, since intermediate gates in the circuit have fan-in = fan-out, and fan-out for input gates = fan-in for output gates = 2, . We will show that for any assignment to the input gates. That is, all the output gates get fixed for any assignment to the input gates.
We show this by giving an algorithm (Algorithm 1) that takes as input an assignment to the input gates, and assigns value [math] or to some subset of gates by setting for gates . We will prove that all gates in are in and that .
Initially, . The algorithm maintains the invariant that for each gate , there is a value and a proof in the sense of Definition 2 that using gates that were assigned values before . It starts with a gate as the current gate. If the current gate is not an output gate, then it is an input to an OR gate (say ) and an AND gate (say ). Further, let be the other input to these gates (such a gate must exist, by property of boolean certificates). Then it uses properties from Definition 2 to show that at least one of , has not been assigned a value, and can be chosen as the current gate and assigned a value so that the invariant is maintained. If the (new) current gate is not an output gate, the algorithm continues by considering the AND and OR gates that the current gate is input to.
Lemma 15**.**
Let be the set of all gates such that is set by the Fixing Algorithm. Then , and for all gates , is in and .
Proof.
The algorithm starts from gate , and picks a sequence of gates and assigns them a value. Assume first that if we enter the while loop, at least one of the conditions from lines 6, 8, 10, or 12 must be true. Then starting with an input gate (i.e., in the th iteration of the for loop), the algorithm assigns values to distinct gates that did not earlier have a value, until it assigns a value to an output gate. Then for each , a distinct output gate is assigned a value, and since , .
We now show one of the conditions from lines 6, 8, 10, or 12 must be true at any step, and that for all gates in , we have a proof of being fixed to . We claim that the following statement is an invariant: if gate is assigned a value, then has a proof of being fixed to this value consisting only of gates that were assigned values before . To show this, note that after the execution of , the invariant will continue to hold if it was true previously, as input gates are trivially fixed to their value. We show that if we enter the while loop with the invariant condition then one of the conditions in lines 6, 8, 10, or 12 must be true and the invariant must hold true after execution of lines to . This will complete the proof. Let be the current gate . Then must be assigned a value in the previous iteration of the while loop. Since we now enter the while loop, is not an output gate. Let , be the OR and AND gates for which is an input, and let be the other input to these gates. Suppose for a contradiction that and . By the invariant condition, both and have a proof of being fixed that does not use gate . But is an OR gate, and can have a proof without only if is fixed to , and can have proof without only if is fixed to [math]. By Lemma 11, this is a contradiction. Thus one of the conditions 6, 8, 10, or 12 must hold true.
We now show that the invariant condition holds after execution of one of the lines 6, 8, 10, 12. Suppose condition holds (the case when condition 8 holds is very similar). Then is assigned a value in the current while loop. Since is an OR gate and has a value assigned previously, by the invariant condition must previously (before was assigned its value) been assigned . We then get the proof of being fixed to by making the root of proofs of and the children of when , or the root of the proof of as the child of when .
Suppose condition holds (the case when condition 12 holds is very similar). Then gate is assigned a value in this while loop. Since is an AND gate, we get a proof of its being fixed to by making the root of proof of the only child of . ∎
A function is a monotone function if for any two assignments and such that for all , . It is well known that a function is monotone iff for some for all , and hence can be written as the OR of the AND of the subsets with .
Given a boolean certificate, we associate with each gate a function as follows. if the assignment to the input gates fixes gate to , else . The next two lemmas show that for each gate , the function is a monotone function, and that for any , the assignment is a satisfying assignment. These allow us to complete the proof of Lemma 10.
Lemma 16**.**
The function is a monotone for each intermediate and output gate .
Lemma 17**.**
For any assignment to the input gates, the extension given by for all for all is a satisfying assignment.
Proof of Lemma 10..
We prove the second property, since the first was proved earlier. If there is a square certificate for a partial function, then there exists a boolean certificate. From the definition of boolean certificates, for each , the assignment to all gates in which gate is assigned is a satisfying assignment. From Lemmas 12 and 15, we have for all output gates and (Recall that ). Consider the assignment , for each , given by for all and for all . Hence, for all and . Consider the function given by . Note that for all and hence satisfies the second property of boolean certificate. Also is a satisfying assignment for each by Lemma 17. Therefore, there exists a boolean certificate with creator function instead of . By Lemma 14, there exists a square certificate with set of involved sets as . Since is a monotone function of the values at the input gates, we have for all , for some . Therefore, for all , can be obtained by union and intersection of sets for , i.e., of input sets. ∎
4 Convex functions
A function is convex () iff for all and ,
[TABLE]
The problem of extending a convex function is extensively studied in convex analysis. We briefly discuss the work of Yan [33] and Dragomirescu and Ivan [21], focusing on the presentation by Yan. Given a partial function that is convex on a non-convex domain , Yan considers extending to a convex function both within and outside the convex hull . For a point , he defines function using the well-known convex roof construction:
[TABLE]
The infimum thus runs over all possible convex combinations of points that evaluate to . If the function is bounded below or the domain contains a point in the relative interior of , then is a convex extension [33]. For a point outside the convex hull of , assuming now that is defined and convex inside , Yan defines
[TABLE]
The function is a convex extension iff satisfies the Lipschitz property444A function has the Lipschitz property on if there exists a constant such that on [33]. The extensions and are optimal in the following sense. Within the convex hull, is maximal, i.e., for any convex function that extends to the convex hull of , for any point inside the convex hull, . This is because for optimal in the definition of , . Similarly, outside the convex hull, is minimal, i.e., for any convex function that extends (assuming is defined and convex on the convex hull) to , for any point outside the convex hull. This again can be easily seen as for optimal and in the definition of , .
Our results.
The set of points in the given partial function corresponds to the domain described above. We assume for simplicity that has non-zero volume. We show that Approximate Convex Extension (and hence Convex Extension) can be solved in polynomial time. We also give a unified construction for a convex function that equals inside the convex hull and outside, and show that if there exists a convex extension of , then our construction is a convex extension. However, we show that evaluating for a point is strongly NP-hard. Our results hold for concave functions as well, using the fact that is a convex function iff is concave. Recall that is our partial function, with each and
We now give the construction of our convex extension . Since our partial function is defined on a finite set of points , the convex roof function from (4) is the optimal value of the linear program Convex-P:
[TABLE]
[TABLE]
The linear program Convex-D is dual of Convex-P and has variables . Now let denote the polyhedron of dual feasible solutions. By our assumption that has non-zero volume, has at least one vertex (shown in Appendix C). Let be the set of vertices of the polyhedron , and let for all , with and . We define as the maximum over all vertices of , of the objective of Convex-D:
[TABLE]
Since is the maximum of linear functions, it is convex. Also, if . This is because Convex-P is feasible for , and hence the dual is bounded and there is an optimal vertex.
Lemma 18**.**
A partial function can be extended to a convex function on iff for all .
Proof.
Since is convex so if for all then clearly is a convex extension to .
Now suppose that is not equal to for some . For any , it is clear that (if we set , then the objective value is ). Thus . If there exists a convex extension then , which is a contradiction. The first inequality is due to the earlier observation that if is any convex extension of on then for all . ∎
Theorem 8**.**
Convex Extension is in P, and if is extendible then is an extension.
Proof.
By Lemma 18, determining the existence of extension boils down to checking for all . Since can be efficiently computed by solving Convex-P, Convex Extension is in P. If is extendible then we have by Lemma 18, for all . Hence, is an extension. ∎
Theorem 9**.**
Approximate Convex Extension is in P.
We have shown that is the maximal extension inside the . Interestingly, is also the minimal extension outside the convex hull, where is defined as follows.
[TABLE]
Lemma 19**.**
For any partial function , we have for all outside the convex hull .
The function is thus a natural and canonical convex extension of a given partial function. In fact, we know of no other convex extensions that are widely studied. It is then natural to ask, given a partial function, to evaluate the convex extension at a given point. Surprisingly, we show that this problem is strongly NP-hard, and hence this canonical extension cannot be efficiently evaluated at a given point, unless P = NP. Since is the maximum over vertices of the polyhedron , one may wonder why it can not be computed by solving the linear program Convex-D. This is because, for outside , Convex-P is infeasible and hence Convex-D is unbounded. So the optimal value is , whereas is a finite.
We show a reduction from the Optimal Vertex problem, which is strongly NP-hard [25]. In this problem, we are given an rational matrix , rational -vector , -vector and a rational number . Then the objective is to decide if there exists a vertex of the polyhedron with .
In the proof of hardness of Optimal Vertex in [25], the instance and satisfy the property that the polyhedron has at least one vertex. We will use this property in our proof. Before we give the reduction, we state a property of vertices of polyhedra.
Lemma 20** (Lemma 8.2 of [10]555The first part of this lemma is 8.2(a) of [10] whereas the second part follows from the proof of 8.2(a)).**
Suppose is a matrix and is an -vector. Also assume all the entries of and are integers and is the largest absolute value of the entries in and . Then every extreme point of the polyhedron satisfies :
** 2. 2.
If then
Theorem 10**.**
Given a partial function , a point and rational , it is strongly NP-hard to determine if . However, if the dimension is constant, can be computed in polynomial time.
Proof.
First note that the number of vertices of the dual polyhedron is bounded by , and can be enumerated [18]. If is constant, this gives a polynomial time algorithm to compute .
Let the instance of Optimal Vertex have parameters , , and , and as noted we assume that the polyhedron has at least one vertex. We will also assume wlog that all entries in and are integers. Also let be the largest absolute value among all entries of and . Let be the th row of matrix .
Let . We construct the instance for our problem as follows. We set the partial function , and , where (we need ). Note that the size of is polynomial in the sizes of and . Our claim is that where maximum is taken over all vertices of polyhedron . Thus iff there exists a vertex such that . To prove the claim, we observe that the polyhedron associated with the definition of is (where is an -vector with each entry ). Let be the polyhedron and be the set all vertices of ( is non-empty). It is easy to see that is a vertex of iff is a vertex of . Therefore, the polyhedron also has at least one vertex. Let be the set of all vertices of . We denote any vertex in as . Since is non-empty, we have from (6). If we prove that this maximum can only be attained at the vertices with then we will be done. Note that any vertex has the property (because of first part of Lemma 20). Consider a vertex with (recall that ). For such a vertex (Lemma 20). For our choice of , . Hence, , completing the proof of the claim. ∎
Conclusion. Our work is the first to formally study the complexity of partial function extension. We show that results can often be counterintuitive, and shed new light on problems previously studied. Our work also gives a number of new results for learning and property testing. While there are clearly a large number of interesting open problems, one we particularly would like to highlight is the basic question of membership in NP (or coNP) of partial function extension. We are able to resolve this for XOS, subadditive, and convex functions, but leave it open for submodular functions. Resolving this problem may lead to further insights on the structure of these functions.
Appendix A Subadditive and XOS functions
Proof of Theorem 1
Let the given partial function be . We claim that the optimal value of for Aproximate XOS Extension (say ) is equal to the optimal value of in the following linear program (say ), with variables and for all and . Since the linear program can be solved in polynomial time, this claim implies that Approximate XOS Extension can be efficiently solved.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Let the XOS function corresponding to the optimal solution for Approximate XOS Extension be given by linear functions for some , and let the linear functions be indexed so that for (the same linear function can appear with multiple indices, i.e., for ). Then for all , and for all . It is clear that are feasible for the linear program, hence . By definition, , since the linear program produces an XOS function that has value within factor at each for all . Hence .
Upper bound for Approximate Extension
We now show the upper bound for the Approximate Extension. Let and be two classes of functions. We say that -approximates if for all functions in , there exists a function in such that for all . We first prove the following lemma.
Lemma 21**.**
Let and be two classes of functions so that -approximates and -approximates . If there is a -approximation algorithm for Approximate Extension for then there is an - approximation algorithm for Approximate Extension for .
Proof.
For a given instance of partial function extension, let and be the optimal value of in the Approximate Extension problem for and respectively. Let be the -approximation algorithm for . A -approximation algorithm for Approximate Extension for is as follows: given any partial function , return where is the value returned by algorithm on . We have as is an -approximation algorithm. Since -approximates , we have . As so we have . Also -approximates so we have . Then . Hence . This proves our result. ∎
Recall that XOS functions are a subclass of subadditive functions. We will use the following result:
Theorem 11** ([11, 20]).**
For any subadditive function , there exists an XOS function such for all .
From the above results and Lemma 21, the upper bound for Theorem 2 follows, with and .
Proof of Lemma 2
Consider the distribution that assigns probability mass uniformly to and [math] elsewhere. We restrict the target function to the family . Suppose there is an algorithm that PMAC-learns with approximation factor . Let be an arbitrary function in . Let the input to the algorithm be where is (let and be constant). Let . By our assumption, given any values in at the sets , there is a function in that takes those values. Let and be the value returned by the algorithm at . If then for all target function iff . Since is polynomial (whereas is superpolynomial), the algorithm then returns a value within factor with only small probability. Hence must be at least . Also above argument and hence lower bound holds even if the algorithm knows the distribution , allowed unbounded computation and choose samples adaptively.
Proof of Lemma 6
One direction is trivial. If there exists such that then partial function is not extendible. Now assume this is not the case. Let be the union-closure of . We now define which is an extension of to . If then . If (and ) then is the minimum value of over all families of sets such that each , and . Let be the maximum value of on . We define an extension of to by assigning value to each set not in . Let this extension be . We claim that is subadditive.
Note that is the maximum value of . Let and be any two sets. If any of or is not in then is at least and thus . Therefore, we assume both and are in which implies is also in . Let be the union of () and be the union of (). Therefore, is union of . If is in then by assumption and otherwise by definition of , we have .
Appendix B Submodular functions
Proof of Lemma 9.
Assume for a contradiction that there exists a square certificate and an extension without a square tuple as described in the lemma. Summing the inequalities for each square tuple in the square certificate, we observe that all the intermediate sets cancel out, since they appear an equal number of times on the left and right hand sides. We get which contradicts property (P2) of the square certificate.
Proof of Lemma 11.
If gate is fixed to both and , there must be two proofs with values , and with values as described above. In the rooted tree , let be a gate at maximum distance from the root that has different values in the two proofs (such a gate must exist, since is such a gate). Gate cannot be a leaf, since all leaves have the same values by definition. Assume that is an AND gate; a similar proof holds if is an OR gate. If has one child in both proofs, since is an AND gate, by definition it must have value [math] in both proofs. Similarly if has two children in both proofs, it must have value in both proofs. Now suppose without loss of generality that has one child (say ) in , and two children in . Then . Since the fan-in for each gate is 2 in the boolean circuit, must be a child of in as well, and by assumption, . But this gives us a contradiction, since if is an AND gate with two children in any proof, then both children must have value 1.
Proof of Lemma 12.
Let and be a proof for gate for the given assignment to the input gates. Let be the gate at maximum distance from the root for which . The children of by assumption have the same value as in the satisfying assignment , and considering the cases in the construction of the proof gives us a contradiction.
Proof of Lemma 13.
Let and be a proof that fixes for the assignment . Then as observed previously, every gate in the proof has value 0, including the input gates as the leaves. Since , the values at the leaves of the rooted tree remain unchanged, and the proof and is a proof with inputs as well, that fixes gate to [math]. The proof of the converse holds similarly.
Proof of Lemma 16.
Consider an assignment in which an input is set to [math] and is for some . If the gate is fixed to by the assignment then it will continue to be fixed to after setting to 1 by Lemma 13. Suppose instead that the gate is not fixed by the initial assignment with . In this case also, after setting , gate cannot get fixed to [math] by Lemma 13. So, flipping any input from [math] to can not make the function evaluate from to [math]. Hence, is monotone.
Proof of Lemma 17.
We will show that the boolean operation at any gate is not violated by this assignment. Let be an AND gate with incoming edges from gates and gate (the case when is an OR gate is similar). Suppose first that is fixed by the assignment to the input gates. Since is fixed, by Definition 2, either all of , and are fixed to or and one of and is fixed to [math]. In either case, the boolean operation is satisfied. Now assume that is not fixed and hence it is assigned . For a violation, either or must be assigned [math]. Say is assigned 0. But then is fixed, and since is an AND gate, must also be fixed to 0, giving a contradiction.
Appendix C Convex functions
Proof that has at least one vertex
Claim 1**.**
If has non-zero volume then the polyhedron has at least one vertex.
Proof.
Since has non-zero volume, the ’s do not lie on a hyperplane. Formally, this means that satisfies the inequalities for all iff . This assumption implies that there can not exist a line666A polyhedron contains a line if there exists a vector and a non-zero vector such that the vector for all in . This is because, for any polyhedron to contain a line, there must exists a point and a non-zero vector such that for all which is possible only if . Since does not contain a line, it must have at least one vertex (Theorem 2.6 of [10]). ∎
Proof of Theorem 9
We will show that for any , it can be efficiently determined whether there exists a convex function such that for all . Then by binary search Approximate Extension can be solved efficiently. By setting and in the following Theorem, we can determine the same.
Lemma 22**.**
Given a set and pairs () associated with each , it can be efficiently determined if there exists a convex function such that .
Proof.
Let be the convex roof function corresponding to the partial function . Our claim is that there exists a convex function such that iff . Suppose for all . Consider the function defined by (6). We know that is convex and for all . Thus is the required convex function. For the other direction, suppose for a contradiction that for some and there exists a required convex function . Therefore, we have as for all (since the partial function is ). Now consider a partial function and its convex roof function . Since is clearly extendible, we have for all (Lemma 18). Further from the definition of and since for all , it follows that for all . Therefore we have . This is a contradiction since as noted, for all .
This proves the theorem as the conditions for all can be checked efficiently. ∎
Proof of Lemma 19
Fix outside and inside . Let and be such that . In the proof we vary , and assume that changes accordingly so that . Since is convex, the value of increases as gets close to . Therefore, where
[TABLE]
We know that (where is the vertex of the polyhedron for all , as defined in main section). Let be the index of the vertex that maximizes . Therefore, at and in vicinity of in line segment joining and . Therefore,
[TABLE]
Since so we have for all . Now
[TABLE]
Thus for any , is equal to where is the index of the vertex that maximizes . This implies that is at most .
Now we will show . Let be the index of the vertex that maximizes . Consider the vertex . By definition of a vertex, there exists such that for all . We claim that . First note that the left side of the above inequality is a finite quantity. Also for (sufficiently small), the point is in for any . If then the right side can be made arbritray large by setting . For the case , at least one of the ’s must be non zero. Therefore, by setting and if (and if ) the quantity can be made arbitrary large. Therefore, we assume . Now the above inequality can be written as for all . Let . The vector must be in otherwise Convex-D should be unbounded (and hence right side should be unbounded), which is a contradiction. Now as . Since for all , the function is given by in a sufficiently small neighbourhood of . Therefore, as before and hence .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. N. Afriat. The construction of utility functions from expenditure data. International Economic Review , 8(1):67–77, feb 1967.
- 2[2] Ashwinkumar Badanidiyuru, Shahar Dobzinski, Hu Fu, Robert Kleinberg, Noam Nisan, and Tim Roughgarden. Sketching valuation functions. In Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms , pages 1025–1035. Society for Industrial and Applied Mathematics, 2012.
- 3[3] Maria-Florina Balcan, Florin Constantin, Satoru Iwata, and Lei Wang. Learning valuation functions. In COLT 2012 - The 25th Annual Conference on Learning Theory, June 25-27, 2012, Edinburgh, Scotland , pages 4.1–4.24, 2012.
- 4[4] Maria Florina Balcan, Florin Constantin, Satoru Iwata, and Lei Wang. Learning valuation functions. In Conference on Learning Theory , pages 4–1, 2012.
- 5[5] Maria-Florina Balcan, Amit Daniely, Ruta Mehta, Ruth Urner, and Vijay V Vazirani. Learning economic parameters from revealed preferences. In International Conference on Web and Internet Economics , pages 338–353. Springer, 2014.
- 6[6] Maria-Florina Balcan and Nicholas J. A. Harvey. Learning submodular functions. In Proceedings of the 43rd ACM Symposium on Theory of Computing, STOC 2011, San Jose, CA, USA, 6-8 June 2011 , pages 793–802, 2011.
- 7[7] Maria-Florina Balcan and Nicholas JA Harvey. Learning submodular functions. In Proceedings of the forty-third annual ACM symposium on Theory of computing , pages 793–802. ACM, 2011.
- 8[8] Xiaohui Bei, Wei Chen, Jugal Garg, Martin Hoefer, and Xiaoming Sun. Learning market parameters using aggregate demand queries. In AAAI , pages 411–417, 2016.
