Constraint Optimization over Semirings

A. Pavan; Kuldeep S. Meel; N. V. Vinodchandran; Arnab Bhattacharyya

arXiv:2302.12937·cs.LO·February 28, 2023

TL;DR

This paper studies the complexity of constraint optimization problems over various semirings, generalizing satisfiability, and provides complexity bounds, algorithms, and inapproximability results for different semiring-based formulas.

Contribution

It introduces the first complexity analysis of constraint optimization over semirings, including bounds, algorithms, and hardness results for various semiring interpretations.

Findings

01

optConfVal and optConf are in FP^NP for propositional formulas in negation normal form.

02

For CNF formulas with m clauses, optConfVal is at most 1/4^{m-r}, where r is the maximum number of simultaneously satisfiable clauses.

03

optConfVal for CNF formulas is hard for FP^NP[log], with polynomial-time approximation algorithms and inapproximability results.

Abstract

Interpretations of logical formulas over semirings have applications in various areas of computer science including logic, AI, databases, and security. Such interpretations provide richer information beyond the truth or falsity of a statement. Examples of such semirings include the Viterbi semiring, the min-max or access control semiring, the tropical semiring, and the fuzzy semiring. The present work investigates the complexity of constraint optimization problems over semirings. The generic optimization problem we study is the following: Given a propositional formula $φ$ over $n$ variables and a semiring $(K, +, \cdot, 0, 1)$, find the maximum value over all possible interpretations of $φ$ over $K$. This can be seen as a generalization of the well-known satisfiability problem. A related problem is to find an interpretation that achieves the maximum value. In this work, we first focus on these…

Equations

$\mathsf{optSemVal}(\varphi)=\max_{\pi}\{\mathsf{Sem}(\varphi,\pi)\}$

$\prod_{i=1}^{n}x_{i}^{a_{i}}(1-x_{i})^{b_{i}}$

$\mathsf{Conf}(T,\pi)=p_{T}(\pi(x_{1}),\dots,\pi(x_{n}))$

$\mathsf{optConfVal}(\varphi)=\max_{T}\mathsf{optConfVal}(T)$

$L_{\it opt}=\{\langle\varphi,v\rangle\mid\mathsf{optConfVal}(\varphi)\geq v\}$

$L_{\it farey}=\{\langle N,u,v,z\rangle\mid\exists z^{\prime};\ u\leq zz^{\prime}\leq v\ \&\ zz^{\prime}\in{\mathcal{F}}_{N}\}$

$L_{\it pt}=\{\langle\varphi,v,z\rangle\mid\exists z^{\prime}:\ zz^{\prime}\text{ encodes a proof tree }T\text{ of }\varphi\ \&\ \mathsf{optConfVal}(T)=v\}$

$\ell_{C}=\operatorname{argmax}_{\ell\in C}\{\pi^{*}(\ell)\}$

$p_{x}=|\{C\mid C\text{ is neutral and }\ell_{C}=x\}|$

$q_{x}=|\{C\mid C\text{ is neutral and }\ell_{C}=\neg x\}|$

$a_{\ell}=|\{C\mid C\text{ is a low-clause and }\ell_{C}=\ell\}|$

$b_{\ell}=|\{C\mid C\text{ is a high-clause and }\ell_{C}=\neg\ell\}|$

$\mathsf{Conf}(\varphi,\pi)=\prod_{i}\mathsf{Conf}(\varphi_{\mid x_{i}},\pi)$

$\mathsf{Conf}(\varphi_{\mid x_{j}},\pi^{*})=\pi^{*}(\ell)^{a_{\ell}}\times(1-\pi^{*}(\ell))^{b_{\ell}}$

$\frac{\mathsf{Conf}(\varphi_{\mid x_{j}},\pi^{\prime})}{\mathsf{Conf}(\varphi_{\mid x_{j}},\pi^{*})}$

$\mathsf{Conf}(\varphi,\pi^{*})=\prod_{\ell}\left(\pi^{*}(\ell)^{a_{\ell}}\times(1-\pi^{*}(\ell))^{b_{\ell}}\right)\times\frac{1}{2^{N}}$

$\prod_{\ell}\pi^{*}(\ell)^{a_{\ell}}\times(1-\pi^{*}(\ell))^{b_{\ell}}$

$\mathsf{optConfVal}(\varphi)$

$\varphi^{\prime}=(C_{1}\vee y_{1})\wedge\dots\wedge(C_{m}\vee y_{m})\wedge\neg y_{1}\wedge\dots\wedge\neg y_{m}$

$\left(\frac{1}{2}\right)^{a}\times 1^{b}\times\left(\frac{1}{2}\right)^{a}\times 1^{b}>\pi(x_{i})^{a}\times(1-\pi(x_{i}))^{b}\times 1^{a+b}$

$\mathsf{Conf}(\varphi^{\prime},\pi)=1^{H}\times\left(\frac{1}{2}\right)^{N}\times\left(\frac{1}{2}\right)^{2Y}=\frac{1}{4^{N/2+Y}}=\frac{1}{4^{m-r}}$

$\log\mathbb{E}[\mathsf{Conf}(\varphi,\pi)]$

$=\sum_{i}\mathbb{E}[\log\max_{\ell\in C_{i}}\pi(\ell)]$

$=-\sum_{i}\int_{-\infty}^{0}\Pr[\log\max_{\ell\in C_{i}}\pi(\ell)\leq t]\,dt$

$=-\sum_{i}\int_{-\infty}^{0}\Pr[\max_{\ell\in C_{i}}\pi(\ell)\leq e^{t}]\,dt$

$=-\sum_{i}\int_{-\infty}^{0}e^{k_{i}t}\,dt=-\sum_{i}\frac{1}{k_{i}}$

$\mathbb{E}_{\pi\leftarrow_{U}[0,1]^{n}}[\log\mathsf{Conf}(\varphi,\pi)\mid\pi(x_{j})=\pi^{*}(x_{j})\ \forall j\leq i]\geq-\sum_{i}\frac{1}{k_{i}}$

$=-\int_{-\infty}^{0}\Pr_{\pi}[\log\max_{\ell\in C}\pi(\ell)\leq t\mid\pi_{<i}=\pi^{*}_{<i},\ \pi^{*}(x_{i})=p]\,dt$

$=-\int_{\log\max(\alpha,p)}^{0}\Pr_{\pi}[\log\max_{\ell\in C\cap\{x_{j},\bar{x}_{j}:j>i\}}\pi(\ell)\leq t]\,dt$

$=-\frac{1}{k^{\prime}}\left(1-\max(\alpha,p)^{k^{\prime}}\right)$

$\mathbb{E}_{\pi}[\log\mathsf{Conf}(\varphi,\pi)\mid\pi_{<i}=\pi^{*}_{<i},\ \pi^{*}(x_{i})=p]$

$\mathsf{optConf}(\varphi)<\frac{1}{4^{m-m(1-2^{-k}+\varepsilon)}}=\frac{1}{4^{m(2^{-k}-\varepsilon)}}$

Taxonomy

Topics: Logic, Reasoning, and Knowledge · Logic, programming, and type systems · Formal Methods in Verification

Full text

Constraint Optimization over Semirings

The authors decided to forgo the old convention of alphabetical ordering of authors in favor of a randomized ordering, denoted by ⓡ. The publicly verifiable record of the randomization is available at https://www.aeaweb.org/journals/policies/random-author-order/search. An abridged version of the paper appeared in AAAI 2023.

A. Pavan ⓡ

Iowa State University

Kuldeep S. Meel ⓡ

National University of Singapore, Singapore

N. V. Vinodchandran ⓡ

University of Nebraska-Lincoln

Arnab Bhattacharyya

National University of Singapore, Singapore

Abstract

Interpretations of logical formulas over semirings (other than the Boolean semiring) have applications in various areas of computer science including logic, AI, databases, and security. Such interpretations provide richer information beyond the truth or falsity of a statement. Examples of such semirings include Viterbi semiring, min-max or access control semiring, tropical semiring, and fuzzy semiring.

The present work investigates the complexity of constraint optimization problems over semirings. The generic optimization problem we study is the following: Given a propositional formula $\varphi$ over $n$ variables and a semiring $(K,+,\cdot,0,1)$, find the maximum value over all possible interpretations of $\varphi$ over $K$. This can be seen as a generalization of the well-known satisfiability problem (a propositional formula is satisfiable if and only if the maximum value over all interpretations/assignments over the Boolean semiring is 1). A related problem is to find an interpretation that achieves the maximum value. In this work, we first focus on these optimization problems over the Viterbi semiring, which we call $\mathsf{optConfVal}$ and $\mathsf{optConf}$.

We first show that for general propositional formulas in negation normal form, $\mathsf{optConfVal}$ and $\mathsf{optConf}$ are in ${\mathrm{FP}}^{\mathrm{NP}}$. We then investigate $\mathsf{optConf}$ when the input formula $\varphi$ is represented in conjunctive normal form. For CNF formulae, we first derive an upper bound on the value of $\mathsf{optConf}$ as a function of the maximum number of satisfiable clauses. In particular, we show that if $r$ is the maximum number of satisfiable clauses in a CNF formula with $m$ clauses, then its $\mathsf{optConf}$ value is at most $1/4^{m-r}$. Building on this, we establish that $\mathsf{optConf}$ for CNF formulae is hard for the complexity class ${\mathrm{FP}}^{\mathrm{NP}[\log]}$. We also design polynomial-time approximation algorithms and establish an inapproximability result for $\mathsf{optConfVal}$. We establish similar complexity results for these optimization problems over other semirings including the tropical, fuzzy, and access control semirings.

1 Introduction

Classically, propositional formulae are interpreted over the Boolean semiring $\mathbb{B}=(\{\mbox{\rm F},\mbox{\rm T}\},\vee,\wedge,{\mbox{\rm F}},{\mbox{\rm T}})$ which is the standard semantics for the logical truth. In this setting, the variables take one of the two values T (true) or F (false). However, it is natural to extend the semantics to other semirings. Here, the idea is to interpret logical formulae when the variables take values over a semiring $\mathbb{K}=(K,+,\cdot,0,1)$ . Such interpretations provide richer information beyond the truth or falsity of a statement and have applications in several areas such as databases, AI, logic, and security (see [ILJ89, FR97, Zim97, CWW00, Cui02, GT20] and references therein). In particular, semiring provenance analysis has been successfully applied in several software systems, such as Orchestra and Propolis (see, e.g., [ADT11, DMRT14, FGT08, Gre11, Tan13]).

Examples of semirings that are studied in the literature include Viterbi semiring, fuzzy semiring, min-max or access control semiring, and tropical semiring. Semantics over the Viterbi semiring $\mathbb{V}=([0,1],\max,\cdot,0,1)$ has applications in database provenance, where $x\in[0,1]$ is interpreted as a confidence score [GT20, GKT07, Tan17, GM21], in probabilistic parsing, in probabilistic CSPs, and in Hidden Markov Models [Vit67, KM03, BMR95]. The access control semiring can be used as a tool in security specifications [GT20]. Other semirings of interest include the tropical semiring, used in cost analysis and algebraic formulation for shortest path algorithms [Moh02], and fuzzy semirings used in the context of fuzzy CSPs [BMR95].

Optimization problems over Boolean interpretations have been central in many applied as well as foundational areas. Indeed, the classical satisfiability problem is to determine whether a formula $\phi(x_{1},\cdots,x_{n})$ has an interpretation/assignment over the Boolean semiring that evaluates to True. Even though semiring semantics naturally appear in a variety of applications, optimization problems over semirings other than the Boolean semiring have not received much attention.

In this work, we introduce and investigate the complexity of optimization problems over semiring semantics. Let $\mathbb{K}=({K},+,\cdot,0,1)$ be a semiring with a total order over $K$ and $\varphi$ be a propositional formula over a set $X$ of variables. A $\mathbb{K}$ -interpretation $\pi$ is a function from $X$ to $K$ . Such an interpretation can be naturally extended to formula $\varphi$ , which we denote by $\mathsf{Sem}(\varphi,\pi)$ . We study the following computational problem: Given a propositional formula $\varphi$ in negation normal form over a set $X$ of variables, compute the maximum value of $\mathsf{Sem}(\varphi,\pi)$ over all possible interpretations $\pi$ . We call this problem $\mathsf{optSemVal}$ . A related problem, denoted $\mathsf{optSem}$ , is to compute an interpretation $\pi$ that maximizes $\mathsf{Sem}(\varphi,\pi)$ . Refer to Section 2 for a precise formulation of these problems.

There has been a rich history of work which formulated the notion of CSP over semirings and investigated local consistency algorithms in the general framework [Bis04, BG06, BMR95, BMR97, BMR+99, MRS06]. These works did not involve interpretations and did not focus on the computational complexity of the above-defined problems. Relatedly, the computational complexity of sum-of-product problems over semirings has been studied recently [EK21]. However, the problems they study are different from ours. To the best of our knowledge, the optimization problems $\mathsf{optSem}$ and $\mathsf{optSemVal}$ that we consider over semirings have not been studied earlier and there are no characterizations of their computational complexity.

1.1 Our Results

We comprehensively study the computational complexity of $\mathsf{optSem}$ and the related problem $\mathsf{optSemVal}$ over various semirings such as Viterbi semiring, tropical semiring, access control semiring and fuzzy semiring, from both an algorithmic and a complexity-theoretic viewpoint. When the underlying semiring is the Viterbi semiring, we call these problems ${\mathsf{optConf}}$ and ${\mathsf{optConfVal}}$ . Our results can be summarized as follows:

1. We establish that both $\mathsf{optConf}$ and $\mathsf{optConfVal}$ are in the complexity class $\mathrm{FP}^{\mathrm{NP}}$. The crucial underlying observation is that even though $\pi$ maps $X$ to real values in the range $[0,1]$, the solution to $\mathsf{optConfVal}$ can be represented using polynomially many bits. We then draw upon connections to Farey sequences to derive an algorithm with polynomially many $\mathrm{NP}$ calls (Theorem 3.2).

2. For CNF formulas, we establish an upper bound on $\mathsf{optConfVal}$ as a function of the maximum number of satisfiable clauses (Theorem 3.7).

3. We also establish a lower bound on the complexity of $\mathsf{optConfVal}$ and $\mathsf{optConf}$. In particular, we show that both problems are hard for the complexity class $\mathrm{FP}^{\mathrm{NP}[\log]}$. To this end, we demonstrate a reduction from MaxSATVal to $\mathsf{optConfVal}$; this reduction crucially relies on the above-mentioned upper bound on $\mathsf{optConfVal}$ in terms of the maximum number of satisfiable clauses (Theorem 3.9).

4. We design a polynomial-time approximation algorithm for $\mathsf{optConfVal}$ and establish an inapproximability result. In particular, for 3-CNF formulas with $m$ clauses, we design a $0.716^{m}$-approximation algorithm and show that the approximation factor cannot be improved to $0.845^{m}$ unless P = NP (Theorems 4.3 and 4.5).

5. Finally, we show that for the access control semiring, the complexity of these optimization problems is equivalent to the corresponding problems over the Boolean semiring (Theorem 5.3).

Remark 1.

Since the Viterbi semiring and the tropical semiring are isomorphic via the mapping $x\leftrightarrow-\ln x$, results established for the Viterbi semiring also hold for the tropical semiring. Since the fuzzy semiring can be seen as an “infinite refinement” of the access control semiring with the same algebraic structure, the results that we establish for the access control semiring also hold for the fuzzy semiring.

Organization. The rest of the paper is organized as follows. We give the necessary notation and definitions in Section 2. Section 3 details our results on the computational complexity of $\mathsf{optConf}$ and $\mathsf{optConfVal}$ . Section 4 deals with approximate algorithms and the hardness of approximation of $\mathsf{optConfVal}$ . In Section 5, we give complexity results for optimization problems for the access control semiring. Finally, we conclude in Section 6.

2 Preliminaries

We assume that the reader is familiar with the definition of a semiring. We denote a generic semiring by $\mathbb{K}=(K,+,\cdot,0,1)$ where $K$ is the underlying set. For interpreting formulas over $\mathbb{K}$, we will add a “negation” function $\mbox{$ \daleth $}:K\rightarrow K$. We assume $\daleth$ is a bijection so that $\mbox{$ \daleth $}(\mbox{$ \daleth $}(x))=x$, and $\mbox{$ \daleth $}(0)=1$. For ease of presentation, we use the most natural negation function (depending on the semiring). However, many of our results hold for very general interpretations of negation. Finally, as our focus is on optimization problems, we will also assume a (natural) total order on the elements of $K$.

For a set $X=\{x_{1},x_{2},\ldots,x_{n}\}$ of variables, we associate the set $\overline{X}=\{\neg x_{1},\ldots,\neg x_{n}\}$. We call $X\cup\overline{X}$ the literals, and the formulas we consider are propositional formulas over $X\cup\overline{X}$ in negation normal form. We also view a propositional formula $\varphi$ in negation normal form as a rooted directed tree wherein each leaf node is labeled with a literal, 1, or 0 and each internal node is labeled with conjunction $(\wedge)$ or disjunction $(\vee)$. Note that viewing $\varphi$ as a tree ensures a similar size as its string representation. We call the tree representing the formula $\varphi$ the formula tree and denote it by $T_{\varphi}$. For a propositional formula $\varphi(x_{1},\cdots,x_{n})$ in negation normal form, we use $m$ to denote the size of the formula, i.e., the total number of occurrences of variables and their negations. When $\varphi(x_{1},\cdots,x_{n})$ is in CNF, $m$ denotes the number of clauses. We interpret a propositional formula over a semiring $\mathbb{K}$ by mapping the variables to $K$ and naturally extending it. Formally, a $\mathbb{K}$-interpretation is a function $\pi:X\rightarrow K$. We extend $\pi$ to an arbitrary propositional formula $\varphi$ in negation normal form, which is denoted by $\mathsf{Sem}(\varphi,\pi)$ ($\mathsf{Sem}$ stands for ‘semantics’), as follows.

- $\mathsf{Sem}(x,\pi)=\pi(x)$

- $\mathsf{Sem}(\neg x,\pi)=\mbox{$ \daleth $}(\pi(x))$

- $\mathsf{Sem}(\alpha\vee\beta,\pi)=\mathsf{Sem}(\alpha,\pi)+\mathsf{Sem}(\beta,\pi)$

- $\mathsf{Sem}(\alpha\wedge\beta,\pi)=\mathsf{Sem}(\alpha,\pi)\cdot\mathsf{Sem}(\beta,\pi)$
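The four rules above amount to a structural recursion over the formula. A minimal sketch follows, assuming a hypothetical tuple encoding of formulas (`("var", name)`, `("neg", name)`, `("or", left, right)`, `("and", left, right)`) and a hypothetical `Semiring` record; none of these names come from the paper. The Viterbi instance uses the negation $1-x$ that the paper adopts in Section 3.

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Any, Callable, Dict

@dataclass(frozen=True)
class Semiring:
    add: Callable[[Any, Any], Any]  # interprets disjunction
    mul: Callable[[Any, Any], Any]  # interprets conjunction
    neg: Callable[[Any], Any]       # the negation function on K

def sem(phi, pi: Dict[str, Any], K: Semiring):
    """Evaluate Sem(phi, pi) by recursion on a formula in negation normal form."""
    tag = phi[0]
    if tag == "var":
        return pi[phi[1]]
    if tag == "neg":
        return K.neg(pi[phi[1]])
    if tag not in ("or", "and"):
        raise ValueError(f"unknown node type: {tag}")
    left, right = sem(phi[1], pi, K), sem(phi[2], pi, K)
    return K.add(left, right) if tag == "or" else K.mul(left, right)

# Two illustrative instances (the encoding of each is an assumption of this sketch).
BOOLEAN = Semiring(add=lambda a, b: a or b, mul=lambda a, b: a and b,
                   neg=lambda a: not a)
VITERBI = Semiring(add=max, mul=lambda a, b: a * b, neg=lambda a: 1 - a)

# (x1 AND x2) AND (NOT x1 OR NOT x2)
phi = ("and", ("and", ("var", "x1"), ("var", "x2")),
              ("or", ("neg", "x1"), ("neg", "x2")))

boolean_value = sem(phi, {"x1": True, "x2": True}, BOOLEAN)
viterbi_value = sem(phi, {"x1": Fraction(1), "x2": Fraction(1, 2)}, VITERBI)
```

The same formula is evaluated under two semirings only by swapping the `Semiring` argument, which mirrors how the definition is parametric in $\mathbb{K}$.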

2.1 Optimization Problems and Complexity Classes

For a formula $\varphi$ , we define $\mathsf{optSemVal}(\varphi)$ as

$\mathsf{optSemVal}(\varphi)=\max_{\pi}\{\mathsf{Sem}(\varphi,\pi)\}$,

where $\max$ is taken over all possible $\mathbb{K}$ -interpretations from $X$ to $K$ .

Definition 2.1 ( $\mathsf{optSem}$ and $\mathsf{optSemVal}$ ).

Given a propositional formula $\varphi$ in negation normal form, the $\mathsf{optSemVal}$ problem is to compute $\mathsf{optSemVal}(\varphi)$. The $\mathsf{optSem}$ problem is to compute a $\mathbb{K}$-interpretation that achieves $\mathsf{optSemVal}(\varphi)$, i.e., output $\pi^{*}$ so that $\mathsf{optSemVal}(\varphi)=\mathsf{Sem}(\varphi,\pi^{*})$.

Notice that when $\mathbb{K}$ is the Boolean semiring (with $0<1$ ordering and standard negation interpretation), $\mathsf{optSemVal}$ is the well-known satisfiability problem: the formula $\varphi$ is satisfiable if and only if $\mathsf{optSemVal}(\varphi)=1$ . Also, the problem $\mathsf{optSem}$ is to output a satisfying assignment if the formula $\varphi$ is satisfiable.

In this work, we consider the following semirings.

1. The Viterbi semiring $\mathbb{V}=([0,1],\max,\cdot,0,1)$. As mentioned, the Viterbi semiring has applications in database provenance, where $x\in[0,1]$ is interpreted as a confidence score, in probabilistic parsing, in probabilistic CSPs, and in Hidden Markov Models.

2. The tropical semiring $\mathbb{T}=(\mathbb{R}\cup\{\infty\},\min,+,\infty,0)$. The tropical semiring is isomorphic to the Viterbi semiring via the mapping $x\leftrightarrow-\ln x$.

3. The fuzzy semiring $\mathbb{F}=([0,1],\max,\min,0,1)$.

4. The access control semiring $\mathbb{A}_{k}=([k],\max,\min,0,k)$. Intuitively, each $i\in[k]$ is associated with an access control level with natural ordering. Here 0 corresponds to public access and $k$ corresponds to no access at all. $[k]$ is the set $\{0<1<\cdots<k\}$.
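The correspondence $x\leftrightarrow-\ln x$ between the Viterbi and tropical operations can be spot-checked numerically. The sketch below is only an illustration on two sample values; the helper name `to_tropical` is hypothetical, and floating-point comparisons use `math.isclose` rather than exact equality.

```python
import math

def to_tropical(x: float) -> float:
    """Map a Viterbi value in [0, 1] to a tropical value via x -> -ln x."""
    return math.inf if x == 0 else -math.log(x)

a, b = 0.25, 0.8  # arbitrary sample confidences

# Viterbi addition (max) corresponds to tropical addition (min).
sum_ok = math.isclose(to_tropical(max(a, b)),
                      min(to_tropical(a), to_tropical(b)))

# Viterbi multiplication (product) corresponds to tropical multiplication (+).
prod_ok = math.isclose(to_tropical(a * b), to_tropical(a) + to_tropical(b))

# The identities 0 and 1 of the Viterbi semiring map to infinity and 0.
ids_ok = to_tropical(0.0) == math.inf and to_tropical(1.0) == 0.0
```

Because the map reverses the order on values, maximizing a confidence in $\mathbb{V}$ corresponds to minimizing a cost in $\mathbb{T}$.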

Most of our focus will be on the complexity of the $\mathsf{optSem}$ and $\mathsf{optSemVal}$ problems over the Viterbi semiring. We call the corresponding computational problems $\mathsf{optConf}$ and $\mathsf{optConfVal}$ respectively. In this case, we refer to the extended interpretation function $\mathsf{Sem}$ as $\mathsf{Conf}$.

Definition 2.2 ( $\mathsf{MaxSat}$ and $\mathsf{MaxSatVal}$ ).

Given a propositional formula $\varphi$ in CNF form, the $\mathsf{MaxSat}$ problem is to compute an assignment of $\varphi$ that satisfies the maximum number of clauses. Given a propositional formula $\varphi$ in CNF form, the $\mathsf{MaxSatVal}$ problem is to compute the maximum number of clauses of $\varphi$ that can be satisfied.

We need a notion of reductions between functional problems. We use the notion of metric reductions introduced by Krentel [Kre88].

Definition 2.3 (Metric Reduction).

For two functions $f,g:\{0,1\}^{*}\rightarrow\{0,1\}^{*}$ , we say that $f$ metric reduces to $g$ if there are polynomial-time computable functions $h_{1}$ and $h_{2}$ where $h_{1}:\{0,1\}^{*}\rightarrow\{0,1\}^{*}$ (the reduction function) and $h_{2}:\{0,1\}^{*}\times\{0,1\}^{*}\rightarrow\{0,1\}^{*}$ so that for any $x$ , $f(x)=h_{2}(x,g(h_{1}(x)))$ .

Definition 2.4.

For a function $t:\mathbb{N}\rightarrow\mathbb{N}$, ${\rm FP}^{{\rm NP}[t(n)]}$ denotes the class of functions that can be computed in polynomial time with $O(t(n))$ queries to an ${\rm NP}$ oracle, where $n$ is the size of the input. When $t(n)$ is some polynomial, we denote the class by ${\rm FP}^{{\rm NP}}$.

Metric reductions are used to define notions of completeness and hardness for function classes ${\rm FP}^{{\rm NP}}$ and ${\rm FP}^{{\rm NP}[\log]}$ . The following result due to Krentel [Kre88] characterizes the complexity of the $\mathsf{MaxSatVal}$ problem.

Theorem 2.5 ([Kre88]).

$\mathsf{MaxSatVal}$ is complete for ${\rm FP}^{{\rm NP}[\log]}$ under metric reductions.

The following proposition is a basic ingredient in our results. It can be proved using basic calculus.

Proposition 1.

Let $f(x)=x^{a}(1-x)^{b}$, where $a,b$ are non-negative integers. The maximum value of $f(x)$ over the domain $[0,1]$ is attained at $x=\frac{a}{a+b}$. The maximum value of the function is $\left(\frac{a}{a+b}\right)^{a}\left(\frac{b}{a+b}\right)^{b}$.
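The calculus behind Proposition 1 can be sketched as follows for the case $a,b\geq 1$. The boundary cases $a=0$ or $b=0$ are read with the convention $0^{0}=1$, which is an assumption of this sketch rather than something the text states.

```latex
\begin{align*}
f(x)  &= x^{a}(1-x)^{b}, \qquad a,b\geq 1,\\
f'(x) &= a\,x^{a-1}(1-x)^{b} - b\,x^{a}(1-x)^{b-1}\\
      &= x^{a-1}(1-x)^{b-1}\bigl(a-(a+b)\,x\bigr),\\
f'(x) &= 0 \ \text{ for } x\in(0,1) \iff x=\frac{a}{a+b}.
\end{align*}
```

Since $f(0)=f(1)=0$ and $f>0$ on $(0,1)$, the unique interior critical point is the maximizer. For instance, with $a=1$ and $b=2$ the maximizer is $x=1/3$ and the maximum value is $\frac{1}{3}\cdot\left(\frac{2}{3}\right)^{2}=\frac{4}{27}$.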

3 Computational Complexity of Confidence Maximization

For semantics over Viterbi semiring we assume the standard closed world semantics and use the negation function $\mbox{$ \daleth $}(x)=1-x$ . Thus we have $\mathsf{Conf}(\neg x,\pi)+\mathsf{Conf}(x,\pi)=1$ . However, our upper bound proofs go through for any reasonable negation function. We discuss this in Remark 2.

Since $\mathsf{Conf}(\varphi,\pi)$ can be computed in polynomial time, $\mathsf{optConf}$ is at least as hard as $\mathsf{optConfVal}$. The following observation states that computing $\mathsf{optConfVal}$ and $\mathsf{optConf}$ is ${\rm NP}$-hard.

Observation 3.1.

For a formula $\varphi$, $\mathsf{optConfVal}(\varphi)=1$ if and only if $\varphi$ is satisfiable. Hence both $\mathsf{optConf}$ and $\mathsf{optConfVal}$ are NP-hard.

While both $\mathsf{optConf}$ and $\mathsf{optConfVal}$ are ${\rm NP}$ -hard, we would like to understand their relation to other maximization problems. In the study of optimization problems, the complexity classes ${\rm FP}^{{\rm NP}}$ and ${\rm FP}^{{\rm NP}[\log]}$ play a key role. In this section, we investigate both upper and lower bounds for these problems in relation to the classes ${\rm FP}^{{\rm NP}}$ and ${\rm FP}^{{\rm NP}[\log]}$ .

An Illustrative Example.

We first provide an illustrative example that gives the idea behind the upper bound. Consider the formula $\phi(x_{1},x_{2})=(x_{1})\wedge(x_{2})\wedge(\neg x_{1}\vee\neg x_{2})$. Clearly, the formula is not satisfiable. Over the Viterbi semiring, $\mathsf{optConfVal}(\phi)=\max\limits_{x_{i}\in[0,1]}\left\{x_{1}x_{2}(1-x_{1}),x_{1}x_{2}(1-x_{2})\right\}$ by distributivity. By Proposition 1, this is maximized when $x_{1}=1$ and $x_{2}=0.5$, or when $x_{1}=0.5$ and $x_{2}=1$, leading to an optimum value of $0.25$. In the following section, we show that the computation of $\mathsf{optConfVal}$ reduces to maximization over a set of polynomial terms, wherein each polynomial term corresponds to a proof tree, which we define. While the number of polynomial terms could be exponential, we use an NP oracle to binary search for the term that gives the maximum value.
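The example can be checked by evaluating the formula on a finite grid. This is only a sanity check under the assumption that the grid contains the optimal points (it does here, since it contains $0.5$ and $1$); it is not how the paper computes the optimum. The function name `conf` and the grid resolution are choices made for this sketch.

```python
from fractions import Fraction
from itertools import product

def conf(x1, x2):
    """Conf of x1 AND x2 AND (NOT x1 OR NOT x2) over the Viterbi semiring."""
    return x1 * x2 * max(1 - x1, 1 - x2)

# Exact rational grid on [0, 1] with step 1/20 (arbitrary resolution).
grid = [Fraction(i, 20) for i in range(21)]

best = max(conf(a, b) for a, b in product(grid, grid))
maximizers = sorted((a, b) for a, b in product(grid, grid) if conf(a, b) == best)
```

On this grid the maximum is attained exactly at the two interpretations named in the text, and nowhere else.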

3.1 An Upper Bound for General Formulae

We show that $\mathsf{optConfVal}$ and $\mathsf{optConf}$ can be computed in polynomial-time with oracle queries to an ${\rm NP}$ language.

Theorem 3.2.

$\mathsf{optConfVal}$ for formulas in negation normal form is in ${\rm FP}^{{\rm NP}}$.

Proof Idea: In order to show that $\mathsf{optConfVal}$ is in ${\rm FP}^{{\rm NP}}$, we use a binary search strategy using a language in ${\rm NP}$. One of the challenges is that the confidence value could potentially be any real number in $[0,1]$, and thus a priori we may not be able to bound the number of binary search queries. However, we first argue that for any formula $\varphi$ on $n$ variables and with size $m$, $\mathsf{optConfVal}(\varphi)$ is a fraction of the form $A/B$ where $1\leq A\leq B\leq 2^{nm\log m}$. Ordered fractions of this form are known as the Farey sequence of order $2^{nm\log m}$ (denoted by ${\mathcal{F}}_{2^{nm\log m}}$). Thus our task is to do a binary search over ${\mathcal{F}}_{2^{nm\log m}}$ with time complexity $O(nm\log m)$. However, in general, a binary search for an unknown element in the Farey sequence ${\mathcal{F}}_{N}$ with time complexity $O(\log N)$ appears to be unknown. We overcome this difficulty by using an ${\rm NP}$ oracle to aid the binary search. We now give the details.

Definition 3.3.

Let $\varphi(x_{1},\cdots,x_{n})$ be a propositional formula in negation normal form with size $m$ . Let $T_{\varphi}$ be its formula tree. A proof tree $T$ of $T_{\varphi}$ is a subtree obtained by the following process: for every OR node $v$ , choose one of the sub-trees of $v$ . For every AND node $v$ , keep all the subtrees.

Note that in a proof tree every OR node has only one child.

Definition 3.4.

Let $\varphi(x_{1},\cdots,x_{n})$ be a propositional formula in negation normal form and let $T$ be a proof tree. We define the proof tree polynomial $p_{T}$ by inductively defining a polynomial for the subtree at every node $v$ (denoted by $p_{v}$): If the node $v$ is a variable $x_{i}$, the polynomial is $x_{i}$, and if it is $\neg x_{i}$, the polynomial is $(1-x_{i})$. If $v$ is an AND node with children $v_{1},\ldots,v_{s}$, then $p_{v}=\prod_{i=1}^{s}p_{v_{i}}$. If $v$ is an OR node with a child $u$, then $p_{v}=p_{u}$.

Claim 3.4.1.

Let $\varphi(x_{1},\cdots,x_{n})$ be a propositional formula in negation normal form and let $T$ be a proof tree of $\varphi$.

1. The proof tree polynomial $p_{T}$ is of the form $\prod_{i=1}^{n}x_{i}^{a_{i}}(1-x_{i})^{b_{i}}$, where $0\leq a_{i}+b_{i}\leq m$.

2. For a $\mathbb{V}$-interpretation $\pi$, $\mathsf{Conf}(T,\pi)=p_{T}(\pi(x_{1}),\dots,\pi(x_{n}))$.

3. Both $\mathsf{optConf}(T)$ and $\mathsf{optConfVal}(T)$ can be computed in polynomial time.

4. $\mathsf{optConfVal}(T)=\prod_{i=1}^{n}\left(\frac{a_{i}}{a_{i}+b_{i}}\right)^{a_{i}}\left(\frac{b_{i}}{a_{i}+b_{i}}\right)^{b_{i}}.$

Proof.

Item (1) follows from the definition of the proof tree polynomial and a routine induction and the fact that the size of the formula $\varphi$ is $m$ . Item (2) follows from the definitions.

For Item (3), note that the polynomial $\prod_{i=1}^{n}x_{i}^{a_{i}}(1-x_{i})^{b_{i}}$ can be maximized by maximizing each of the individual terms $x_{i}^{a_{i}}(1-x_{i})^{b_{i}}$. By Proposition 1, the maximum value for a polynomial of this form is achieved at $x_{i}=\frac{a_{i}}{a_{i}+b_{i}}$. Thus the interpretation $\pi(x_{i})=\frac{a_{i}}{a_{i}+b_{i}}$ is an optimal $\mathbb{V}$-interpretation that can be computed in polynomial time. Since $0\leq a_{i}+b_{i}\leq m$, $\mathsf{optConfVal}$ can also be computed in polynomial time. Item (4) follows from Item (3), by substituting the values $\pi(x_{i})$ for $x_{i}$ in the polynomial $p_{T}$. ∎
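The per-variable maximization used in this proof can be sketched directly. The function below takes the exponent pairs $(a_{i},b_{i})$ of a proof tree polynomial and returns the interpretation $\pi(x_{i})=a_{i}/(a_{i}+b_{i})$ together with the resulting value, using exact rationals. The function name, the dictionary encoding, and the arbitrary value given to variables that do not occur are assumptions of this sketch.

```python
from fractions import Fraction

def maximize_proof_tree_polynomial(exponents):
    """Maximize prod x_i^a_i (1 - x_i)^b_i, one variable at a time.

    `exponents` maps each variable name to its pair (a_i, b_i).
    Returns (interpretation, value).
    """
    interpretation, value = {}, Fraction(1)
    for var, (a, b) in exponents.items():
        if a + b == 0:
            # The variable does not occur in the proof tree, so its factor is 1
            # and any value works; 1/2 is an arbitrary choice for this sketch.
            interpretation[var] = Fraction(1, 2)
            continue
        x = Fraction(a, a + b)           # maximizer from Proposition 1
        interpretation[var] = x
        value *= x ** a * (1 - x) ** b   # Python evaluates Fraction(0) ** 0 as 1
    return interpretation, value

# Proof tree polynomial x1 * x2 * (1 - x1) from the illustrative example.
pi_star, v_star = maximize_proof_tree_polynomial({"x1": (1, 1), "x2": (1, 0)})
```

Each variable is handled independently, so the running time is linear in the number of variables apart from the cost of the rational arithmetic.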

The next claim relates $\mathsf{optConf}$ of the formula $\varphi$ to $\mathsf{optConf}$ of its proof trees. The proof of this claim follows from the definition of proof tree and standard induction.

Claim 3.4.2.

For a formula $\varphi$ ,

$\mathsf{optConfVal}(\varphi)=\max_{T}\mathsf{optConfVal}(T)$

where the maximum is taken over all proof trees $T$ of $T_{\varphi}$. If $T^{*}$ is the proof tree for which $\mathsf{optConfVal}(T)$ is maximized, then $\mathsf{optConf}(T^{*})=\mathsf{optConf}(\varphi)$.

The above claim states that $\mathsf{optConf}(\varphi)$ can be computed by cycling through all proof trees $T$ of $\varphi$ and computing $\mathsf{optConf}(T)$ . Since there could be exponentially many proof trees, this process would take exponential time. Our task is to show that this process can be done in ${\rm FP}^{{\rm NP}}$ . To do this we establish a claim that restricts values that $\mathsf{optConfVal}(\varphi)$ can take. We need the notion of Farey sequence.
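The exponential-time process described here, cycling through all proof trees, can be written down for small formulas. The sketch below assumes a hypothetical encoding with n-ary nodes (`("and", [children])`, `("or", [children])`) and literal leaves; it is a brute-force reference, not the ${\rm FP}^{{\rm NP}}$ procedure developed in the text.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def proof_trees(phi):
    """Yield one Counter of literal multiplicities per proof tree of phi."""
    tag = phi[0]
    if tag in ("var", "neg"):
        yield Counter({(tag, phi[1]): 1})
    elif tag == "or":
        # A proof tree keeps exactly one child of every OR node.
        for child in phi[1]:
            yield from proof_trees(child)
    elif tag == "and":
        # A proof tree keeps all children of every AND node.
        for combo in product(*(list(proof_trees(c)) for c in phi[1])):
            yield sum(combo, Counter())
    else:
        raise ValueError(f"unknown node type: {tag}")

def opt_conf_val_of_tree(literals):
    """Value of the proof tree polynomial at its maximizer (Item 4 of Claim 3.4.1)."""
    value = Fraction(1)
    for x in {name for _, name in literals}:
        a, b = literals[("var", x)], literals[("neg", x)]
        value *= Fraction(a, a + b) ** a * Fraction(b, a + b) ** b
    return value

def opt_conf_val(phi):
    """Maximum over all proof trees; exponential in the worst case."""
    return max(opt_conf_val_of_tree(t) for t in proof_trees(phi))

# x1 AND x2 AND (NOT x1 OR NOT x2), the illustrative example.
phi = ("and", [("var", "x1"), ("var", "x2"),
               ("or", [("neg", "x1"), ("neg", "x2")])])
example_value = opt_conf_val(phi)
```

For the illustrative formula there are two proof trees, one per disjunct, and both evaluate to the same optimum.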

Definition 3.5.

For any positive integer $N$ , the Farey sequence of order $N$ , denoted by ${\mathcal{F}}_{N}$ , is the set of all irreducible fractions $p/q$ with $0<p<q\leq N$ arranged in increasing order.

Claim 3.5.1.

1. For a propositional formula $\varphi(x_{1},\cdots,x_{n})$, $\mathsf{optConfVal}(\varphi)$ belongs to the Farey sequence ${\mathcal{F}}_{2^{nm\log m}}$.

2. For any two distinct fractions $u$ and $v$ from ${\mathcal{F}}_{2^{nm\log m}}$, $|u-v|\geq 1/2^{2nm\log m}$.

Proof.

By Claim 3.4.2, $\mathsf{optConfVal}(\varphi)$ equals $\mathsf{optConfVal}(T)$ , for some proof tree $T$ . By Item (4) of Claim 3.4.1 this value is a product of fractions, where the denominator of each fraction is of the form $(a_{i}+b_{i})^{a_{i}+b_{i}}$ where $a_{i}$ and $b_{i}$ are non-negative integers. Since $a_{i}+b_{i}\leq m$ , each denominator is at most $m^{m}$ , and thus the denominator of the product is bounded by $m^{nm}=2^{nm\log m}$ . Since the numerator is at most the denominator, the claim follows.

For the proof of the second part, let $u=p_{1}/q_{1}$ and $v=p_{2}/q_{2}$ with $u>v$. Now $u-v=(p_{1}q_{2}-p_{2}q_{1})/(q_{1}q_{2})$. Since $q_{1},q_{2}\leq 2^{nm\log m}$, we have $u-v\geq(p_{1}q_{2}-p_{2}q_{1})/2^{2nm\log m}$. Since $p_{1},p_{2},q_{1},q_{2}$ are all integers and $u>v$, $p_{1}q_{2}-p_{2}q_{1}\geq 1$. Thus $|u-v|\geq 1/2^{2nm\log m}$.

∎
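The separation bound in the second part of the claim says that distinct fractions with denominators at most $N$ differ by at least $1/N^{2}$. A small sketch that checks this for one small order, using Definition 3.5 as stated ($0<p<q\leq N$); the function name `farey` and the choice $N=8$ are illustrative.

```python
from fractions import Fraction
from math import gcd

def farey(N):
    """Irreducible fractions p/q with 0 < p < q <= N, in increasing order."""
    return sorted(Fraction(p, q)
                  for q in range(2, N + 1)
                  for p in range(1, q)
                  if gcd(p, q) == 1)

N = 8
F = farey(N)

# Smallest difference between consecutive (hence between any two) elements.
min_gap = min(v - u for u, v in zip(F, F[1:]))
gap_bound_holds = min_gap >= Fraction(1, N * N)
```

In the proof, the role of $N$ is played by $2^{nm\log m}$, so the bound $1/N^{2}$ becomes $1/2^{2nm\log m}$.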

Consider the following language

$L_{\it opt}=\{\langle\varphi,v\rangle\mid\mathsf{optConfVal}(\varphi)\geq v\}$

Claim 3.5.2.

$L_{\it opt}$ is in ${\rm NP}$.

Proof.

Consider the following non-deterministic machine $M$. On input $\langle\varphi,v\rangle$, $M$ guesses a proof tree $T$ of $\varphi$: for every OR node, it non-deterministically picks one of the subtrees. For $T$, it computes $\mathsf{optConfVal}(T)$ and accepts if $\mathsf{optConfVal}(T)\geq v$. This can be done in polynomial time using Item (3) of Claim 3.4.1. The correctness of this algorithm follows from Claim 3.4.2. ∎
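The machine $M$ can be viewed as a deterministic verifier that receives the guessed OR choices as a certificate. A minimal sketch follows, assuming a hypothetical encoding in which the certificate lists one child index per OR node, in the order the nodes are visited depth-first; the paper does not fix an encoding, and malformed certificates are not handled here.

```python
from collections import Counter
from fractions import Fraction

def follow_certificate(phi, choices):
    """Literal multiplicities of the proof tree selected by `choices`.

    `choices` is an iterator of child indices, consumed once per OR node.
    Nodes are ("var", x), ("neg", x), ("and", [children]), ("or", [children]).
    """
    tag = phi[0]
    if tag in ("var", "neg"):
        return Counter({(tag, phi[1]): 1})
    if tag == "or":
        return follow_certificate(phi[1][next(choices)], choices)
    total = Counter()
    for child in phi[1]:  # AND node: keep every child
        total += follow_certificate(child, choices)
    return total

def verify(phi, v, certificate):
    """Accept iff the proof tree named by the certificate has optConfVal >= v."""
    literals = follow_certificate(phi, iter(certificate))
    value = Fraction(1)
    for x in {name for _, name in literals}:
        a, b = literals[("var", x)], literals[("neg", x)]
        value *= Fraction(a, a + b) ** a * Fraction(b, a + b) ** b
    return value >= v

phi = ("and", [("var", "x1"), ("var", "x2"),
               ("or", [("neg", "x1"), ("neg", "x2")])])
accepts = verify(phi, Fraction(1, 4), [0])
rejects = verify(phi, Fraction(1, 3), [0])
```

The verifier does a single pass over the formula, so it runs in polynomial time; $\langle\varphi,v\rangle$ is in $L_{\it opt}$ exactly when some certificate is accepted.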

We need a method that, given two fractions $u$ and $v$ and an integer $N$, outputs a fraction $p/q\in{\mathcal{F}}_{N}$ with $u\leq p/q\leq v$. We give an ${\rm FP}^{{\rm NP}}$ algorithm that makes $O(\log N)$ queries to the ${\rm NP}$ oracle to achieve this. We first define the ${\rm NP}$ language $L_{\it farey}$. For this, we fix any standard encoding of fractions using the binary alphabet. Such an encoding will have an $O(\log N)$-bit representation for any fraction in ${\mathcal{F}}_{N}$.

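$L_{\it farey}=\{\langle N,u,v,z\rangle\mid\exists z^{\prime};\ u\leq zz^{\prime}\leq v\ \&\ zz^{\prime}\in{\mathcal{F}}_{N}\}$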

The following claim is easy to see.

Claim 3.5.3.

$L_{\it farey}\in{\rm NP}$ .

Now we are ready to prove Theorem 3.2.

Proof.

(of Theorem 3.2). The algorithm performs a binary search over the range $[0,1]$ by making adaptive queries $\langle\varphi,v\rangle$ to the ${\rm NP}$ language $L_{\it opt}$, starting with $v=1$. At any iteration of the binary search, we have an interval $I=[I_{l},I_{r}]$ with the invariant $I_{l}\leq\mathsf{optConfVal}(\varphi)<I_{r}$. The binary search stops when the size of the interval $[I_{l},I_{r}]$ is $1/2^{2nm\log m}$. Since each iteration of the binary search reduces the size of the interval by a factor of 2, the search stops after making $2nm\log m$ queries to $L_{\it opt}$. The invariant ensures that $\mathsf{optConfVal}(\varphi)$ is in this interval. Moreover, $\mathsf{optConfVal}(\varphi)\in{\mathcal{F}}_{2^{nm\log m}}$ (by item (1) of Claim 3.5.1) and there are no other fractions from ${\mathcal{F}}_{2^{nm\log m}}$ in this interval (by item (2) of Claim 3.5.1). Now, by making $O(nm\log m)$ queries to $L_{\it farey}$ with $N=2^{nm\log m}$, $u=I_{l}$, $v=I_{r}$, we can construct the binary representation of the unique fraction in ${\mathcal{F}}_{2^{nm\log m}}$ that lies between $I_{l}$ and $I_{r}$, which is $\mathsf{optConfVal}(\varphi)$. ∎
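The binary search in this proof can be sketched with a callable standing in for the $L_{\it opt}$ oracle. Two substitutions are made and should be read as assumptions of the sketch: the oracle is simulated with a known hidden value, and the final step uses Python's `Fraction.limit_denominator` in place of the queries to $L_{\it farey}$. To make that rounding step provably safe, the sketch narrows the interval to width below $1/(2N^{2})$, slightly further than the proof does.

```python
from fractions import Fraction

def recover_value(oracle, N):
    """Find the fraction t with denominator <= N such that oracle(v) == (t >= v).

    Returns (t, number_of_oracle_queries).
    """
    lo, hi = Fraction(0), Fraction(1)
    queries = 1
    if oracle(hi):  # first query uses v = 1
        return hi, queries
    # Invariant: lo <= t < hi.
    while hi - lo >= Fraction(1, 2 * N * N):
        mid = (lo + hi) / 2
        if oracle(mid):
            lo = mid
        else:
            hi = mid
        queries += 1
    # t is within 1/(2 N^2) of lo, and any other fraction with denominator
    # <= N is at least 1/N^2 away from t, so t is the closest one to lo.
    return lo.limit_denominator(N), queries

hidden = Fraction(4, 27)  # e.g. the optimum of x(1-x)^2
value, used = recover_value(lambda v: hidden >= v, 27)
```

The number of oracle calls grows like $\log N$, which matches the $O(nm\log m)$ query count in the proof when $N=2^{nm\log m}$.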

Next we show that an optimal $\mathbb{V}$-interpretation can also be computed in polynomial time with queries to an ${\rm NP}$ oracle.

Theorem 3.6.

$\mathsf{optConf}$ for formulas in negation normal form can be computed in ${\rm FP}^{{\rm NP}}$.

Proof.

Let $\varphi$ be a propositional formula in negation normal form. We use a prefix search over the encoding of proof trees of $\varphi$ using an ${\rm NP}$ language to isolate a proof tree $T$ such that $\mathsf{optConfVal}(\varphi)=\mathsf{optConfVal}(T)$ . For this, we fix an encoding of proof trees of $\varphi$ . Consider the following ${\rm NP}$ language $L_{\it pt}$ :

[TABLE]

Claim 3.6.1.

$L_{\it pt}$ is in ${\rm NP}$.

Proof.

Consider a non-deterministic machine that guesses a $z^{\prime}$ , verifies that $zz^{\prime}$ encodes a proof tree $T$ of $\varphi$ , and accepts if $\mathsf{optConfVal}(T)=v$ . By item (3) of Claim 3.4.1, $\mathsf{optConfVal}(T)$ can be computed in polynomial time. ∎

To complete the proof of Theorem 3.6, given a propositional formula $\varphi$, we first use the ${\rm FP}^{{\rm NP}}$ algorithm from Theorem 3.2 to compute $v^{*}=\mathsf{optConfVal}(\varphi)$. We then construct a proof tree $T$ of $\varphi$ with $\mathsf{optConfVal}(T)=v^{*}$ by a prefix search using the language $L_{\it pt}$. Next, by Claim 3.4.1, we can compute a $\mathbb{V}$-interpretation $\pi^{*}$ so that $\mathsf{Conf}(T,\pi^{*})=v^{*}$. Thus $\pi^{*}$ is an optimal $\mathbb{V}$-interpretation for $\varphi$, by Claim 3.4.2. ∎
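The prefix search used in this proof follows a generic pattern, sketched below in Python under simplifying assumptions: witnesses are bit strings of a known fixed length, at least one witness exists, and the callable `has_extension` is a hypothetical stand-in for one query to an ${\rm NP}$ language such as $L_{\it pt}$.

```python
def prefix_search(has_extension, length):
    """Return the lexicographically first witness of the given bit length.

    has_extension(z) stands in for one NP-oracle query: it reports whether
    some witness starts with the prefix z. At least one witness must exist.
    """
    z = ""
    for _ in range(length):
        # Keep the bit 0 if some witness extends z + "0"; otherwise a witness
        # must extend z + "1", because z itself is a prefix of a witness.
        z += "0" if has_extension(z + "0") else "1"
    return z
```

Each loop iteration costs one oracle query, so the number of queries equals the length of the encoding.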

Remark 2.

We revisit the semantics of negation. As stated earlier, by assuming the closed world semantics, we have $\mbox{$ \daleth $}(x)=1-x$ . We note that this assumption is not strictly necessary for the above proof to go through. Recall that Item (1) of Claim 3.4.1 states that the proof tree polynomial is of the form $\prod x_{i}^{a_{i}}(1-x_{i})^{b_{i}}$ . For a general negation function $\daleth$ , the proof tree polynomial is of the form $\prod x_{i}^{a_{i}}(\mbox{$ \daleth $}(x_{i}))^{b_{i}}$ . Now if the maximum value of a term $x^{a}(\mbox{$ \daleth $}(x))^{b}$ can be found, for example when $\daleth$ is an explicit differentiable function, the result will hold.

3.2 Relation to $\mathsf{MaxSat}$ for CNF Formulae

In this section we study the $\mathsf{optConfVal}$ problem for CNF formulae and establish its relation to the $\mathsf{MaxSat}$ problem. We first exhibit an upper bound on $\mathsf{optConfVal}(\varphi)$ in terms of the maximum number of satisfiable clauses. Building on this result, in Section 3.3 we show that $\mathsf{optConfVal}$ for CNF formulae is hard for the complexity class ${\rm FP}^{{\rm NP}[\log]}$.

We first define some notation that will be used in this and the next subsection. Let $\varphi(x_{1},\cdots,x_{n})=C_{1}\wedge\cdots\wedge C_{m}$ be a CNF formula and let $\pi^{*}$ be an optimal $\mathbb{V}$-interpretation. For each clause $C$ of $\varphi$, let $\pi^{*}(C)$ be the value achieved by this interpretation, i.e., $\pi^{*}(C)=\mathsf{Conf}(C,\pi^{*})$. Observe that since $C$ is a disjunction of literals, $\pi^{*}(C)=\max_{\ell\in C}\{\pi^{*}(\ell)\}$. For a clause $C$, let

$$\ell_{C}=\arg\max_{\ell\in C}\{\pi^{*}(\ell)\}.$$

In the above, if there are multiple maximizers, we take the smallest literal as $\ell_{C}$ (assuming the order $x_{1}<\neg{x_{1}}<x_{2}<\neg{x_{2}}<\cdots<x_{n}<\neg{x_{n}}$). Observe that, since we are working over the Viterbi semiring, $\mathsf{Conf}(C,\pi^{*})=\pi^{*}(\ell_{C})$. A literal $\ell$ is a maximizing literal for a clause $C$ if $\ell_{C}=\ell$.

Since $\varphi$ is a CNF formula, for any $\mathbb{V}$-interpretation $\pi$, $\mathsf{Conf}(\varphi,\pi)$ is of the form $\Pi_{i=1}^{m}\mathsf{Conf}(C_{i},\pi)$. Given a collection of clauses ${\mathcal{D}}$ from $\varphi$, the contribution of ${\mathcal{D}}$ to $\mathsf{Conf}(\varphi,\pi)$ is defined as $\Pi_{C\in{\mathcal{D}}}\mathsf{Conf}(C,\pi)$.

The following theorem provides an upper bound on $\mathsf{optConfVal}(\varphi)$ in terms of $\mathsf{MaxSatVal}$. This is the main result of this section.

Theorem 3.7.

Let $\varphi(x_{1},\cdots,x_{n})$ be a CNF formula with $m$ clauses. Let $r$ be the maximum number of clauses that can be satisfied. Then $\mathsf{optConfVal}(\varphi)\leq 1/4^{(m-r)}$ .

Proof.

Let $\pi^{*}$ be an optimal $\mathbb{V}$-interpretation for $\varphi$. A clause $C$ is called a low-clause if $\pi^{*}(C)<1/2$, a high-clause if $\pi^{*}(C)>1/2$, and a neutral-clause if $\pi^{*}(C)=1/2$. Let $L$, $H$, and $N$ respectively denote the number of low, high, and neutral clauses.

We start with the following claim that relates the number of neutral clauses and the number of high-clauses to $r$ .

Claim 3.7.1.

$\frac{N}{2}+H\leq r$.

Proof.

Suppose that the number of low-clauses is strictly less than $m-r$ , thus number of high-clauses is more than $r$ .

For a variable $x$ , let

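$$p_{x}=|\{C\mid C\text{ is a neutral clause and }\ell_{C}=x\}|$$

and

$$q_{x}=|\{C\mid C\text{ is a neutral clause and }\ell_{C}=\neg x\}|.$$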

That is $p_{x}$ is the number of neutral clauses for which $x$ is the maximizing literal and $q_{x}$ is the number of neutral clauses for which $\neg x$ is the maximizing literal.

Consider the truth assignment constructed by the following three rules: (1) For every high-clause $C$, set $\ell_{C}$ to True and $\neg{\ell_{C}}$ to False. (2) For every variable $x$ for which at least one of $p_{x}$ or $q_{x}$ is nonzero, set $x$ to True if $p_{x}\geq q_{x}$, and to False otherwise. (3) Assign all remaining variables arbitrary, but consistent, truth values.

We argue that this is a consistent assignment, i.e., for every literal $\ell$, the literals $\ell$ and $\neg{\ell}$ are not assigned the same truth value. Consider a literal $\ell$. If there is a high clause $C$ such that $\ell=\ell_{C}$, then this literal is assigned the truth value True and $\neg{\ell}$ is assigned False. In this case, since $\pi^{*}(\ell)>1/2$, we have $\pi^{*}(\neg{\ell})<1/2$. Thus $\neg{\ell}$ cannot be a maximizing literal for any high-clause, and so Rule (1) does not assign True to $\neg{\ell}$. Again, since $\pi^{*}(\ell)>1/2$, there is no neutral-clause $D$ such that $\ell=\ell_{D}$ or $\neg{\ell}=\ell_{D}$. Thus Rule (2) does not assign a truth value to either $\ell$ or $\neg{\ell}$. Since $\ell$ and $\neg{\ell}$ have already been assigned truth values, Rule (3) does not assign a truth value to $\ell$ or $\neg{\ell}$.

Consider a variable $x$ where at least one of $p_{x}$ or $q_{x}$ is not zero. In this case $x$ or $\neg{x}$ is a maximizing literal for a neutral clause. Thus $\pi^{*}(x)=\pi^{*}(\neg{x})=1/2$, and neither $x$ nor ${\neg{x}}$ is a maximizing literal for a high-clause. Thus Rule (1) does not assign a truth value to $x$ or ${\neg{x}}$. Now $x$ is True if and only if $p_{x}\geq q_{x}$, so the truth value assigned to $x$ (and $\neg{x}$) is consistent. Since Rule (3) consistently assigns truth values to the literals that are not covered by Rules (1) and (2), the constructed assignment is a consistent assignment.

For every high clause $C$ , literal $\ell_{C}$ is set to true. Thus the assignment satisfies all the high-clauses. Consider a variable $x$ and let $\mathcal{D}$ be the (non-empty) collection of neutral clauses for which either $x$ or $\neg{x}$ is a maximizing literal. As $x$ is assigned True if and only if $p_{x}\geq q_{x}$ , at least half the clauses from $\mathcal{D}$ are satisfied. Thus this assignment satisfies at least $H+\frac{N}{2}$ clauses. Since $r$ is the maximum number of satisfiable clauses, the claim follows. ∎
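The three rounding rules can be sketched in Python as follows. This is an illustrative sketch only: clauses are assumed to be lists of nonzero integers (`i` for $x_{i}$, `-i` for $\neg x_{i}$), the interpretation is a dictionary of exact `Fraction` values, and the function names are ours. Rule (3) is instantiated by defaulting every untouched variable to True.

```python
from fractions import Fraction


def lit_value(lit, pi):
    """Value of a literal under pi, with negation a -> 1 - a."""
    return pi[lit] if lit > 0 else 1 - pi[-lit]


def maximizing_literal(clause, pi):
    """Literal of maximum value; ties broken by x1 < not-x1 < x2 < not-x2 < ..."""
    best = max(lit_value(l, pi) for l in clause)
    ties = [l for l in clause if lit_value(l, pi) == best]
    return min(ties, key=lambda l: (abs(l), l < 0))


def round_interpretation(cnf, pi, n):
    """Build the truth assignment described by Rules (1)-(3)."""
    half = Fraction(1, 2)
    assignment = {}
    p = {i: 0 for i in range(1, n + 1)}
    q = {i: 0 for i in range(1, n + 1)}
    for clause in cnf:
        lit = maximizing_literal(clause, pi)
        value = lit_value(lit, pi)
        if value > half:                 # Rule (1): high-clause
            assignment[abs(lit)] = lit > 0
        elif value == half:              # neutral clause: count votes
            if lit > 0:
                p[abs(lit)] += 1
            else:
                q[abs(lit)] += 1
    for i in range(1, n + 1):
        if p[i] or q[i]:                 # Rule (2): majority among neutral clauses
            assignment[i] = p[i] >= q[i]
        assignment.setdefault(i, True)   # Rule (3): arbitrary default
    return assignment


def count_satisfied(cnf, assignment):
    return sum(any((l > 0) == assignment[abs(l)] for l in clause) for clause in cnf)
```

On the formula $(x_{1}\vee x_{2})\wedge\neg x_{1}\wedge\neg x_{2}$ with $\pi(x_{1})=1/2$ and $\pi(x_{2})=0$, there is one high-clause and two neutral clauses, and the rounded assignment satisfies $2=H+N/2$ clauses.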

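For a literal $\ell$, let $a_{\ell}$ be the number of low-clauses $C$ for which $\ell$ is a maximizing literal, and let $b_{\ell}$ be the number of high-clauses $C$ for which $\neg{\ell}$ is a maximizing literal, i.e.,

$$a_{\ell}=|\{C\mid C\text{ is a low-clause and }\ell_{C}=\ell\}|$$

and

$$b_{\ell}=|\{C\mid C\text{ is a high-clause and }\ell_{C}=\neg{\ell}\}|.$$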

We show the following relation between $a_{\ell}$ and $b_{\ell}$ .

Claim 3.7.2.

For every literal $\ell$ , $a_{\ell}\leq b_{\ell}$ .

Proof.

[TABLE]

Now suppose that $a_{\ell}>b_{\ell}$ for some literal $\ell$ . Let $x_{j}$ be the variable corresponding to the literal $\ell$ . Note that

[TABLE]

where $\pi^{*}(\ell)<1/2$. Consider a new interpretation $\pi^{\prime}$ where $\pi^{\prime}(\ell)=1-\pi^{*}(\ell)$, and for all other literals the value of $\pi^{\prime}$ is the same as the value of $\pi^{*}$. Now

[TABLE]

The last inequality follows because $\pi^{*}(\ell)<1/2$ and from the assumption that $a_{\ell}>b_{\ell}$. Since $\mathsf{Conf}(\varphi_{\mid x},\pi^{*})=\mathsf{Conf}(\varphi_{\mid x},\pi^{\prime})$ for every $x\neq x_{j}$, combining the above inequality with Equation 1, we obtain that $\mathsf{Conf}(\varphi,\pi^{\prime})>\mathsf{Conf}(\varphi,\pi^{*})$, and thus $\pi^{*}$ is not an optimal $\mathbb{V}$-interpretation. This is a contradiction. Thus $a_{\ell}\leq b_{\ell}$. ∎

We next bound the contribution of neutral and low clauses to $\mathsf{optConfVal}(\varphi)$ . For every neutral clause $C$ , $\pi^{*}(C)=1/2$ , thus we have the following observation.

Observation 3.8.

The contribution of neutral clauses to $\mathsf{Conf}(\varphi,\pi^{*})$ is exactly $1/2^{N}$ .

We establish the following claim.

Claim 3.8.1.

[TABLE]

Proof.

By Observation 3.8, the contribution of neutral clauses to $\mathsf{Conf}(\varphi,\pi^{*})$ is $1/2^{N}$. Next we show that the contribution of all high and low clauses is exactly

[TABLE]

For this we first claim that exactly one of $\ell$ or $\neg{\ell}$ contributes to the above product. It suffices to prove that for every literal $\ell$ exactly one of $a_{\ell}$ ($b_{\ell}$ resp.) or $a_{\neg{\ell}}$ ($b_{\neg{\ell}}$ resp.) is zero. Suppose $a_{\ell}\neq 0$; in this case $\neg{\ell}$ cannot be a maximizing literal for any low clause. Thus $a_{\neg{\ell}}=0$. Suppose that $b_{\ell}\neq 0$; then $\neg{\ell}$ is a maximizing literal for a high clause and thus $\pi^{*}(\neg{\ell})>1/2$ and $\pi^{*}(\ell)\leq 1/2$. If $b_{\neg{\ell}}\neq 0$, then $\ell$ must be a maximizing literal for a high-clause, and this is not possible as $\pi^{*}(\ell)\leq 1/2$. Thus $b_{\neg{\ell}}=0$.

Let $Z$ be the collection of literals $\ell$ for which $a_{\ell}>0$. The quantity $\prod_{\ell\in Z}\pi^{*}(\ell)^{a_{\ell}}\times(1-\pi^{*}(\ell))^{b_{\ell}}$ captures the contribution of all low clauses and of $\sum_{\ell\in Z}b_{\ell}$ many high-clauses. For each of the remaining high-clauses, there exists a literal $\ell$ such that $\ell\notin Z$ and $b_{\ell}\neq 0$. The contribution of all the remaining high-clauses is $\prod_{\ell\notin Z}(1-\pi^{*}(\ell))^{b_{\ell}}$. This quantity equals $\prod_{\ell\notin Z}\pi^{*}(\ell)^{a_{\ell}}\times(1-\pi^{*}(\ell))^{b_{\ell}}$ as $a_{\ell}=0$ for $\ell\notin Z$. ∎

Finally, we are ready to complete the proof of Theorem 3.7. For every literal $\ell$, by Claim 3.7.2, $a_{\ell}\leq b_{\ell}$. Let $b_{\ell}=a_{\ell}+c_{\ell}$ with $c_{\ell}\geq 0$. Consider the following inequalities.

[TABLE]

In the above, equality at line 2 is due to Claim 3.8.1. The inequality at line 4 follows because $(1-\pi^{*}(\ell))\leq 1$ . The last inequality follows because $x(1-x)$ is maximized at $x=1/2$ . The last equality follows as $\sum a_{\ell}=L$ . Note that the number of clauses $m=N+H+L$ and by Claim 3.7.1 $H+N/2\leq r$ . It follows that $L+N/2\geq m-r$ . Thus $\mathsf{optConfVal}(\varphi)=\mathsf{Conf}(\varphi,\pi^{*})\leq{1\over 4^{L+N/2}}\leq{1\over 4^{m-r}}$ . ∎
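As a sanity check of Theorem 3.7 on tiny instances, the following Python sketch compares $\mathsf{Conf}(\varphi,\pi)$ on a coarse grid of interpretations against the bound $1/4^{m-r}$. It is a brute-force illustration, not part of the proof. Clauses are assumed to be lists of nonzero integers (`i` for $x_{i}$, `-i` for $\neg x_{i}$); `conf`, `max_sat` and `grid_max_conf` are names introduced here.

```python
from fractions import Fraction
from itertools import product


def conf(cnf, pi):
    """Viterbi semantics of a CNF: product over clauses of the max literal value."""
    value = Fraction(1)
    for clause in cnf:
        value *= max(pi[l] if l > 0 else 1 - pi[-l] for l in clause)
    return value


def max_sat(cnf, n):
    """Maximum number of simultaneously satisfiable clauses, by brute force."""
    best = 0
    for bits in product([False, True], repeat=n):
        sat = sum(any((l > 0) == bits[abs(l) - 1] for l in clause) for clause in cnf)
        best = max(best, sat)
    return best


def grid_max_conf(cnf, n, steps=4):
    """Largest Conf value over the grid {0, 1/steps, ..., 1}^n (a lower bound on the optimum)."""
    grid = [Fraction(i, steps) for i in range(steps + 1)]
    return max(
        conf(cnf, dict(zip(range(1, n + 1), values)))
        for values in product(grid, repeat=n)
    )
```

For $x_{1}\wedge\neg x_{1}$ we have $m=2$ and $r=1$, and the grid maximum $1/4$ meets the bound $1/4^{m-r}$ exactly, so the bound is tight on this instance.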

3.3 ${\rm FP}^{{\rm NP}[\log]}$ - Hardness

In this subsection, we show that $\mathsf{optConfVal}$ is hard for the class ${\rm FP}^{{\rm NP}[\log]}$ . We show this by reducing $\mathsf{MaxSatVal}$ to $\mathsf{optConfVal}$ . Since $\mathsf{MaxSatVal}$ is complete for ${\rm FP}^{{\rm NP}[\log]}$ , the result follows. We also show that the same reduction can be used to compute a $\mathsf{MaxSat}$ assignment from an optimal $\mathbb{V}$ -interpretation.

Theorem 3.9.

$\mathsf{MaxSatVal}$ metric reduces to $\mathsf{optConfVal}$ for CNF formulae. Hence $\mathsf{optConfVal}$ is hard for ${\mathrm{FP}}^{\mathrm{NP}[\log]}$ for CNF formulae.

Proof.

Let $\varphi(x_{1},\ldots,x_{n})=C_{1}\wedge\ldots\wedge C_{m}$ be a formula with $m$ clauses on variables $x_{1},\ldots,x_{n}$. Consider the formula $\varphi^{\prime}$ with $m$ additional variables $y_{1},\ldots,y_{m}$ constructed as follows: For each clause $C_{i}$ of $\varphi$, add the clause $C^{\prime}_{i}=C_{i}\vee y_{i}$ to $\varphi^{\prime}$. Also add the $m$ unit clauses $\neg y_{i}$. That is,

$$\varphi^{\prime}=(C_{1}\vee y_{1})\wedge\cdots\wedge(C_{m}\vee y_{m})\wedge\neg y_{1}\wedge\cdots\wedge\neg y_{m}.$$

Claim 3.9.1.

$\mathsf{optConfVal}(\varphi^{\prime})=\frac{1}{4^{m-r}}$, where $r$ is the maximum number of clauses that can be satisfied in $\varphi$.

Proof.

We show this claim by first showing that $\mathsf{optConfVal}(\varphi^{\prime})\leq\frac{1}{4^{m-r}}$ and then exhibiting an interpretation $\pi^{*}$ so that $\mathsf{Conf}(\varphi^{\prime},\pi^{*})=\frac{1}{4^{m-r}}$. We claim that if $r$ is the maximum number of clauses that can be satisfied in $\varphi$, then $m+r$ is the maximum number of clauses that can be satisfied in $\varphi^{\prime}$. We argue this by contradiction. Let $\mathbf{a}$ be an assignment that satisfies more than $m+r$ clauses of $\varphi^{\prime}$. Let $s$ be the number of $y_{i}$s that are set to False. This assignment satisfies exactly $s$ of the unit clauses $\neg y_{i}$, and it satisfies the $m-s$ clauses of the form $C_{i}\vee y_{i}$ in which $y_{i}$ is set to True. However, the total number of clauses of the form $C_{i}\vee y_{i}$ that are satisfied is $>m+r-s$. Thus there are $>r$ satisfied clauses of the form $C_{i}\vee y_{i}$ in which $y_{i}$ is set to False. This assignment, when restricted to the $x_{i}$s, satisfies more than $r$ clauses of $\varphi$, a contradiction.

Thus from Theorem 3.7, applied to $\varphi^{\prime}$ with its $2m$ clauses, it follows that $\mathsf{optConfVal}(\varphi^{\prime})\leq\frac{1}{4^{2m-(m+r)}}=\frac{1}{4^{m-r}}$. Now we exhibit an interpretation $\pi^{*}$ so that $\mathsf{Conf}(\varphi^{\prime},\pi^{*})=\frac{1}{4^{m-r}}$. Consider an assignment $\mathbf{a}=a_{1},\ldots,a_{n}$ for $\varphi$ that satisfies $r$ clauses. Consider the following interpretation $\pi^{*}$ over the variables of $\varphi^{\prime}$: $\pi^{*}(x_{i})=1$ if $a_{i}={\rm True}$ and $\pi^{*}(x_{i})=0$ if $a_{i}={\rm False}$; $\pi^{*}(y_{i})=0$ if $C_{i}$ is satisfied by $\mathbf{a}$, and $\pi^{*}(y_{i})=1/2$ otherwise. For every clause $C_{i}$ satisfied by $\mathbf{a}$, $\mathsf{Conf}(C_{i}\vee y_{i},\pi^{*})=1$ and $\mathsf{Conf}(\neg y_{i},\pi^{*})=1$. For all other clauses $C$ in $\varphi^{\prime}$, $\mathsf{Conf}(C,\pi^{*})=1/2$. Since there are $r$ clauses that are satisfied, the number of clauses for which $\mathsf{Conf}(C,\pi^{*})=1/2$ is $2m-2r$. Hence $\mathsf{Conf}(\varphi^{\prime},\pi^{*})=\frac{1}{4^{(m-r)}}$. Thus $\mathsf{optConfVal}(\varphi^{\prime})=\frac{1}{4^{m-r}}$. ∎

Since $\mathsf{optConfVal}(\varphi^{\prime})=1/4^{m-r}$ , $\mathsf{MaxSatVal}$ for $\varphi$ can be computed by knowing the $\mathsf{optConfVal}$ . ∎
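The reduction and the decoding step can be sketched in Python. The sketch is illustrative and rests on assumptions of ours: clauses are lists of nonzero integers (`i` for $x_{i}$, `-i` for $\neg x_{i}$), the fresh variable $y_{i}$ is numbered `n + i`, and all function names are hypothetical. `witness_interpretation` follows the interpretation built in the proof of Claim 3.9.1, and `max_sat_from_value` inverts $1/4^{m-r}$.

```python
from fractions import Fraction


def build_phi_prime(cnf, n):
    """Return the clauses (C_i or y_i) followed by the unit clauses (not y_i)."""
    m = len(cnf)
    relaxed = [clause + [n + i] for i, clause in enumerate(cnf, start=1)]
    units = [[-(n + i)] for i in range(1, m + 1)]
    return relaxed + units


def witness_interpretation(cnf, n, assignment):
    """x_i gets 1 or 0 from the assignment; y_i gets 0 if C_i is satisfied, else 1/2."""
    pi = {i: Fraction(int(assignment[i])) for i in range(1, n + 1)}
    for i, clause in enumerate(cnf, start=1):
        satisfied = any((lit > 0) == assignment[abs(lit)] for lit in clause)
        pi[n + i] = Fraction(0) if satisfied else Fraction(1, 2)
    return pi


def conf(cnf, pi):
    """Viterbi semantics of a CNF: product over clauses of the max literal value."""
    value = Fraction(1)
    for clause in cnf:
        value *= max(pi[l] if l > 0 else 1 - pi[-l] for l in clause)
    return value


def max_sat_from_value(value, m):
    """Decode r from value = 1 / 4^(m - r)."""
    exponent, denominator = 0, value.denominator
    while denominator > 1:
        denominator //= 4
        exponent += 1
    return m - exponent
```

For $\varphi=x_{1}\wedge\neg x_{1}$ and the assignment $x_{1}={\rm True}$, the witness interpretation achieves $1/4$ on $\varphi^{\prime}$, which decodes to $r=1$.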

While the above theorem shows that $\mathsf{MaxSatVal}$ can be computed from $\mathsf{optConfVal}$ , the next theorem shows that a maxsat assignment can be computed from an optimal $\mathbb{V}$ -interpretation.

Theorem 3.10.

$\mathsf{MaxSat}$ metric reduces to $\mathsf{optConf}$.

Proof.

Consider the same reduction as in the previous theorem. Our task is to construct a $\mathsf{MaxSat}$ assignment for $\varphi$, given an optimal $\mathbb{V}$-interpretation $\pi$ for $\varphi^{\prime}$. By the earlier theorem, $\mathsf{Conf}(\varphi^{\prime},\pi)=\frac{1}{4^{m-r}}$, where $r$ is the maximum number of satisfiable clauses of $\varphi$.

We next establish a series of claims on the values taken by $\pi(y_{i})$ and $\pi(x_{i})$.

Claim 3.10.1.

For all $y_{i}$ ; $\pi(y_{i})\in\{0,1/2\}$ .

Proof.

Consider a clause $C_{i}^{\prime}=(C_{i}\vee y_{i})$ for which $\ell_{C^{\prime}_{i}}=y_{i}$. Now the contribution of $C^{\prime}_{i}$ and the clause $\neg{y_{i}}$ to $\mathsf{Conf}(\varphi^{\prime},\pi)$ is $\pi(y_{i})\times(1-\pi(y_{i}))$. Since there is no other clause $C^{\prime}_{j}$, $j\neq i$, for which $\ell_{C^{\prime}_{j}}=y_{i}$, the above value is maximized when $\pi(y_{i})=1/2$. Now consider a clause $C^{\prime}_{j}=(C_{j}\vee y_{j})$ for which $\ell_{C^{\prime}_{j}}\neq y_{j}$. The contribution of $C^{\prime}_{j}$ and the clause $\neg{y_{j}}$ to $\mathsf{Conf}(\varphi^{\prime},\pi)$ is $\pi(\ell_{C^{\prime}_{j}})\times\pi(\neg{y_{j}})$. Since $\ell_{C^{\prime}_{j}}\neq y_{j}$, and there is no other clause in which $y_{j}$ or $\neg{y_{j}}$ appears, the above expression is maximized when $\pi(\neg{y_{j}})=1$, and thus $\pi(y_{j})=0$. ∎

Claim 3.10.2.

For every $i$, if $y_{i}$ is not a maximizing literal for clause $C^{\prime}_{i}$, then $\pi(y_{i})=0$.

Proof.

Let $C^{\prime}_{i}$ be a clause for which $y_{i}$ is not a maximizing literal. Say $\ell_{j}$ is the maximizing literal. We first consider the case $\pi(\ell_{j})<1/2$. By the previous claim, $\pi(y_{i})\in\{0,1/2\}$, and if $\pi(y_{i})=1/2$, then $\ell_{j}$ cannot be a maximizing literal for clause $C^{\prime}_{i}$. Thus $\pi(y_{i})=0$. Now consider the case $\pi(\ell_{j})\geq 1/2$. Suppose that $\pi(y_{i})=1/2$. Then the contribution of the clauses $C^{\prime}_{i}$ and $\neg{y_{i}}$ to $\mathsf{Conf}(\varphi^{\prime},\pi)$ is $\pi(\ell_{j})/2$. However, if we change $\pi(y_{i})$ to $0$, then the contribution of these clauses becomes $\pi(\ell_{j})$, which contradicts the optimality of $\pi$. Thus, by Claim 3.10.1, $\pi(y_{i})=0$. ∎

Claim 3.10.3.

For all $x_{i}$, if $x_{i}$ or $\neg{x_{i}}$ is a maximizing literal, then $\pi(x_{i})\in\{0,1,1/2\}$.

Proof.

We argue for the case when $x_{i}$ is a maximizing literal. The case when $\neg{x_{i}}$ is a maximizing literal follows by similar arguments. Suppose that $x_{i}$ is a maximizing literal, $\pi(x_{i})<1/2$, and $\pi(x_{i})$ is neither 0 nor 1. It must be the case that $\neg{x_{i}}$ is also a maximizing literal, since otherwise setting $\pi(x_{i})=1$ would increase the trust value. Suppose $x_{i}$ is a maximizing literal for $a$ many clauses and $\neg{x_{i}}$ is a maximizing literal for $b$ many clauses. If $a>b$, then we can obtain a better $\mathbb{V}$-interpretation by swapping $\pi(x_{i})$ with $\pi(\neg{x_{i}})$. If $a$ equals $b$, then $\pi(x_{i})$ must be equal to $1/2$, as $x^{a}(1-x)^{a}$ is maximized at $x=1/2$. Thus $a<b$. For every clause $C^{\prime}_{j}$ for which $x_{i}$ or $\neg{x_{i}}$ is the maximizing literal, it must be the case that $\pi(y_{j})=0$, by Claim 3.10.2. Let $\mathcal{C}$ be the collection of all clauses $C^{\prime}_{j}$, together with $\neg{y_{j}}$, where either $x_{i}$ or $\neg{x_{i}}$ is the maximizing literal. The contribution of these clauses to $\mathsf{Conf}(\varphi^{\prime},\pi)$ is $\pi(x_{i})^{a}\times(1-\pi(x_{i}))^{b}\times 1^{a+b}$.

We now construct a new $\mathbb{V}$-interpretation $\pi^{\prime}$ that contradicts the optimality of $\pi$. For every clause $C^{\prime}_{j}\in\mathcal{C}$ in which $x_{i}$ is the maximizing literal, set $\pi^{\prime}(y_{j})=1/2$, and set $\pi^{\prime}(x_{i})=0$. Now the contribution of the clauses from $\mathcal{C}$ to $\mathsf{Conf}(\varphi^{\prime},\pi^{\prime})$ is $(\frac{1}{2})^{a}\times 1^{b}\times(\frac{1}{2})^{a}\times 1^{b}$.

Since $x^{a}(1-x)^{b}<1/4^{a}$ (when $a<b$ ),

[TABLE]

Thus $\mathsf{Conf}(\varphi,\pi^{\prime})>\mathsf{Conf}(\varphi,\pi)$ which is a contradiction. Thus if $\pi(x_{i})<1/2$ , then $\pi(x_{i})=0$ , a similar argument shows that if $\pi(x_{i})>1/2$ , then $\pi(x_{i})=1$ . ∎

Claim 3.10.4.

For every $x_{i}$ with $\pi(x_{i})=1/2$ , $x_{i}$ and $\neg{x_{i}}$ are maximizing literals for exactly the same number of clauses.

Proof.

Let $\mathcal{C}$ be the collection of clauses for which either $x_{i}$ or $\neg{x_{i}}$ is maximizing literal. Suppose that $x_{i}$ is maximizing literal for $a$ clauses and $\neg{x_{i}}$ is maximizing literal for $b$ clauses. If $a\neq b$ , $\pi(x_{i})=\frac{a}{a+b}\notin\{0,1,1/2\}$ and this contradicts Claim 3.10.3. ∎

We will show how to construct a $\mathsf{MaxSat}$ assignment from $\pi$ : If $\pi(x_{i})=0$ , set the truth value of $x_{i}$ to False, else set the truth value of $x_{i}$ to True.

By Claim 3.10.3, $\pi(x_{i})\in\{0,1/2,1\}$. Let $H$ be the number of clauses for which the maximizing literal $\ell$ is an $x$-variable and $\pi(\ell)=1$. Note that the above truth assignment satisfies all these $H$ clauses. Let $N$ be the number of clauses for which the maximizing literal $\ell$ is an $x$-variable and $\pi(\ell)=1/2$. By Claim 3.10.4, in exactly $N/2$ of these clauses a positive literal is maximizing, and thus all these $N/2$ clauses are satisfied by our truth assignment. Thus the total number of clauses satisfied by the truth assignment is $N/2+H$. Let $Y$ be the number of clauses in which $y_{i}$ is the maximizing literal. By Claim 3.10.1, $\pi(y_{i})=1/2$ when $y_{i}$ is a maximizing literal. Thus

[TABLE]

The last equality follows from Claim 3.9.1. Thus $m-r=N/2+Y$ , combining this with $m=H+N+Y$ , we obtain that $N/2+H=r$ . Thus the truth assignment constructed will satisfy $r$ clauses and is thus a $\mathsf{MaxSat}$ assignment. ∎

4 Approximating $\mathsf{optConfVal}$

We study the problem of approximating $\mathsf{optConfVal}$ efficiently. Below, a $k$-SAT formula is a CNF formula with exactly $k$ distinct variables in every clause. We start with the following lemma.

Lemma 4.1.

Let $a_{1},\cdots,a_{n}$ be an assignment that satisfies $r$ clauses of a CNF formula $\varphi(x_{1},\cdots,x_{n})$ with $m$ clauses. There is an interpretation $\pi$ so that $\mathsf{Conf}(\varphi,\pi)$ is at least $\left(\frac{m-r}{m}\right)^{m-r}\left(\frac{r}{m}\right)^{r}$.

Proof.

If $a_{i}=1$, set $\pi(x_{i})=(1-\epsilon)$, and if $a_{i}=0$, set $\pi(x_{i})=\epsilon$. For every clause $C_{i}$ that is satisfied, we obtain a max value of at least $(1-\epsilon)$, and for every clause that is not satisfied, the max value is $\geq\epsilon$. Thus the value $\mathsf{Conf}(\varphi,\pi)$ obtained by this interpretation is at least $(1-\epsilon)^{r}\epsilon^{m-r}$, and this bound is maximized when $\epsilon=\frac{m-r}{m}$ by Proposition 1. ∎

Hence, for example, if $\varphi$ is a 3-SAT formula, a random assignment satisfies a $7/8$ fraction of the clauses in expectation, so some assignment satisfies $r\geq 7m/8$ clauses, and by Lemma 4.1, $\mathsf{optConfVal}(\varphi)>0.686^{m}$. The following lemma shows that one can get a better lower bound on $\mathsf{optConfVal}$ in terms of the clause sizes for CNF formulae.
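The constant $0.686$ can be checked numerically. The short Python sketch below evaluates the expression from Lemma 4.1 at $\epsilon=(m-r)/m$ and the corresponding per-clause base $(1-s)^{1-s}s^{s}$ for a satisfied fraction $s=r/m$. The function names are ours.

```python
import math


def lemma_4_1_value(m, r):
    """(1 - eps)^r * eps^(m - r) at eps = (m - r) / m, for 0 < r < m."""
    eps = (m - r) / m
    return (1 - eps) ** r * eps ** (m - r)


def per_clause_base(s):
    """Base c such that the Lemma 4.1 value equals c^m when r = s * m, 0 < s < 1."""
    return (1 - s) ** (1 - s) * s ** s


# For 3-SAT, s = 7/8 gives a base of roughly 0.68607.
base_3sat = per_clause_base(7 / 8)
```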

Lemma 4.2.

For every CNF formula $\varphi$ , $\mathsf{optConfVal}(\varphi)\geq e^{-\sum_{i}\frac{1}{k_{i}}}$ where $k_{i}$ is the arity of the $i$ ’th clause in $\varphi$ .

Proof.

Consider the interpretation $\pi$ that assigns every variable $x_{i}$ a uniformly chosen value in the interval $[0,1]$ . Let the clauses in $\varphi$ be $C_{1},\dots,C_{m}$ . Then:

[TABLE]

Hence, there exists a choice of $\pi$ achieving this trust value. ∎

This yields a probabilistic algorithm. For example, if $\varphi$ is a $3$-SAT formula, $\mathsf{optConfVal}(\varphi)>0.716^{m}$, thus improving on the result of Lemma 4.1. In fact, we can design a deterministic polynomial-time algorithm that finds an interpretation achieving the trust value guaranteed by Lemma 4.2, using the well-known ‘method of conditional expectation’ to derandomize the construction in the proof (for example, see [AS08, GW94]).
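The exponent $-1/k_{i}$ in Lemma 4.2 matches a standard fact: if $M$ is the maximum of $k$ independent uniform values on $[0,1]$, then $\mathbb{E}[\ln M]=\int_{0}^{1}\ln(t)\,k\,t^{k-1}\,dt=-1/k$. The Python sketch below checks this with a midpoint rule. It is a numerical illustration under the assumption that the literals of a clause involve distinct variables, so that their values are independent and uniform. The function name is ours.

```python
import math


def expected_log_max(k, steps=100_000):
    """Midpoint-rule estimate of the integral of ln(t) * k * t^(k-1) over [0, 1]."""
    h = 1.0 / steps
    total = 0.0
    for j in range(steps):
        t = (j + 0.5) * h          # midpoint of the j-th subinterval
        total += math.log(t) * k * t ** (k - 1)
    return total * h


# For clauses of size 3 the estimate is close to -1/3, and exp(-1/3) is about 0.7165.
estimate_k3 = expected_log_max(3)
```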

Theorem 4.3.

There is a polynomial-time, $e^{-m/k}$-approximation algorithm for $\mathsf{optConf}$ when the input formulas are $k$-CNF formulas with $m$ clauses.

Proof.

Arbitrarily ordering the variables $x_{1},x_{2},\dots,x_{n}$ , the idea is to sequentially set $\pi^{*}(x_{1}),\pi^{*}(x_{2}),\dots,\pi^{*}(x_{n})$ ensuring that for every $i$ :

[TABLE]

Assuming $\pi^{*}(x_{1}),\dots,\pi^{*}(x_{i-1})$ have already been fixed, we show how to choose $\pi^{*}(x_{i})$ satisfying the above. We use $\pi_{<i}$ to denote $\pi^{*}(x_{1})\cdots\pi^{*}(x_{i-1})$. For a clause $C$, let $\alpha=\max_{\ell\in C\cap\{x_{j},\bar{x}_{j}:j<i\}}\pi^{*}(\ell)$, and suppose $x_{i}\in C$. Then:

[TABLE]

where $k^{\prime}$ is the number of literals in the clause $C$ involving variables $x_{i+1},\dots,x_{n}$ . One can similarly evaluate the conditional expectation in the cases $\bar{x}_{i}\in C$ and $C\cap\{x_{i},\bar{x}_{i}\}=\emptyset$ .

Summing up over all the clauses $C$ , we get that

[TABLE]

is a continuous function of $p$ that is a piecewise polynomial in at most $m$ intervals. In polynomial time, we can find a value of $p$ that maximizes this function (for simplicity, we ignore issues of precision here, but the error can be made inversely polynomial in $n$). By induction on $i$, the maximum value of this function is at least $-\sum_{i}\frac{1}{k_{i}}$, and hence (*) is satisfied. This completes the description of the algorithm.

∎
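The sequential choice of $\pi^{*}(x_{1}),\dots,\pi^{*}(x_{n})$ can be sketched in Python. The sketch makes two simplifications that are ours and not taken from the proof. First, it uses the closed form $\mathbb{E}[\ln\max(c,U_{1},\dots,U_{k^{\prime}})]=(c^{k^{\prime}}-1)/k^{\prime}$ for independent uniform $U_{j}$, which follows from a direct integration and assumes the variables of a clause are distinct. Second, it replaces the exact maximization of the piecewise polynomial by a search over a grid of candidate values for $p$, so the guarantee of Theorem 4.3 holds only approximately. Clauses are lists of nonzero integers (`i` for $x_{i}$, `-i` for $\neg x_{i}$), and all names are hypothetical.

```python
import math


def literal_value(lit, pi):
    value = pi[abs(lit)]
    return value if lit > 0 else 1.0 - value


def expected_log_clause(clause, pi):
    """E[ln Conf(C)] when variables in pi are fixed and the rest are uniform."""
    fixed = [literal_value(l, pi) for l in clause if abs(l) in pi]
    k_free = sum(1 for l in clause if abs(l) not in pi)
    c = max(fixed, default=0.0)
    if k_free == 0:
        return math.log(c) if c > 0 else float("-inf")
    return (c ** k_free - 1.0) / k_free


def derandomize(cnf, n, grid=100):
    """Fix x_1, ..., x_n in order, each time maximizing the conditional expectation."""
    pi = {}
    for i in range(1, n + 1):
        best_p, best_val = 0.0, float("-inf")
        for step in range(grid + 1):
            p = step / grid
            pi[i] = p                      # tentative value for x_i
            val = sum(expected_log_clause(c, pi) for c in cnf)
            if val > best_val:
                best_p, best_val = p, val
        pi[i] = best_p                     # commit the best grid value
    return pi
```

On $x_{1}\wedge\neg x_{1}$ the procedure picks $\pi(x_{1})=1/2$, giving $\ln\mathsf{Conf}=2\ln(1/2)\approx-1.386$, which is above the bound $-\sum_{i}1/k_{i}=-2$.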

Next, we show that the approximation factor $e^{-m/k}$ cannot be significantly improved.

We use the following result on the hardness of approximating $\mathsf{MaxSat}$ established by Håstad [Hås01].

Theorem 4.4 ([Hås01]).

For any $\varepsilon>0$ and any $k\geq 3$, it is NP-hard to distinguish satisfiable $k$-SAT formulas from $k$-SAT formulas in which fewer than $m(1-2^{-k}+\varepsilon)$ clauses can be simultaneously satisfied.

We are now ready to show the following.

Theorem 4.5.

There is no polynomial-time ${1\over 4^{m(2^{-k}-\varepsilon)}}$ -approximation algorithm for $\mathsf{optConf}$ for $k$ -SAT formulae, unless ${\rm P}={\rm NP}$ .

Proof.

Assuming such an approximation algorithm $A$ exists, we contradict Håstad’s Theorem (Theorem 4.4). Consider the following algorithm $A^{\prime}$ that, on input a $k$-SAT formula $\varphi$, runs $A(\varphi)$. If $A$ outputs a value that is $\geq{1\over 4^{m(2^{-k}-\varepsilon)}}$, then $A^{\prime}$ outputs YES; otherwise it outputs NO. Suppose $\varphi$ is satisfiable; then $\mathsf{optConfVal}(\varphi)=1$. Hence $A$ will output a value $\geq{1\over 4^{m(2^{-k}-\varepsilon)}}$. Thus $A^{\prime}$ outputs YES. Suppose the maximum number of satisfiable clauses for $\varphi$ is $\leq m(1-2^{-k}+\varepsilon)$. By Theorem 3.7,

[TABLE]

Hence output of $A$ is $<{1\over 4^{m(2^{-k}-\varepsilon)}}$ and hence $A^{\prime}$ will output NO.

Thus $A^{\prime}$ contradicts Theorem 4.4, unless ${\rm P}={\rm NP}$ . ∎

Thus, for example, for $3$-SAT formulas, while we have a polynomial-time $0.716^{m}$-approximation algorithm (by Theorem 4.3), we cannot expect an efficient $0.845^{m}$-approximation algorithm by the above result unless ${\rm P}$ equals ${\rm NP}$. It remains an interesting open problem to determine the optimal approximation ratio for this problem achievable by a polynomial-time algorithm.
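The two constants quoted for $3$-SAT can be reproduced with a few lines of Python. The sketch below evaluates the per-clause bases $e^{-1/k}$ (Theorem 4.3) and $4^{-(2^{-k}-\varepsilon)}$ (Theorem 4.5). The function names and the sample value $\varepsilon=0.003$ are ours and chosen only for illustration.

```python
import math


def algorithm_base(k):
    """Per-clause base e^(-1/k) of the approximation factor e^(-m/k)."""
    return math.exp(-1.0 / k)


def hardness_base(k, eps=0.0):
    """Per-clause base 4^(-(2^(-k) - eps)) of the inapproximability factor."""
    return 4.0 ** -(2.0 ** -k - eps)


# k = 3: about 0.7165 for the algorithm, about 0.8409 for the hardness bound
# in the limit eps -> 0, and slightly larger for small positive eps.
gap = hardness_base(3) - algorithm_base(3)
```

For small positive $\varepsilon$ the hardness base stays below $0.845$, which is consistent with the figure quoted above.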

5 Complexity of Access Maximization

In this section, we study the optimization problems for the access control semiring $\mathbb{A}_{k}=([k],\max,\min,0,k)$ . We refer to the corresponding computational problems as $\mathsf{optAccessVal}$ and $\mathsf{optAccess}$ . For this section we first assume the negation function is the additive inverse modulo $k$ . That is $\mbox{$ \daleth $}(a)=b$ such that $a+b\equiv 0~{}({\rm mod}~{}k)$ .

Theorem 5.1.

Let $\varphi(x_{1},\cdots,x_{n})$ be a propositional formula in negation normal form and $\mathbb{A}_{k}=([k],\max,\min,0,k)$. The following statements hold.

• If $\varphi$ is satisfiable, then $\mathsf{optAccessVal}(\varphi)=k$.

• If $\varphi$ is not satisfiable, then $\mathsf{optAccessVal}(\varphi)=\lfloor\frac{k}{2}\rfloor$.

Proof.

We first prove the theorem for the case when $\varphi$ is in CNF, i.e., $\varphi=C_{1}\wedge\cdots\wedge C_{m}$. Suppose that the formula is satisfiable and $a_{1}\cdots a_{n}$ is a satisfying assignment to the variables $x_{1},x_{2},\cdots,x_{n}$. Consider the interpretation $\pi$ defined as follows: If $a_{i}$ is true, then $\pi(x_{i})=k$, else $\pi(x_{i})=\mbox{$ \daleth $}(k)$. Consider a clause $C$. Since the assignment satisfies the formula, there exists a literal $\ell_{i}$ (either $x_{i}$ or $\neg x_{i}$ for some $i$) in $C$ such that $\ell_{i}$ is set to true. If $\ell_{i}=x_{i}$, then $\pi(x_{i})=k$ and $\mathsf{Sem}(x_{i},\pi)=k$. If $\ell_{i}=\neg x_{i}$, then $\pi(x_{i})=\mbox{$ \daleth $}(k)=0$ and $\mathsf{Sem}(\neg x_{i},\pi)=\mbox{$ \daleth $}(0)=k$. Since $C$ is a disjunction, $\mathsf{Sem}(C,\pi)=k$. Thus for every clause $C_{i}$, $\mathsf{Sem}(C_{i},\pi)=k$. Since $\varphi$ is a conjunction of $C_{1},\cdots,C_{m}$, it follows that $\mathsf{Sem}(\varphi,\pi)=k$.

For the proof of the second item, first assume that $k$ is even; the proof when $k$ is odd is very similar. Note that in this case, $\mbox{$ \daleth $}(k/2)=k/2$. Let $\varphi=C_{1}\wedge\cdots\wedge C_{m}$ be an unsatisfiable formula. Consider an interpretation $\pi$ where $\pi(x_{i})=k/2$ for every $1\leq i\leq n$. Clearly, for this interpretation, $\mathsf{Sem}(\varphi,\pi)=k/2$. Suppose, for contradiction, that $\pi^{\prime}$ is an interpretation with $\mathsf{Sem}(\varphi,\pi^{\prime})>k/2$. Consider the following assignment: $a_{i}$ is true if $\pi^{\prime}(x_{i})>k/2$, else $a_{i}$ is false. Observe that this is a consistent assignment. We will establish that this assignment satisfies $\varphi$, contradicting the unsatisfiability of $\varphi$. This establishes that $\mathsf{optAccessVal}(\varphi)=k/2$.

Note that for every clause $C_{j}$, $1\leq j\leq m$, $\mathsf{Sem}(C_{j},\pi^{\prime})>k/2$. Fix a clause $C$. Since $\mathsf{Sem}(C,\pi^{\prime})>k/2$, there exists a literal $\ell_{i}$ in $C$ such that $\mathsf{Sem}(\ell_{i},\pi^{\prime})>k/2$. If $\ell_{i}=x_{i}$, then $\mathsf{Sem}(x_{i},\pi^{\prime})>k/2$, which implies that $\pi^{\prime}(x_{i})>k/2$. Thus $a_{i}$ is true and the clause $C$ is satisfied by the assignment. If $\ell_{i}=\neg x_{i}$, then $\mathsf{Sem}(\neg x_{i},\pi^{\prime})>k/2$. Thus $\mbox{$ \daleth $}(\pi^{\prime}(x_{i}))>k/2$. By the definition of $\daleth$, we have $\pi^{\prime}(x_{i})<k/2$. Thus $a_{i}$ is set to false, and the clause $C$ is satisfied. This proves that the assignment $a_{1},\cdots,a_{n}$ satisfies the formula $\varphi(x_{1},\cdots,x_{n})$.

The case where the general formula is in the negation normal form follows by similar ideas using the notion of proof trees as in the case of Viterbi semiring. ∎
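For small CNF instances, Theorem 5.1 can be checked by brute force. The Python sketch below is an illustration under assumptions of ours: $[k]$ is taken to be $\{0,1,\dots,k\}$, the negation is modelled as $\daleth(a)=k-a$ (which agrees with $\daleth(k)=0$ and $\daleth(0)=k$ as used in the proof), clauses are lists of nonzero integers (`i` for $x_{i}$, `-i` for $\neg x_{i}$), and the function names are hypothetical.

```python
from itertools import product


def sem(cnf, pi, k):
    """Access-control semantics of a CNF: min over clauses of the max literal value."""
    return min(
        max(pi[l] if l > 0 else k - pi[-l] for l in clause)
        for clause in cnf
    )


def opt_access_val(cnf, n, k):
    """Maximum of sem over all (k + 1)^n interpretations (exponential time)."""
    return max(
        sem(cnf, dict(zip(range(1, n + 1), values)), k)
        for values in product(range(k + 1), repeat=n)
    )
```

The unsatisfiable formula $x_{1}\wedge\neg x_{1}$ evaluates to $\lfloor k/2\rfloor$ for both $k=4$ and $k=5$, while the satisfiable formula $(x_{1}\vee x_{2})\wedge\neg x_{1}$ evaluates to $k$.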

For a general negation function, we can establish an analogous theorem. For this, we define the notion of the index of negation. Given a negation function $\daleth$ , its index denoted by ${\it Index}(\mbox{$ \daleth $})$ is the largest $\ell$ for which there exists $a\in[k]$ , such that both $a$ and $\mbox{$ \daleth $}(a)$ are at least $\ell$ .

Theorem 5.2.

Let $\varphi(x_{1},\cdots,x_{n})$ be a propositional formula in negation normal form and $\mathbb{A}_{k}=([k],\max,\min,0,k)$. The following statements hold.

• If $\varphi$ is satisfiable, then $\mathsf{optAccessVal}(\varphi)=k$.

• If $\varphi$ is not satisfiable, then $\mathsf{optAccessVal}(\varphi)={\it Index}(\mbox{$ \daleth $})$.

The following is a corollary of the above result and its proof. It states that the complexity of optimization problems over the access control semiring is equivalent to their complexity over the Boolean semiring.

Theorem 5.3.

The problems $\mathsf{optAccessVal}$ and ${\mathsf{SAT}}$ are equivalent under metric reductions. Similarly, the problem $\mathsf{optAccess}$ and the problem of computing a satisfying assignment of a given Boolean formula are equivalent under metric reductions.

6 Conclusion

In this work, we provided a comprehensive study of the computational complexity of $\mathsf{optSem}$ and the related problem $\mathsf{optSemVal}$ over various semirings such as Viterbi semiring, tropical semiring, access control semiring and fuzzy semiring, from both an algorithmic and a complexity-theoretic viewpoint. An exciting recent development in the field of CSP/SAT solving has been the development of solvers for $\mathsf{LexSAT}$ , which seeks to find the smallest lexicographic satisfying assignment of a formula [MSAGL11]. In this regard, Theorem 3.2 opens up exciting directions of future work to develop efficient techniques for $\mathsf{optConf}$ .

7 Acknowledgements

We thank Val Tannen for introducing us to the world of semiring semantics and for helpful conversations during the nascent stages of the project. We thank the anonymous reviewers of AAAI-23 for valuable comments. This research is supported by the National Research Foundation under the NRF Fellowship Programme [NRF-NRFFAI1-2019-0004] and Campus for Research Excellence and Technological Enterprise (CREATE) program. Bhattacharyya was supported in part by the NRF Fellowship Programme [NRF-NRFFAI1-2019-0002] and an Amazon Research Award. Vinod was supported in part by NSF CCF-2130608 and NSF HDR:TRIPODS-1934884 awards. Pavan was supported in part by NSF CCF-2130536, and NSF HDR:TRIPODS-1934884 awards.

Bibliography

[ADT11] Yael Amsterdamer, Daniel Deutch, and Val Tannen. Provenance for aggregate queries. In Proc. of PODS, pages 153–164, 2011.
[AS08] Noga Alon and Joel H. Spencer. The Probabilistic Method, Third Edition. Wiley-Interscience series in discrete mathematics and optimization. Wiley, 2008.
[BG06] Stefano Bistarelli and Fabio Gadducci. Enhancing constraints manipulation in semiring-based formalisms. In ECAI, volume 141, pages 63–67, 2006.
[Bis04] Stefano Bistarelli. Semirings for soft constraint solving and programming, volume 2962. Springer Science & Business Media, 2004.
[BMR95] Stefano Bistarelli, Ugo Montanari, and Francesca Rossi. Constraint solving over semirings. In IJCAI (1), pages 624–630. Citeseer, 1995.
[BMR97] Stefano Bistarelli, Ugo Montanari, and Francesca Rossi. Semiring-based constraint satisfaction and optimization. J. ACM, 44(2):201–236, 1997.
[BMR+99] Stefano Bistarelli, Ugo Montanari, Francesca Rossi, Thomas Schiex, Gérard Verfaillie, and Hélène Fargier. Semiring-based CSPs and valued CSPs: Frameworks, properties, and comparison. Constraints, 4(3):199–240, 1999.
[Cui02] Yingwei Cui. Lineage tracing in data warehouses. PhD thesis, Stanford University, 2002.
