Facets of Distribution Identities in Probabilistic Team Semantics
Miika Hannula, {\AA}sa Hirvonen, Juha Kontinen, Vadim Kulikov, and, Jonni Virtema

TL;DR
This paper explores the expressive power of probabilistic team semantics, focusing on logical and probabilistic dependencies, and addresses the complexity of the implication problem of conditional independence.
Contribution
It classifies the expressive power of various probabilistic atoms and relates the framework to the first-order theory of the reals, advancing understanding of probabilistic dependencies.
Findings
Classifies expressive power of probabilistic atoms
Relates probabilistic team semantics to the first-order theory of reals
Addresses complexity of the implication problem of conditional independence
Abstract
We study probabilistic team semantics which is a semantical framework allowing the study of logical and probabilistic dependencies simultaneously. We examine and classify the expressive power of logical formalisms arising by different probabilistic atoms such as conditional independence and different variants of marginal distribution equivalences. We also relate the framework to the first-order theory of the reals and apply our methods to the open question on the complexity of the implication problem of conditional independence.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\NewEnviron
repproposition[1]
Proposition 1
\BODY
\NewEnviron
reptheorem[1]
Theorem 0.1
\BODY
\NewEnviron
replemma[1]
Lemma 1
\BODY
11institutetext: University of Helsinki, Finland 11email: {miika.hannula,asa.hirvonen,juha.kontinen}@helsinki.fi 22institutetext: Aalto University, Finland, 22email: [email protected] 33institutetext: Hasselt University, Belgium, 33email: [email protected]
Facets of Distribution Identities in Probabilistic Team Semantics††thanks: The first and the third author were supported by grant 308712, the fourth by grant 285203 of the Academy of Finland.
Miika Hannula 11 0000-0002-9637-6664
Åsa Hirvonen 11 0000-0003-2149-4153
Juha Kontinen 11 0000-0003-0115-5154
Vadim Kulikov 1122
Jonni Virtema 33 0000-0002-1582-3718
Abstract
We study probabilistic team semantics which is a semantical framework allowing the study of logical and probabilistic dependencies simultaneously. We examine and classify the expressive power of logical formalisms arising by different probabilistic atoms such as conditional independence and different variants of marginal distribution equivalences. We also relate the framework to the first-order theory of the reals and apply our methods to the open question on the complexity of the implication problem of conditional independence.
Keywords:
team semantics probabilistic logic conditional independence
1 Introduction
Team semantics, introduced by Hodges [20] and popularised by Väänänen [25], shifts the focus of logics away from assignments as the primitive notion connected to satisfaction. In team semantics formulae are evaluated with respect to sets of assignments (i.e., teams) as opposed to single assignments of Tarskian semantics. During the last decade the research on team semantics has flourished, many logical formalisms have been defined, and surprising connections to other fields identified. In particular, several promising application areas of team semantics have been identified recently. Krebs et al. [22] developed a team based approach to linear temporal logic for the verification of information flow properties. In applications to database theory, a team corresponds exactly to a database table (see, e.g., [16]). Hannula et al. [18] introduced a framework that extends the connection of team semantics and database theory to polyrelational databases and data exchange.
The focus of this article is probabilistic team semantics which connects team based logics to probabilistic dependency notions. Probabilistic team semantics is built compositionally upon the notion of a probabilistic team, that is, a probability distribution over variable assignments. While the first ideas of probabilistic teams trace back to the works of Galliani [11] and Hyttinen et al. [21], the systematic study of the topic was initiated and further continued by Durand et al. in [8, 9]. It is worth noting that in [2] so-called causal teams have been introduced to logically model causality and interventions. Probabilistic team semantics has also a close connection to the area of metafinite model theory [14]. In metafinite model theory, finite structures are extended with an another (infinite) domain sort such as the real numbers (often with arithmetic) and with weight functions that work as a bridge between the two sorts. This approach provides an elegant way to model weighted graphs and other structures that refer to infinite structures. The exact relationship between probabilistic team semantics and logics over metafinite models as well as with probabilistic databases of [6] will be a topic of future research.
The starting point of this work comes from [9] in which probabilistic team semantics was defined following the lines of [11]. The main theme in [9] was to characterize logical formalisms in this framework in terms of existential second-order logic. Two main probabilistic dependency atoms were examined. The probabilistic conditional independence atom states that the two variable tuples and are independent given the third tuple . The marginal identity atom states that the marginal distributions induced from the two tuples and (of the same length) are identical. The extension of first-order logic with these atoms () was then shown to correspond to a two-sorted variant of existential second-order logic that allows a restricted access to arithmetical operations for numerical function terms. What was left unexamined were the relationships between different logical formalisms in probabilistic team semantics. In fact, it was unknown whether there are any meaningful probabilistic dependency notions such that the properties definable with one notion are comparable to those definable with another.
In this article we study the relative expressivity of first-order logic with probabilistic conditional independence atoms () and with marginal identity atoms (). The logic is a probabilistic variant of inclusion logic that is strictly less expressive than independence logic, after which is modelled [12, 15]. In addition, we examine which is another extension defined in terms of so-called marginal distribution equivalence. The marginal distribution equivalence atom for two variable tuples and (not necessarily of the same length) relaxes the truth condition of the marginal identity atom in that the two distributions induced from and are required to determine the same multisets of probabilities. The aforementioned open question is now answered in the positive. The logics mentioned above are not only comparable, but they form a linear expressivity hierarchy: . We also show that enjoys a union closure property that is a generalization of the union closure property of inclusion logic, and that conditional independence atoms can be defined with an access to only marginal independence atoms between two variable tuples. Furthermore, we show that, surprisingly, corresponds to , where refers to the dependence atom defined as a declaration of functional dependence over the support of the probabilistic team. The question whether is strictly less expressive than is left as an open question; in team semantics the corresponding logics are known to be equivalent. The above findings look outwardly very similar to many results in team semantics. However, it is important to note that, apart perhaps from the union closure property, the results of this paper base on entirely new ideas and do not recycle old arguments from the team semantics context.
We also investigate (quantified) propositional logics with probabilistic team semantics. By connecting these logics to the arithmetic of the reals we show upper bounds for their associated computational problems. Our results suggest that the addition of probabilities to team semantics entails an increase in the complexity. Satisfiability of propositional team logic (), i.e., propositional logic with classical negation is in team semantics known to be complete for alternating exponential time with polynomially many alternations [19]. Shifting to probabilistic team semantics analogous problems are here shown to enjoy double exponential space upper bound. This is still lower than the complexity of satisfiability for modal team logic () in team semantics, known to be complete for the non-elementary complexity class which consists of problems solvable in time restricted by some tower of exponentials of polynomial height [23]. One intriguing consequence of our translation to real arithmetic is that the implication problem of conditional independence statements over binary distributions is decidable in exponential space. The decidability of this problem is open relative to all discrete probability distributions [24].
2 Preliminaries
First-order variables are denoted by and tuples of first-order variables by . By we denote the set of variables that appear in the variable sequence . The length of the tuple is denoted by . A vocabulary is a set of relation symbols and function symbols with prescribed arities. We mostly denote relation symbols by and function symbols by , and the related arities by and , respectively. The closed interval of real numbers between [math] and is denoted by . Given a finite set , a function is called a (probability) distribution if . In addition, the empty function is a distribution.
The probabilistic logics investigated in this paper are extensions of first-order logic over a vocabulary given by the grammar rules:
[TABLE]
where is a tuple of first-order variables and a relation symbol from .
Let be a finite set of first-order variables and be a nonempty set. A function is called an assignment. For a variable and , the assignment is equal to with the exception that . A team is a finite set of assignments from to . The set is called the domain of (written ) and the set the range of (written ). Let be a team with range , and let be a function. We denote by the modified team , and by the team . A probabilistic team is a distribution . Let be a -structure and a probabilistic team such that the domain of is the range of . Then we say that is a probabilistic team of . In the following, we will define two notations and . Let be a probabilistic team, a finite non-empty set, the set of all probability distributions , and a function. We denote by the probabilistic team such that
[TABLE]
for each and . Note that if does not belong to the domain of then the righthand side of the above equation is simply . By we denote the probabilistic team defined such that
[TABLE]
for each and . Again if does not belong to the domain of , can be dropped from the above equation.
If and are probabilistic teams and , then we write for the -scaled union of and , that is, the probabilistic team defined such that for each .
We may now define probabilistic team semantics for first-order formulae. The definition is the same as in [9]. The only exception is that it is here applied to probabilistic teams that have real probabilities, whereas in [9] rational probabilities were used.
Definition 1
Let be a probabilistic -structure over a finite domain , and a probabilistic team of . The satisfaction relation for first-order logic is defined as follows:
[TABLE]
Probabilistic team semantics is in line with Tarskian semantics for first-order formulae ():
[TABLE]
In particular the non-classical semantics for negation is required for the above equivalence to hold.
In this paper we consider three probabilistic atoms: marginal identity, probabilistic independence, and marginal distribution equivalence atom. The first two were first introduced in the context of multiteam semantics in [8], and they extend the notions of inclusion and independence atoms from team semantics [12].
We define where is a tuple of variables and a tuple of values, as
[TABLE]
If is some first-order formula, then is defined analogously as the total sum of weights of those assignments in that satisfy .
If are variable sequences of length , then is a marginal identity atom with the following semantics:
[TABLE]
Note that the equality in (4) can be equivalently replaced with since the tuples range over for a finite (see [8, Definition 7] for details). Due to this alternative formulation, marginal identity atoms were in [8] called probabilistic inclusion atoms. Intuitively, the atom states that the distributions induced from and are identical.
The marginal distribution equivalence atom is defined in terms of multisets of assignment weights. We distinguish multisets from sets by using double wave brackets, e.g., denotes the multiset where and are given multiplicities and . If are variable sequences, then is a marginal distribution equivalence atom with the following semantics:
[TABLE]
The next example illustrates the relationships between marginal distribution equivalence atoms and marginal identity atoms; the latter implies the former, but not vice versa.
Example 1
Let be the probabilistic team depicted in Figure 1. The team satisfies the atoms , , , and . The team falsies the atom , whereas is not a well formed formula.
If are variable sequences, then is a probabilistic conditional independence atom with the satisfaction relation defined as
[TABLE]
if for all it holds that
[TABLE]
Furthermore, we define probabilistic marginal independence atom as , i.e., probabilistic independence conditioned by the empty tuple.
In addition to atoms based on counting or arithmetic operations, we may also include all dependency atoms from the team semantics literature. Let be an atom that is interpreted in team semantics, let be a finite structure, and a probabilistic team. We define if , where consists of those assignments of that are given positive weight by . In this paper we will discuss dependence atoms also in the context of probabilistic team semantics. If are two variable sequences, then is a dependence atom with team semantics:
[TABLE]
A dependence atom of the form is called a constancy atom, written in shorthand notation. Dependence atoms can be expressed by using probabilistic independence atoms. This has been shown for multiteams in [8], and the proof applies to probabilistic teams.
Proposition 2 ([8])
Let be a structure, a probabilistic team of , and and two sequences of variables. Then
Given a collection of atoms from , we write for the logic that extends with the atoms in .
Example 2
Let be univariate distributions. Then is a finite mixture of if it can be expressed as a convex combination of , i.e., if there are non-negative real numbers such that and . A probabilistic team gives rise to a univariate distribution for each variable from the domain of . The next formula expresses that the distribution is a finite mixture of the distributions :
[TABLE]
where the indices are also thought of as distinct constants, and stands for . The non-negative real numbers are represented by the weights of where is distributed independently of each . The summand is then represented by the weight of and by the weight of . The quantified subformula expresses that the former weight matches the weight of , which implies that is .
Example 3
Probabilistic team semantics can be also used to model properties of data obtained from a quantum experiment (adapting the approach of [1]). Consider a probabilistic team over variables . The intended interpretation of is that the joint probability that was measured with outcome , for , is . In this setting many important properties of the experiment can be expressed using our formalism. For example the formula
[TABLE]
expresses a property called Outcome-Independence; given the measurements , the outcome at is independent of the outcomes at other positions. The dependence atom on the other hand corresponds to a property called Weak-Determinism. Moreover, if describes some property of hidden-variable models (Outcome-Independence, etc.), then the formula expresses that the experiment can be explained by a hidden-variable model satisfying that property.
The next example relates probabilistic team semantics to Bayesian networks. The example is an adaptation of an example discussed also in [8].
Example 4
Consider the Bayesian network in Fig. 2 that models beliefs about house safety using four Boolean random variables thief, cat, guard and alarm. We refer to these variables by . The dependence structure of a Bayesian network is characterized by the so-called local directed Markov property stating that each variable is conditionally independent of its non-descendants given its parents. For our network the only non-trivial independence given by this property is . Hence a joint distribution over factorizes according to if satisfies . In this case can be factorized by
[TABLE]
where, for instance, abbreviates either or , and is the probability of given . The joint probability distribution (i.e., the team ) can hence be stored as in Fig. 2. Note that while expresses the independence statement , -formulas can be used to further refine the joint probability distribution as follows. Assume we have information suggesting that we may safely assume an formula on :
- •
indicates that guard never raises alarm in absence of thief. In this case the two bottom rows of the conditional probability distribution for guard become superfluous.
- •
the assumption that is satisfied also exemplifies an interesting form of contex-specific independence (CSI) that cannot be formalized by the usual Bayesian networks (see, e.g., [7]). Namely, implies that guard is independent of cat in the context . Interestingly such CSI statements can be formalized utilizing the disjunction of :
[TABLE]
- •
satisfaction of would imply that alarm and guard have the same reliability for any given value of thief and cat. Consequently, the conditional distributions for alarm and guard are equal and one of the them could be removed.
The following locality property dictates that satisfaction of a formula in probabilistic team semantics depends only on the free variables of . For this, we define the restriction of a team to as where denotes the restriction of the assignment to . The restriction of a probabilistic team to is then defined as the probabilistic team where The set of free variables of a formula over probabilistic team semantics is defined recursively as in first-order logic; note that for any atom , consists of all variables that appear in .
Proposition 3 (Locality, [9])
Let be a formula with free variables from . Then for all structures and probabilistic teams where ,
Given two logics and over probabilistic team semantics, we write if for all open formulae there is a formula such that , for all structures and probabilistic teams . The equality ”” and strict inequality ”” relations between and are defined from ”” in the standard way.
Alternative Definition.
Probabilistic teams can also be defined as mappings that have no restriction for the total sum of assignment weights, being the set of all non-negative reals. Probabilistic team semantics with respect to such real weighted teams is then given exactly as in Definition 1, except that we define disjunction without scaling:
[TABLE]
where the union is defined such that for each . Whether interpreting probabilistic teams as probability distributions or just mappings from assignments to non-negative reals does not make any difference in our framework. Hence we write for a probabilistic team that is a distribution such that , and for a probabilistic team that is any mapping from assignments to non-negative reals. A probabilistic team of the former type is then a special case of that of the latter. We will use both notions and their associated semantics interchangeably. If we need to distinguish between the two semantics, we write and respectively for the scaled (i.e., Definition 1) and non-scaled variants. Given and , we write for the total weight of , and for the probabilistic team such that for all . The proposition below follows from a straightforward induction (see Appendix 0.A).
{repproposition}
prop Let be a structure, a probabilistic team of , and . Then
3 Expressiveness of
Let be a probabilistic team where is a finite set of assignements from a finite set of variables. A variable is uniformly distributed in over a set of values , if
[TABLE]
The following lemma says essentially that if we can express constancy and independence for a uniform distribution, then we can express . Note that it may happen that we can express “ uniformly distributed and independent of ” even when we cannot express “ is independent of ” in general. For a proof of the lemma, see Appendix 0.B.
{replemma}
lem Let be structure with at least two elements and an -tuple of variables. Let be a formula such that for all probabilistic teams , whose variable domain includes and for which and , it holds that
[TABLE]
Then can be expressed for -tuples and using and the constancy atom.
Theorem 3.1
.
Proof
Proposition 2 established that the constancy atom can be equivalently expressed by the independence atom . Hence it is enough to show that we can define the formula of Lemma 3 by using .
Let and be as assumed in Lemma 3. We use below as an abbreviation for , and for \forall b(b\neq c_{1}\land b\neq c_{2})\lor\big{(}(b=c_{1}\lor b=c_{2})\land\theta\big{)}. Define as
[TABLE]
It suffices to prove (5). The formula clearly states that and are independent. The formula also states that the values of range over the values of and . It remains to be shown, conditioned on that and are independent, that
[TABLE]
Note that, by assumption of Lemma 3, and are distinct constants. Let be a team obtained from by the quantification of and . By the definition of universal quantification, in is uniformly distributed and independent of everything else except maybe . Note that is uniformly distributed over the values of and in if and only if it is in .
If is uniformly distributed over the values of and , then picking values of with a uniform probability such that the right conjunct in
[TABLE]
holds clearly yields a team in which the left conjunct also holds. However, if is not uniformly distributed over and , then picking values for such that the right conjunct of (7) holds will yield that is not independent on . ∎
We also note that conditional independence is definable using marginal independence. The proof applies ideas from [9] and can be found in Appendix 0.C. {reptheorem}thm .
4 Expressiveness of and
Initially it may seem that first-order logic with marginal distribution equivalence atoms is less expressive than that with marginal identity atoms, as the former atoms are given a strictly weaker truth condition. Contrary to this intuition, however, we will in this section show that is actually strictly more expressive than . The result is proven in two phases. First, in Sect. 4.1 we show that dependence and marginal identity can be defined in , the former by a single marginal distribution equivalence atom and the latter by a more complex formula. Second, in Sect. 4.2 we show that the expressiveness of is restricted by a union closure property which is similar to that of inclusion logic in team semantics. Since dependence atoms lack this property, the strict inequality between and follows.
4.1 Translations of Dependence and Marginal Identity to
We observe first that dependence atoms can be expressed in terms of marginal distribution equivalence atoms, which in turn are definable using marginal identity and dependence atoms.
Proposition 4
The following equivalences hold:
, 2. 2.
.
Defining marginal identity atoms in is more cumbersome. Let be a probabilistic team, and a quantifier-free first-order formula over the empty vocabulary (i.e., such that its satisfaction depends only on the variable assignment). We define as the probabilistic team such that if satisfies , and otherwise. Given two sequences of variables and , we write as a shorthand for .
Theorem 4.1
* is equivalent to where*
[TABLE]
Proof
Assume that are all -ary. Let be a structure with domain , and let a probabilistic team. Assume first that , that is, for all , the weights and coincide. It suffices to show that for where is and is the probabilistic team obtained from by distributing to uniformly. For each we consider three weight measures, obtained by dividing assignments associated with into three parts, , , and . Then
[TABLE]
Observe that for we first partition each assignment in uniformly to parts in terms of the value of and then keep only those parts where holds. Since and disagree for every assignment in , the total weight of is obtained by multiplying with . For we have identical and , and hence its weight is obtained by multiplying with . By analogous reasoning we obtain that
[TABLE]
Since our assumption implies for all , the claim now follows from the observation that are identical multisets for .
Vice versa, assuming we show . Let the weights and the probabilistic team be as above. By assumption we have , and thus the following multisets are identical:
[TABLE]
where and . Assume to the contrary that , that is, for some . Observe that whenever agree, then contributes the same weight to all , , and . Therefore, we may assume without loss of generality that for all . Assume that is the smallest element from . Since , it follows that for some . If , then which contradicts the assumption that is smallest. Since , similar contradiction follows from , too. Hence, which concludes the proof. ∎
The following theorem now combines the results of this section. Note that the translations to both directions are of linear size.
Theorem 4.2
.
4.2 Scaled Union Closure of
Inclusion logic is known to be union closed over teams. This means that for all structures , teams , and inclusion logic formulae : if and , then . The following proposition, proven in Appendix 0.D, demonstrates that is endowed with an analogous closure property, namely, that all formulae of are closed under all -scaled unions of probabilistic teams. {repproposition}prop:ucl Let be a model, a formula, and and two probabilistic teams. Then for all :
[TABLE]
As a corollary we observe that is strictly weaker than . Recall from Proposition 4 that the constancy atom is definable in . However, constancy is clearly not preserved under -scaled unions, therefore falling outside the scope of . Furthremore, by Theorem 4.1 is at least as expressive as .
Corollary 1
.
5 Binary Probabilistic Teams
In this section we restrict attention to binary probabilistic teams and propositional logic extended with quantifiers (see [17] for related work). We define the syntax of quantified propositional logic by the following grammar
[TABLE]
where is a proposition variable. The probabilistic team semantics of is defined analogously to that of first-order formulae. We say that a probabilistic team is binary if assigns variables into . For a formula and a binary probabilistic team , we write iff , where is the first-order formula obtained from by substituting for and for , and letting . Furthermore, we denote classical negation by ””. That is, we write if . We let denote the logic obtained by the grammar (8) extended with , and denote by the extension of by any collection of dependencies .
We observe that can be interpreted as statements of real arithmetic. As truth in real arithmetic is decidable, this gives us some fairly conservative upper bounds with respect to the complexity of satisfiability and validity of . We say that is satisfiable if is satisfied by some non-empty binary probabilistic team.111Empty team satisfies every formula without ; with it is a non-interesting special case [19]. Also, is valid is is satisfied by all binary probabilistic teams. Note that the free variables of a formula are defined analogously to the first-order case.
Theorem 5.1
For each (, resp.) there exists a first-order sentence over vocabulary (, resp.) such that is satisfiable iff (, resp.).
Proof
We show that satisfiability of a formula is definable in real arithmetic in terms of the non-scaled variant of probabililistic team semantics. For a given tuple of proposition variables, we introduce fresh first-order variables for each propositional assignment , where is a binary string of length . We write to denote the complete tuple of these variables. For a listing the free variables of , we define
[TABLE]
where the mapping is defined recursively as follows:
- •
If is a propositional literal, then .
- •
If is , where for some , then is defined as
[TABLE]
- •
If is , where for some , then
[TABLE]
- •
If is , then .
- •
If is , then .
- •
If is , then
[TABLE]
- •
If is , then
[TABLE]
- •
If is , then
[TABLE]
It is straightforward to check that the claim follows. ∎
From the translation above we immediately obtain some complexity bounds for the satisfiability and validity problems of quantified propositional logics over probabilistic team semantics. We write for the class of problems solvable in space , and (, resp.) for the class of problems solvable by alternating Turing machine in time (, resp.) with many alternations, where is a polynomial.
Theorem 5.2
The satisfiability/validity problems of the logics and are in and , respectively.
Proof
By the proof of Theorem 5.1, satisfiability and validity of quantified propositional formulae can be reduced to truth of a real arithmetic sentence of size . The stated upper bounds for and then follow because the theory of real-closed fields, , is in [3], and the theory of real addition, , is in [4, 10]. ∎
We also obtain an upper bound for the implication problem of conditional independence over binary probability distributions. The implication problem for conditional independence is given as a finite set of conditional independence statements, and the problem is to decide whether all probability distributions that satisfy satisfy also . It is a famous open problem to determine whether implication of conditional independence is decidable over discrete distributions. Since binary probabilistic teams can be interpreted as discrete distributions of binary random variables, we obtain that the implication problem for conditional independence statements is decidable in exponential space over binary distributions. The result follows since any instance of such an implication problem can be expressed as an existential formula of exponential size (Theorem 5.1), and since the existential theory of real-closed fields is in [5].
Corollary 2
The implication problem for conditional independence over binary probability distributions is in .
It may be conjectured that the obtained complexity bounds are not optimal. The first-order translations provide only access to a very restricted type of arithmetic expressions. For instance, real multiplication is only available between sums of reals from the unit interval. We leave it as an open problem to determine whether the results of this section can be optimized using more refined arguments.
6 Conclusions and further directions
We have studied probabilistic team semantics in association with three notions of dependency atoms: probabilistic independence, marginal identity, and marginal distribution equivalence atoms. Our investigations give rise to an overall classification that is already familiar from the team semantics context (see Table 1). Similar to inclusion logic () in team semantics, we observed that enjoys a union closure property which renders it strictly less expressive than . A further analogous fact is that both dependence and marginal identity are definable with conditional independence, which in turn is definable using only marginal independence. An interesting open question is to determine the relationship between (or equivalently ) and . Contrary to the picture arising from team semantics, we conjecture that the latter is strictly more expressive.
One motivation behind our marginal distribution equivalence atom was that it seemed to be weaker than marginal identity but still enough to guarantee the same entropy of two distributions. A natural next step would be to consider some form of entropy atom/atoms and study the expressive power of the resulting logics. The exact formulation of such atoms will make all the difference, as one can detect both functional dependencies and marginal independence if one has full access to the conditional entropy as a function.
We also studied (quantified) propositional logics with probabilistic team semantics. By connecting real-valued probabilistic teams to real arithmetic we showed upper bounds for computational problems associated with these logics. As a consequence of our translation to real arithmetic we also obtained an upper bound for the implication problem of conditional independence statements over binary distributions.
Appendix 0.A Proof of Proposition 2
Proposition LABEL:prop
Proof
The cases for first-order literals, , , and the conjunction are immediate. The claim for the independence atom follows from the equivalence below together with the observation that the former is the definition of the atom in the unscaled team whereas the latter is equivalent to that of the scaled team .
[TABLE]
The case for disjuction follows from the following chain of equivalences
[TABLE]
where the last equivalence follows form the definition of the disjunction for and , since
[TABLE]
The cases for the quantifiers are similar; we show the case for the universal quantifier
[TABLE]
where the second last equivalence follows, since and .∎
Appendix 0.B Proof of Lemma 3
Lemma LABEL:lem
Proof
We will write a formula which is to be equivalent with . But first we need to define an auxiliary formula . Define
[TABLE]
This formula says that always equals either or and is a “detector” for which one it is. We use the abbreviation below to denote . Now define
[TABLE]
Suppose holds in a team over variables and . We want to show that is satisfied by . Let be the expansion of obtained by the quantification of , , and . We may assume that , were picked such that they attain constant but distinct values. Also note that is independent of all other variables and uniformly distributed over the domain of . Now let be a variable that takes its values from the values of and such that it “detects” whether equals or not (value of is the value of iff and have the same value). Let be the expansion of by this . We need to check that satisfies
[TABLE]
Let be the maximal subteam of where . So now we have to check that
[TABLE]
holds in . Recall that says in particular that equals either or , so (9) holds in if and only if holds in the maximal subteam of in which this is the case. We also just defined to attain the value if and only if and the only other option is that in which case , so is satisfied. What about ; note that is such that (6) holds. Now fix any value of in . Since holds, we have . When we expand to and further to this property is (clearly) preserved. It is also preserved when we take the subteam , because when we move from to , we only remove assignments where , so if an assignment with is deleted, then also an assignment with is deleted (the same assignment). When we move to we still have which follows from the fact that is independent of . Therefore
[TABLE]
But this means that conditioned on , is uniformly distributed in . Since this holds for any , is uniformly distributed and independent of as desired and is satisfied by .
Suppose now that a team satisfies . We want to show that . But the chain of reasoning above also works “backwards”. Fix a value of . We want to show that . It is clear that it is sufficient to look at as defined above. But because says that is a “detector” of whether or not, it is in fact sufficient to check for the subteam (also as defined above). But in , this follows from .∎
Appendix 0.C Proof of Theorem 3
Theorem 3 follows from Lemma 3 presented below. Lemma 3 can be proven following the proof of Theorem 2 in [9]. We omit the details and instead delineate intuition behind the translation. The idea is to simulate the semantics of the probabilistic conditional independence atom using only marginal independence and marginal identity atoms. First, the universally quantified in the translation represents all possible variable assignments of . Second, and indicate that the marginal distributions of , , , and are distributed respectively to independently of and of each other. Third, encodes the product of the weights of and by , and similarly the product of the weights of and by . Finally, conditional independence between and given follows iff these products are equal relative to all assignments of . Theorem 3 then follows from this lemma since the constant [math] and the marginal identity atom are both definable in .
Lemma 3
Let be three sequences of variables from , and let [math] be a constant symbol. Then is equivalent to
[TABLE]
where
[TABLE]
Appendix 0.D Proof of Proposition 4.2
Proposition LABEL:prop:ucl
Proof
We may assume that and . We prove the claim by structural induction on . We omit the cases for atomic formulae and conjunction which are straightforward.
- •
Assume that . By the semantics of the disjunction, we find and distributions over such that , , , , , and . Define and . By the induction hypothesis and , since for , and for . Then for because
[TABLE]
Consequently, follows from the semantics of the disjuction which completes the disjunction step of the induction.
- •
Assume that . Then and , and by induction assumption . The claim then follows since .
- •
Assume that . Then and where and are functions that map each to a probability distribution over . We let be a function that maps to a probability distribution over such that
[TABLE]
Note that follows from . By induction assumption . The claim now follows from , which holds since for all :
[TABLE]
This concludes the case of existential quantification and the proof.∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Abramsky, S.: Relational hidden variables and non-locality. Studia Logica 101 (2), 411–452 (2013)
- 2[2] Barbero, F., Sandu, G.: Interventionist counterfactuals on causal teams. In: Finkbeiner, B., Kleinberg, S. (eds.) Proceedings 3rd Workshop on formal reasoning about Causation, Responsibility, and Explanations in Science and Technology, Thessaloniki, Greece, 21st April 2018. Electronic Proceedings in Theoretical Computer Science, vol. 286, pp. 16–30. Open Publishing Association (2019). https://doi.org/10.4204/EPTCS.286.2
- 3[3] Ben-Or, M., Kozen, D., Reif, J.: The complexity of elementary algebra and geometry. Journal of Computer and System Sciences 32 (2), 251 – 264 (1986)
- 4[4] Berman, L.: The complexity of logical theories. Theoretical Computer Science 11 (1), 71 – 77 (1980)
- 5[5] Canny, J.: Some algebraic and geometric computations in pspace. In: Proceedings of the Twentieth Annual ACM Symposium on Theory of Computing. pp. 460–467. STOC ’88, ACM, New York, NY, USA (1988)
- 6[6] Cavallo, R., Pittarelli, M.: The theory of probabilistic databases. In: Proceedings of the 13th International Conference on Very Large Data Bases. pp. 71–81. VLDB ’87, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (1987)
- 7[7] Corander, J., Hyttinen, A., Kontinen, J., Pensar, J., Väänänen, J.: A logical approach to context-specific independence. In: Väänänen, J.A., Hirvonen, Å., de Queiroz, R.J.G.B. (eds.) Logic, Language, Information, and Computation - 23rd International Workshop, Wo LLIC 2016, Puebla, Mexico, August 16-19th, 2016. Proceedings. Lecture Notes in Computer Science, vol. 9803, pp. 165–182. Springer (2016). https://doi.org/10.1007/978-3-662-52921-8_11
- 8[8] Durand, A., Hannula, M., Kontinen, J., Meier, A., Virtema, J.: Approximation and dependence via multiteam semantics. Ann. Math. Artif. Intell. 83 (3-4), 297–320 (2018), https://doi.org/10.1007/s 10472-017-9568-4 · doi ↗
