Choosing with unknown causal information: Action-outcome probabilities for decision making can be grounded in causal models
Mauricio Gonzalez Soto, David Danks, Hugo J. Escalante Balderas, L. Enrique Sucar

TL;DR
This paper demonstrates how decision-making probabilities can be grounded in causal models, even when the causal mechanisms are unknown, extending classical decision theories and strategic equilibrium concepts.
Contribution
It extends causal decision-making frameworks to unknown causal models, integrating causal information into strategic game analysis.
Findings
Probabilities can be grounded in causal models with unknown mechanisms.
Extended Nash Equilibrium concept considering causal information.
Generalized causal decision-making framework for unknown environments.
Abstract
Decision-making under uncertainty and causal thinking are fundamental aspects of intelligent reasoning. Decision-making has been well studied when the available information is considered at the associative (probabilistic) level. The classical Theorems of von Neumann-Morgenstern and Savage provide a formal criterion for rational choice using associative information: maximize expected utility. There is an ongoing debate around the origin of probabilities involved in such calculation. In this work, we will show how the probabilities for decision-making can be grounded in causal models by considering decision problems in which the available actions and consequences are causally connected. In this setting, actions are regarded as an intervention over a causal model. Then, we extend a previous causal decision-making result, which relies on a known causal model, to the case in which the causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Choosing with unknown causal information: Action-outcome probabilities for decision making can be grounded in causal models
Mauricio Gonzalez Soto1,2
David Danks3
Hugo J. Escalante Balderas2
Luis E. Sucar Succar2
(1FSG NeuroInformatics, Faculty of Computer Science, University of Vienna.
2Coordinación de Ciencias Computacionales, Instituto Nacional de Astrofísica Óptica y Electrónica (INAOE), México
3Halicioglu Data Science Institute and Department of Philosophy, University of California, San Diego.
)
Abstract
Decision-making under uncertainty and causal thinking are fundamental aspects of intelligent reasoning. Decision-making has been well studied when the available information is considered at the associative (probabilistic) level. The classical Theorems of von Neumann-Morgenstern and Savage provide a formal criterion for rational choice using associative information: maximize expected utility. There is an ongoing debate around the origin of probabilities involved in such calculation. In this work, we will show how the probabilities for decision-making can be grounded in causal models by considering decision problems in which the available actions and consequences are causally connected. In this setting, actions are regarded as an intervention over a causal model. Then, we extend a previous causal decision-making result, which relies on a known causal model, to the case in which the causal mechanism that controls some environment is unknown to a rational decision-maker. In this way, action-outcome probabilities can be grounded in causal models in known and unknown cases. Finally, as an application, we extend the well-known concept of Nash Equilibrium to the case in which the players of a strategic game consider causal information.
1 Introduction
An important aspect of acting in the world is being able to make decisions under uncertain conditions: Which route should I use to get to work? Could there be traffic? If one wishes to avoid certain incoherent, inconsistent behavior, then the natural and most well-known idea is to balance between how desired an object is, and how available is it. Such balancing is to be done via the calculation of the expected utility (Bernoulli,, 1954; Von Neumann and Morgenstern,, 1944; Savage,, 1954), where how desired an object is, is measured in terms of the utility it produces; and how available is it is encoded in terms of probabilities. Such probabilities are required to either be objectively known or subjectively (internally) defined by a decision maker. There is no standard agreement in the decision-making literature on where such probabilities must come from.
Acting in the world is conceived by human beings as causally intervening on it, and it is known that humans are also able to learn and use causal relations while making choices (Lagnado et al.,, 2007; Hagmayer et al.,, 2007; Hagmayer and Meder,, 2008; Hagmayer and Sloman,, 2009; Hagmayer and Meder,, 2013; Hagmayer and Fernbach,, 2017). Thinking in terms of causes and effects is an everyday task and in fact, causal reasoning is to be found at the very core of our minds since we constantly ask why: Why do we get sick? Why does a drug work? (Tversky and Kahneman,, 1977; Spirtes et al.,, 2000; Waldmann and Hagmayer,, 2013; Danks,, 2014; Lake et al.,, 2017; Pearl and Mackenzie,, 2018; Neil et al.,, 2019).
If, as argued, another important aspect of acting in the world is making choices, then taking causal information as a basis is fundamental for decision-making. Therefore, it is a natural question how to formalize decision making when causal information (known or unknown) is present? Answering the such question is relevant given the importance of causal relations as well as reasoning in everyday life and in science (Spirtes et al.,, 2000; Pearl and Mackenzie,, 2018). Given that human beings actually use causal information while making choices (Tversky and Kahneman,, 1980), and the importance of decision-making results based on associative information, it is desirable to have an explicit and computationally implementable criterion for decision-making, which also addresses the question of the origin of probabilities for decision making in a sensitive, reasonable way.
The previous question has been already considered by Nozick, (1969); Lewis, (1981); Joyce, (1999); Eells, (1982); Stern, (2017) as well as by Pearl, (2009) who provides an optimality criterion for decision making under causal-controlled uncertainty when the causal mechanism which controls the environment is known by the decision maker (Pearl,, 2009).
In this paper, we will provide a decision-making criterion for unknown causal information, and such criterion has the form of maximization of expected utility; thus showing that action-outcome probabilities for decision-making can be grounded in causal models.
The remainder of the paper goes as follows: In Section 2 we review the basics of Classical Decision Making: first we give some basic definitions; then we review the decision making results from von Neumann-Morgenstern and Savage, where we discuss the General Expected Utility Criterion, and finally we discuss about the problem around the origin of the action-outcome probabilities. In Section 3, the basics of Causation and Causal Graphical Models, as well as the basic definitions required for Causal Decision Making. In Section 4 we describe some fundamental previous results for Causal Decision Making, and in particular our main result in Section 4.2, finally in Section 5 we describe an application into the domain of Game Theory, where we are able to define a Causal Nash Equilibrium.
2 Classical Decision Making
A Decision Problem under Uncertainty is the mathematical model of a situation in which an agent must choose one out of many available actions with uncertain consequences which depend on different, possibly unknown, factors (Bernardo and Smith,, 1994; Robert,, 2007; Gilboa,, 2009). Such consequences are assumed to produce satisfaction in the decision maker, and this satisfaction is represented by a preference relation, defined over actions, and denoted by , where is read as being preferred to , in terms of the consequences that would be obtained by choosing action instead of action , if these were the only two available options and given the current knowledge the agent has. The agent is assumed to be rational; that is, it is assumed that the preference relation satisfies the so-called rationality or coherence axioms described in Appendix LABEL:appendixA.
Von Neumann and Morgenstern, gave an answer for how to make choices if rational preferences are assumed, utilities are unknown, and the stochastic relation (i.e., probabilities of events) between actions and outcomes are considered as objective and given to the decision maker: maximize expected utility with respect to a utility function whose existence is guaranteed (Von Neumann and Morgenstern,, 1944).
If probabilities are not known, then Savage, showed that a rational decision maker must choose as if she is maximizing the expected utility using a subjective probability distribution (Savage,, 1954).
Such theorems provide a formal criterion for associative, also called evidential, decision-making if rationality is assumed: maximize expected utility.
Other decision-making theories exist, such as Kahneman and Tversky’s Prospect Theory (Kahneman and Tversky,, 1979), Gilboa’s Case-Based Decision Theory (Gilboa and Schmeidler,, 1995), among others that are out of the scope of this work, since such theories defy the classical notion of rationality, which is the one we are adopting in this work.
In Section 4.1 we make reference to another theory, Joyce’s Causal Decision Making (Joyce,, 1999) as well as Pearl’s criterion for known causal information. For further details on classical (non-causal) decision making, see Bernardo and Smith, (1994) and Gilboa, (2009). For further applications of Decision Making in other fields, such as Quantum Mechanics, see Wallace, (2012); Berkovitz, (2012).
2.1 Preliminary Definitions
In this Section, we formally state the definitions required for a precise formulation of what follows. The notation and setting we use here is the one used by Bernardo and Smith, (1994). In Definitions 2.1 and 2.2, we consider a non-empty set and a countable set of available actions; for each action , a partition of and a set of consequences . Let be the -algebra generated by the union of every , and let the union of all over . Bernardo and Smith, (1994) derives the existence of a subjective probability measure from a set of coherence axioms, according to which a decision maker has some mechanism of quantifying uncertainty in terms of real numbers within the interval.
Note: In the context of Chapter 2 of Bernardo and Smith, (1994), from a preference relation defined over actions of the form , and the coherence axioms stated in the Appendix, further relationships , as well as can be defined both over consequences, as well as over events, where is to be read as considering more likely than , as well as , which is the conditional likelihood relation, but such development is beyond the scope of this work. Having said this, we can reduce the notation to simply But it must be kept in mind that the index set depends on as well as and each . We note that each links, or maps, a partition of uncertain events to a corresponding set of consequences Thus bridging this notation to the original formulation of Savage’s Theorem, where the objects of choice are functions from states, which are defined by Gilboa, (2009) as …an exhaustive list of all scenarios that might unfold…, to outcomes (Bernardo and Smith,, 1994; Gilboa,, 2009).
We will now first consider the general setting for a decision problem: an uncertain environment.
Definition 2.1**.**
Let be a non-empty set. An uncertain environment is a tuple . Where is a non-empty set of available actions, a set of consequences, and is a -algebra of events over .
If we consider the preferences of some decision maker over the set of actions of some uncertain environment, then we have a Decision Problem under Uncertainty (Bernardo and Smith,, 1994).
Definition 2.2**.**
A Decision Problem under Uncertainty is an uncertain environment endowed with a preference relation over .
As previously mentioned, we can identify, as done in Bernardo and Smith, (1994), consequences as actions by simply writing and say that consequence is preferred over consequence , denoted as , if action is preferred over action .
Definition 2.3**.**
A Decision Problem under Uncertainty is said to be bounded if there exists a pair of consequences and such that for every , .
Definition 2.4**.**
A Decision Problem under Uncertainty is said to be finite if the set of available actions is finite.
2.2 Objective Probabilities: von Neumann-Morgenstern Theory
Consider the following scenario: throwing a die. In this scenario, using DeFinetti’s Representation Theorem (De Finetti,, 1937; Schervish,, 1995), we can model such throws as independent realizations of a random variable that has a known fixed law. We can think of such a law as being objective. The Von Neumann and Morgenstern, (vNM) Theorem considers a scenario of decision under risk with rational preferences; this is, rationally choosing between alternatives with uncertain outcomes with known, objective, probabilities.
Formally, using the notation by Gilboa, (2009), we consider a set of available options. Let be the set of lotteries with finite support over . The objects of choice are elements , which are known to the decision maker; we represent the decision maker’s preferences by a preference relation that satisfies being complete, transitive, continuous, and a condition called independence. This family of conditions is called von Neumann-Morgenstern rationality axioms as described by Gilboa, (2009) and Schervish, (1995).
Theorem 2.5** (von Neumann-Morgenstern).**
A preference relation where is a set of lotteries with finite support over a set satisfies the von Neumann-Morgenstern rationality axioms if and only if there exists a function such that for every we have that
[TABLE]
The theorem states that if a rational decision maker knows the probabilities of obtaining a certain outcome, then she must choose as if maximizing the expected value of some function whose existence is guaranteed by Theorem 2.5. See Gilboa, (2009) for details on the proof.
2.3 Subjective Probabilities: Savage
Thanks to DeFinetti’s Theorem, we can think of objective probabilities as long-term frequencies, but the question remains open regarding inquiries of the type: what is the probability that John Doe was born on May 1st? It is not the case that John Doe sometimes is born one day and sometimes born again some other day. In this context, we may talk of probabilities as specifying beliefs an agent has over some the occurrence or not of some event. Savage’s Theorem studies decision-making in this context; this is, if a rational decision-maker is uncertain about the probabilities of obtaining certain outcomes and does not have a precise, objective quantification of her preferences (utility function), then it is Savage’s Theorem (Savage,, 1954), which gives a formal decision criterion.
Savage’s result extends von Neumann-Morgenstern Theorem since it considers the case in which a rational decision maker knows neither her utility function nor the probabilities to be used in order to obtain the expected values required for making choices according to the von Neumann-Morgenstern Theorem (Gilboa,, 2009).
2.3.1 General Expected Utility Principle (Savage)
We now state the general Expected Utility Principle as found in (Bernardo and Smith,, 1994). Further details can be found in (Savage,, 1954; Anscombe et al.,, 1963; Kreps,, 1988; Schervish,, 1995; Robert,, 2007; Gilboa,, 2009).
Theorem 2.6** (Expected Utility Principle, Savage, (1954); Bernardo and Smith, (1994)).**
In a finite, bounded Decision Problem under Uncertainty , the preference relation satisfies the coherence rationality axioms if and only if there exists: A probability measure , called a subjective probability, that associates with each uncertain event a real number and a utility function such that it associates each consequence with a real number . Such that for and actions in , and any
[TABLE]
This theorem states that if a rational decision maker does not know the precise probabilities of outcomes given that a certain action has been taken, then she must choose as if having in mind a probability assignment to the uncertainties in her environment and use such probabilities to calculate the expected utility with respect to a subjective utility function that represents her preferences. This result also gives a precise definition of subjective probability as a quantification of uncertainty which is used to make good decisions (Gilboa,, 2009). See (Hens,, 1992; Bernardo and Smith,, 1994), and (Gilboa,, 2009) for further details. See (Ellsberg,, 1961; Tversky,, 1975; Tversky and Kahneman,, 1989; Binmore,, 2008; Gilboa et al.,, 2009) and (Gilboa et al.,, 2012) for critiques of the coherence axioms.
2.4 Origins of outcome-action probabilities
The previous theorems, both vNM and Savage’s, have the limitation of being stated only in terms of associative information (probabilities), which leaves open the question of the origin of such probabilities. We know that vNM provides utilities given probabilities, and Savage provides both utilities and probabilities given the coherence axioms, but what do such probabilities mean and what is their origin?
Interpreting the action-outcome conditional probabilities, either as causal or observational, is exactly the issue at the heart of the debate between causal and evidential decision theorists, see Joyce, (1999). It is recognized in the literature that the source of these conditional probabilities should be explained (Peterson,, 2017), but there is substantial debate about their source, see for example Binmore, (2008); Gilboa, (2009); Eells, (1982). We need a further justification to show that the right action-outcome conditional probabilities can be grounded in causal models. The objective of this paper is to provide that basis. In the next section we state some basic facts on causality and why is different from for some value .
3 Causation
The concept of Causality deals with regularities found in a given environment (context) that are stronger than probabilistic (or associative) relations in the sense that a causal relation allows for evaluating a change in the consequence given a change in the cause. This is known as an intervention, which is different from observing and consists of a change in the joint distribution of the variables, which is performed by forcing the value of some variable to a specific value. Causal reasoning is able to deal with changes in the data-generating distributions, while observational reasoning does not. In a non-causal world, patients would avoid going to the doctor in order to avoid being sick (Spirtes et al.,, 2000; Pearl,, 2009; Koller and Friedman,, 2009; Pearl and Mackenzie,, 2018).
In this work, the manipulationist interpretation of Causality is adopted (Woodward,, 2003). The main paradigm is clearly expressed by Campbell and Cook, as manipulation of a cause will result in a manipulation of the effect (Campbell and Cook,, 1979). Consider the following example from Woodward,: manually forcing a barometer to go down won’t cause a storm, whereas the occurrence of a storm will cause the barometer to go down (Woodward,, 2003).
Here, we take the formal definition of Probabilistic Causality given by Spirtes et al., as a working definition for the notion of Causation. Similar descriptions of the manipulationist approach were described by Holland, (1986). Causal inference tools, such as Pearl’s do-calculus allow finding the effect of an intervention in terms of probabilistic information when certain conditions are met (Pearl,, 2009).
3.1 A definition of Causality
Causality is a stochastic relation between events within a probability space; this is, some event (or events) causes another event to occur, (Spirtes et al.,, 2000).
Definition 3.1**.**
Let be a probability space, and consider a binary relation which is: Transitive: If and for any then . Irreflexive: For all it doesn’t hold that . Antisymmetric: For such that if then it doesn’t hold that .
We consider an extra pair of conditions. The first one, known as Causal Sufficiency, is about the nature of the model: for any variables in the model , there are no common causes of outside of the model (Spirtes et al.,, 2000; Pearl,, 2009; Sucar,, 2015).111Alternately, if noise terms are specified in to capture unmeasured causes, then Causal Sufficiency can be written as “there are no unmodeled causes of .” Common causes that are uncaused by other factors in are the usual emphasis with Causal Sufficiency, but it also excludes some unobserved intermediate events (e.g., if and , then must be observed). Causal Sufficiency thus implies that the causal connections in do not share unmodeled mechanisms (Spirtes et al.,, 2000).
The second required condition is that an intervention on a particular variable or event will change the value of (and so can “break” the causal influences on ), but do not otherwise affect the causal mechanisms in . This assumption, called Invariance by Woodward, (2003), more specifically implies that still has the same effects as before, even though the joint distribution of the variables is reconfigured (Woodward,, 2003).
These conditions are required in an axiomatic fashion, so we do not discuss them further here.
3.2 Representation into a Directed Acyclic Graph
The causal relations, defined between events, contained in can be summarized in a graph in the following way: If then the graph must contain a node representing , a node representing and a directed edge connecting the respective nodes in the direction of the causal relation.
Notice that since the graph is finite, there exist some nodes that do not have causes, which are called exogenous. If an event is caused by some other event, then we say it is endogenous and we denote the set of its causes as . It is proven by Kiiveri et al., that at least one exogenous node exists in a causal graph (Kiiveri et al.,, 1984).
3.3 Causal Graphical Models
A Causal Graphical Model (CGM) consists of a set of random variables , and a Directed Acyclic Graph (DAG) whose nodes are in correspondence with the variables in and whose edges represent relations of cause-effect in the sense that their realizations correspond to the events encoded by the causal relation(Koller and Friedman,, 2009; Sucar,, 2015). Also, the model is enriched with an operator called which is a functional defined over graphs, and whose action is described as follows: given and an element of the set of all possible values of the variables belonging to , the action corresponds to assigning to each the value and to delete every incoming edge into the node corresponding to each in the graph . In the context of CGMs, to apply the operator over a variable (or set of variables) is called as an intervention over the variable. It is this interventional operator which separates associative models from causal models (Pearl,, 2009; Koller and Friedman,, 2009; Sucar,, 2015).
It is required that the probability distribution that results from an intervention over a variable is Markov compatible with the graph; this is, the resulting interventional distribution is equivalent to the product of the conditional probability of every variable given its parents in the intervened graph (Sucar,, 2015).
3.4 Causal Environments and Causal Decision Problems
We consider decision-making using causal information. In this section we define a Causal Environment to be an uncertain environment, as defined in Section 2.1, with the extra condition that there exists a CGM which controls it in the following sense: there exists a causal relation between available actions and consequences in the sense that any chosen action will stochastically cause a consequence. The role of the CGM is to encode all of the causal relations present in the environment, not only between actions and consequences but also between any other variables in the environment.
Definition 3.2**.**
*A Causal Environment is a tuple where
is an uncertain environment and is a CGM such that the set of uncertain events correspond to the different realizations of the variables in as well as the possible ways that the variables are related one with each other.*
Definition 3.3**.**
We define a Causal Decision Problem (CDP) as a tuple , where is a Causal Environment and is a preference relation defined over actions.
For the CGM in a CDP, we distinguish two particular variables: one corresponding to the available actions, and one corresponding to the produced (caused) outcome. We are considering that only one variable can be intervened upon and that the values of such variable represent the actions available to the decision maker; i.e., the value forced upon such variable under an intervention represents the action taken by the decision maker.
The intuition behind the definition of a Causal Decision Problem is this: a decision maker chooses an action , which is automatically fed into the model , which outputs the causal outcome . The definitions of a Bounded and Finite Causal Decision Problem extend in an analogous way from the classical definition.
4 Causal Decision Theory
Now that we have addressed basic notions of Classical Decision Making and Causation, we move on to Causal Decision Theory, where actions and outcomes are causally related. In this Section, we will recall some related and previous work done in Causal Decision Theory, with particular emphasis on decision-making results by Judea Pearl, in which causal information is assumed to be known to a decision-maker. Then, we further generalize this result to the case of unknown causal information.
4.1 Related Work
According to Joyce,’s formulation of Causal Decision Theory, a decision maker must choose whatever action is more likely to (causally) produce the desired outcome while keeping any beliefs about causal relations fixed (Joyce,, 1999). This is resumed in Stalnaker’s equation (Stalnaker,, 1968):
[TABLE]
where is to be read as if the decision maker does then would be the case (Gibbard and Harper,, 1978; Kleinberg,, 2013). Lewis’ and Joyce’s work captured the intuition that causal relations may be used to control the environment and to predict what is caused by the actions of a decision-maker. We will refine the operator by an explicit way of calculating the probability of causing an outcome by doing a certain action in terms of Pearl’s do-calculus. Heckerman and Shachter, provides a framework for defining the notions of cause and effect in terms of decision theoretical concepts and gives a theoretical basis for a graphical description of causes and effects, such as the causal influence diagrams introduced by Dawid, (Dawid,, 2002). Heckerman and Shachter, gave an elegant definition of causality but did not address how to actually make choices using causal information (Heckerman and Shachter,, 1995). Dawid, presents a decision-theoretic approach to causal inference in which a decision maker must take into account how alternatives compare against each other in terms of the average causal effect, the such approach uses the well-known influence diagrams (Dawid,, 2002, 2003) in order to derive formulas that allow an explicit calculation of the average causal effect (Dawid,, 2012). Influence diagrams have the ability to express both intervention variables and chance variables in a single graphical structure. An optimality criterion for sequential interventions is obtained by Dawid and Didelez, by maximizing the expectation of outcomes (Dawid and Didelez,, 2008).
On the other hand, J. Pearl proposes the following criterion for decision-making: Consider a rational decision-maker who faces a causal environment in which she knows the causal model controlling the relation between her actions and outcomes. She can use the known causal model in order to find the probabilities of causing the desired outcome given she takes a certain action. The following theorem is found in Section 4.1 of Pearl, (2009), but the intuitions that lie behind can be traced back to Lewis, (1981) and Joyce, (1999).
Theorem 4.1** (Pearl, (2009), Section 4.1).**
Let be a Causal Graphical Model, and its associated distribution . Let be a set of consequences of interest for a decision-maker. If the decision maker faces a causal environment and if the causal graphical model is known, then the preference relation satisfies the von Neumann-Morgenstern rationality axioms if and only if:
[TABLE]
Equivalently, the action that must be chosen is
[TABLE]
Further related work Lewis, (1981); Heckerman and Shachter, (1995); Joyce, (1999); Dawid, (2002, 2012); Eells, (1982); Stern, (2017)
4.2 Main Result
We now consider the case in which a rational decision maker does not know the causal model which controls her environment. We must add a new axiom to the rationality axioms in order to keep simple our proof: Choosing within a Causal Decision Problem corresponds to intervening a variable of the true causal model which controls the environment.
Using Pearl’s result, we saw how a rational decision maker can use a known causal model in order to make a choice. Now, since the decision maker does not know the causal model, we argue that she must use beliefs about which causal relations hold in her environment, and use them in order to make a choice. In this case, in which a Causal Graphical Model controls the relation between actions and outcomes, any subjective information about the environment must consider causal structures. For this reason, we assert that the probability distribution that the decision maker has in mind is in fact a distribution over causal structures, where the decision maker uses each structure as if it were the true one in order to choose the best action within each structure by using Theorem 4.1. We assume a finite set of actions and outcomes.
Theorem 4.2** (Main Result).**
*In a finite, bounded, Causal Decision Problem
, where is a Causal Graphical Model, we have that the preferences of a decision maker are rational if and only if there exists a utility function, a probability distribution over a non-empty family of causal graphical models such that for each :*
[TABLE]
where is the probability distribution associated with the causal structure .
Proof.
Following Bernardo and Smith, proof of the Expected Utility Principle (Proposition 2.22, p. 52, Bernardo and Smith, (1994)), let be an option.
The existence of a probability , associated with uncertain events , is given by Axiom 5(2). Also, the utility , is the real number , associated with any standard event , such that
[TABLE]
From the coherence Axioms 4(3) and 5(2), and Proposition 2.13 in Bernardo and Smith, (1994), there exists events and , such that
[TABLE]
Then, by Proposition 2.10 in Bernardo and Smith, (1994), , where .
Now, for any other option and ,
[TABLE]
According to Definitions 3.2 and 3.3, and the new axiom added, we can split each event into , where
[TABLE]
where , are the variables of the CGM that controls the environment ; and , specifies a graphical structure over the set of variables ; and is a pair of indices which indicates which variable has been intervened in order to affect which other variable.
We can define as
[TABLE]
Then, it is shown in Bernardo and Smith, (1994) that events with this structure hold that:
[TABLE]
And, we can express as:
[TABLE]
This means that,
[TABLE]
We now use the fact that , to see that
[TABLE]
Where the last term , is supported within a family of causal models. Now, this last expression can be further expanded as:
[TABLE]
Therefore,
[TABLE]
And the result follows, since ∎
4.3 Interpretation
Theorem 4.2 asserts that a rational decision maker who faces a Causal Decision Problem with unknown causal information must use a probability distribution over a family of causal structures, and, within each structure, , use the term in order to find the probability of obtaining a certain consequence given that the intervention is performed; in this way, the optimal action is given by:
[TABLE]
We note that is obtained by taking into account the utility obtained by every possible consequence weighted using both the probability of causing such action within a specific causal model and the probability that the decision-maker assign to such .
We are considering a normative interpretation for Theorem 4.2 according to which a decision maker must use any causal information in order to obtain the best possible action. Such action must be obtained by considering the beliefs of the decision maker about the causal relations that hold in her environment (the distribution ), how such relations could produce the best action when considered as if they were true (distribution ), and the satisfaction (utility ) produced by the consequences of actions.
It is known that humans tend to ignore pure probabilistic information over causal information (Tversky and Kahneman,, 1980), and are in fact able to learn, and use, causal models in sequential decision-making processes (Lagnado et al.,, 2007; Sloman and Hagmayer,, 2006; Nichols and Danks,, 2007; Meder et al.,, 2010; Hagmayer and Meder,, 2013; Wellen and Danks,, 2012), even though such learning is not perfect (Rottman and Hastie,, 2014). Therefore, this theorem provides the basis for a much stronger, and computationally implementable, framework for decision-making in which causal information is used over associative information, even though complete causal information may not be available to the decision-maker.
5 Application: Causal Games and Nash Equilibrium
In this section we study and develop an application of the previous result in the domain of Game Theory: we consider a strategic game between rational players who are situated in a causal environment. A game is a model of a situation in which several players must take an action and afterward they will be affected both by the outcome of their own action as well as the actions of the other players (Osborne and Rubinstein,, 1994).
In a strategic game, it is assumed that no player knows the action taken by any other players; we also assume that the causal mechanism, which is represented by a Causal Graphical Model , remains fixed and it is unknown for each player. In this game, players ignore the actions taken by any other player, and since the causal model which controls the environment is unknown to every player, the players also ignore the information that players will use in order to take their respective actions: strategic games of this type are called Bayesian Games, introduced by Harsanyi (Harsanyi,, 1967; Harsanyi, 1968b, ; Harsanyi, 1968a, ).
With this contribution, we expect to show that standard notions of game theory such as Nash Equilibrium can be extended to the case in which causal information is considered over associative information. Therefore, provide motivation to further extend classical results to use causal information as a basis.
In the games we will consider, the uncertainty of every player consists of two levels: on the first level, the true causal model ; on the second level, what an action causes if a certain Causal Graphical Model is considered to be the causal model.
Definition 5.1**.**
A Bayesian strategic game (Osborne and Rubinstein,, 1994) consists of: A finite set of players. A finite set of states of nature. For each player, a nonempty set of actions. For each player, a finite set and a function is the signal function of the player. For each player, a probability measure over such that for all . A preference relation defined over the set of probability measures over where
We consider to be a family of admissible causal models; in this way, being the true state of Nature fixes a causal model which controls the environment in which the players make their choices. In classical Bayesian games, once is realized as the true state, then each player receives a signal and the posterior belief given by if . In the case of causal Bayesian games, we must consider both the probability of being the true state as well as the probability of observing a certain consequence when doing some action if is the true model. Following Osborne and Rubinstein, (1994), we define a new game in which its players are all of the possible combinations , where the possible actions for is . Osborne and Rubinstein, (1994) show that for a fixed player , the posterior probability induces a lottery over the pairs (a^{\ast}(j,\tau_{j}(\omega)))_{j},\omega)\textrm{ for some other j\in N.} This lottery assigns to the probability if . The classical Bayesian game’s Nash Equilibrium is the Nash equilibrium of (Osborne and Rubinstein,, 1994). Now, we consider the second level of uncertainty: the consequences caused by some action through a causal model . We notice that the posterior probability itself induces a probability distribution defined over actions for each player once a desired consequence is fixed, this distribution, according to Theorem 4.2 is given by This motivates the following definition of a Causal Nash equilibrium.
5.1 Causal Nash Equilibrium
For each player in the strategic game, we define the following probability distribution over consequences:
[TABLE]
where is the probability of causing a certain consequence within a causal structure , and is the player’s posterior beliefs about the causal structure that controls the environment, and is the well-known intervention operator from (Pearl,, 2009). We now define:
[TABLE]
Notice that evaluates an action profile in terms of: The knowledge each player has about the causal structure represented by , which allows each player to evaluate the probability of causing outcomes in terms of actions by using the operator, as well as the observed actions that are taken by the other players, given by , and the preferences of each player . Using this new function, we define the equilibrium for a strategic game with causal information and Bayesian players as:
Definition 5.2**.**
A Nash equilibrium for this causal strategic game is an action profile if and only if
[TABLE]
This is, an action profile is a Nash equilibrium if and only if each player uses her current knowledge about the causal structure of the environment in order to (causally) produce the best possible outcome given the actions taken by the other players. The existence of the Causal Nash Equilibrium is guaranteed if every is a nonempty compact convex set in some and if the preference relation induced by is continuous and quasi-concave as proved by Osborne and Rubinstein, (1994).
6 Limitations
We are working within the classical rationality assumption. Rationality can be ultimately thought of as a consistent or coherent way of making choices, but the precise definition has been a subject of debate. See Ellsberg, (1961); Gilboa, (2009) and Machina and Siniscalchi, (2014) for critiques of the Savage Rationality Axioms. We have favored causal graphical models over other alternatives since it has been argued that several cognitive processes, such as causal reasoning, can be best represented as graphical models (Danks,, 2014; Sloman and Lagnado,, 2015; Hagmayer,, 2016).
7 Summary
We have defined a Causal Decision Problem in terms of a classical Decision Problem under Uncertainty provided by a causal mechanism that mediates between actions and outcomes; this causal mechanism is assumed to be represented as a causal graphical model. In the case in which a rational decision maker knows such causal relations, we have seen that Pearl, has provided a decision-making result Pearl, (2009).
On the other hand, when a decision maker does not know the causal mechanism, in Theorem 4.2 we have provided preference representation result for causal decision making; our result explicitly states how a rational decision maker should use subjective beliefs, encoded as a probability distribution over causal models, as well as the causal inference machinery within the considered causal structures in order to find an optimal action.
With these two cases being covered, we have shown how causal models can provide the action-outcome probabilities for decision-making.
As an application, by using Theorem 4.2 and taking as a basis Harsanyi’s model of a Bayesian Game in which every player has incomplete information about both the actions taken by other players as well as the information that made each player take his action, we have been able to provide a definition of a Causal Nash Equilibrium (Equation 7) in which every player is aware that there exists a causal mechanism that will produce some consequence once he takes an action.
Finally, by extending a known result from Pearl to the case of unknown causal information, we have shown how action-outcome probabilities for decision-making can be grounded in causal models.
8 Acknowledgements
MGS thanks Dr. Gerardo Gonzalez Robert for his thoughts and comments.
Appendix A Coherence axioms (Bernardo and Smith,, 1994)
- •
Axiom 1: Comparability of consequences and dichotomized options:
There exists consequences such that . 2. 2.
For any consequences and any events , then either or
- •
Axiom 2: Transitivity of preferences:
. 2. 2.
, , .
- •
Axiom 3: Consistency of preferences:
If , then for any event more likely than , . 2. 2.
If and , then .
- •
Axiom 4: Existence of standard events: There exists a sub-algebra of and a function such that
if and only if 2. 2.
Disjoint and imply that 3. 3.
For any , and events , there exists a standard event such that and and 4. 4.
For and , independent standard events, we have that 5. 5.
If , , and , then .
- •
Axiom 5: Precise measurement:
If , there exists a standard event such that . 2. 2.
For each event , there exists a standard event such that .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Anscombe et al., (1963) Anscombe, F. J., Aumann, R. J., et al. (1963). A definition of subjective probability. Annals of mathematical statistics , 34(1):199–205.
- 2Berkovitz, (2012) Berkovitz, J. (2012). The world according to de finetti: On de finetti’s theory of probability and its application to quantum mechanics. In Probability in physics , pages 249–280. Springer.
- 3Bernardo and Smith, (1994) Bernardo, J. M. and Smith, A. F. M. (1994). Bayesian theory. Wiley Series in Probability and Statistics.
- 4Bernoulli, (1954) Bernoulli, D. (1738/1954). Exposition of a new theory on the measurement of risk. Econometrica , 22(1):23–36.
- 5Binmore, (2008) Binmore, K. (2008). Rational Decisions . The Gorman Lectures in Economics. Princeton University Press.
- 6Campbell and Cook, (1979) Campbell, D. T. and Cook, T. D. (1979). Quasi-experimentation: Design & analysis issues for field settings . Rand Mc Nally College Publishing Company Chicago.
- 7Danks, (2014) Danks, D. (2014). Unifying the mind: Cognitive representations as graphical models . MIT Press.
- 8Dawid, (2002) Dawid, P. (2002). Influence diagrams for causal modelling and inference. International Statistical Review , 70(2):161–189.
